Network


Latest external collaboration on country level. Dive into details by clicking on the dots.

Hotspot


Dive into the research topics where Gregory W. Gundersen is active.

Publication


Featured researches published by Gregory W. Gundersen.


Nucleic Acids Research | 2016

Enrichr: a comprehensive gene set enrichment analysis web server 2016 update

Maxim V. Kuleshov; Matthew R. Jones; Andrew D. Rouillard; Nicolas F. Fernandez; Qiaonan Duan; Zichen Wang; Simon Koplev; Sherry L. Jenkins; Kathleen M. Jagodnik; Alexander Lachmann; Michael G. McDermott; Caroline D. Monteiro; Gregory W. Gundersen; Avi Ma'ayan

Enrichment analysis is a popular method for analyzing gene sets generated by genome-wide experiments. Here we present a significant update to one of the tools in this domain called Enrichr. Enrichr currently contains a large collection of diverse gene set libraries available for analysis and download. In total, Enrichr currently contains 180 184 annotated gene sets from 102 gene set libraries. New features have been added to Enrichr including the ability to submit fuzzy sets, upload BED files, improved application programming interface and visualization of the results as clustergrams. Overall, Enrichr is a comprehensive resource for curated gene sets and a search engine that accumulates biological knowledge for further biological discoveries. Enrichr is freely available at: http://amp.pharm.mssm.edu/Enrichr.


Database | 2016

The harmonizome: a collection of processed datasets gathered to serve and mine knowledge about genes and proteins

Andrew D. Rouillard; Gregory W. Gundersen; Nicolas F. Fernandez; Zichen Wang; Caroline D. Monteiro; Michael G. McDermott; Avi Ma’ayan

Genomics, epigenomics, transcriptomics, proteomics and metabolomics efforts rapidly generate a plethora of data on the activity and levels of biomolecules within mammalian cells. At the same time, curation projects that organize knowledge from the biomedical literature into online databases are expanding. Hence, there is a wealth of information about genes, proteins and their associations, with an urgent need for data integration to achieve better knowledge extraction and data reuse. For this purpose, we developed the Harmonizome: a collection of processed datasets gathered to serve and mine knowledge about genes and proteins from over 70 major online resources. We extracted, abstracted and organized data into ∼72 million functional associations between genes/proteins and their attributes. Such attributes could be physical relationships with other biomolecules, expression in cell lines and tissues, genetic associations with knockout mouse or human phenotypes, or changes in expression after drug treatment. We stored these associations in a relational database along with rich metadata for the genes/proteins, their attributes and the original resources. The freely available Harmonizome web portal provides a graphical user interface, a web service and a mobile app for querying, browsing and downloading all of the collected data. To demonstrate the utility of the Harmonizome, we computed and visualized gene–gene and attribute–attribute similarity networks, and through unsupervised clustering, identified many unexpected relationships by combining pairs of datasets such as the association between kinase perturbations and disease signatures. We also applied supervised machine learning methods to predict novel substrates for kinases, endogenous ligands for G-protein coupled receptors, mouse phenotypes for knockout genes, and classified unannotated transmembrane proteins for likelihood of being ion channels. The Harmonizome is a comprehensive resource of knowledge about genes and proteins, and as such, it enables researchers to discover novel relationships between biological entities, as well as form novel data-driven hypotheses for experimental validation. Database URL: http://amp.pharm.mssm.edu/Harmonizome.


Nature Communications | 2016

Extraction and analysis of signatures from the Gene Expression Omnibus by the crowd.

Zichen Wang; Caroline D. Monteiro; Kathleen M. Jagodnik; Nicolas F. Fernandez; Gregory W. Gundersen; Andrew D. Rouillard; Sherry L. Jenkins; Axel S Feldmann; Kevin Hu; Michael G. McDermott; Qiaonan Duan; Neil R. Clark; Matthew R. Jones; Yan Kou; Troy Goff; Holly Woodland; Fabio M R. Amaral; Gregory L. Szeto; Oliver Fuchs; Sophia Miryam Schüssler-Fiorenza Rose; Shvetank Sharma; Uwe Schwartz; Xabier Bengoetxea Bausela; Maciej Szymkiewicz; Vasileios Maroulis; Anton Salykin; Carolina M. Barra; Candice D. Kruth; Nicholas J. Bongio; Vaibhav Mathur

Gene expression data are accumulating exponentially in public repositories. Reanalysis and integration of themed collections from these studies may provide new insights, but requires further human curation. Here we report a crowdsourcing project to annotate and reanalyse a large number of gene expression profiles from Gene Expression Omnibus (GEO). Through a massive open online course on Coursera, over 70 participants from over 25 countries identify and annotate 2,460 single-gene perturbation signatures, 839 disease versus normal signatures, and 906 drug perturbation signatures. All these signatures are unique and are manually validated for quality. Global analysis of these signatures confirms known associations and identifies novel associations between genes, diseases and drugs. The manually curated signatures are used as a training set to develop classifiers for extracting similar signatures from the entire GEO repository. We develop a web portal to serve these signatures for query, download and visualization.


Bioinformatics | 2015

GEO2Enrichr: browser extension and server app to extract gene sets from GEO and analyze them for biological functions

Gregory W. Gundersen; Matthew R. Jones; Andrew D. Rouillard; Yan Kou; Caroline D. Monteiro; Axel S Feldmann; Kevin Hu; Avi Ma’ayan

MOTIVATION Identification of differentially expressed genes is an important step in extracting knowledge from gene expression profiling studies. The raw expression data from microarray and other high-throughput technologies is deposited into the Gene Expression Omnibus (GEO) and served as Simple Omnibus Format in Text (SOFT) files. However, to extract and analyze differentially expressed genes from GEO requires significant computational skills. RESULTS Here we introduce GEO2Enrichr, a browser extension for extracting differentially expressed gene sets from GEO and analyzing those sets with Enrichr, an independent gene set enrichment analysis tool containing over 70 000 annotated gene sets organized into 75 gene-set libraries. GEO2Enrichr adds JavaScript code to GEO web-pages; this code scrapes user selected accession numbers and metadata, and then, with one click, users can submit this information to a web-server application that downloads the SOFT files, parses, cleans and normalizes the data, identifies the differentially expressed genes, and then pipes the resulting gene lists to Enrichr for downstream functional analysis. GEO2Enrichr opens a new avenue for adding functionality to major bioinformatics resources such GEO by integrating tools and resources without the need for a plug-in architecture. Importantly, GEO2Enrichr helps researchers to quickly explore hypotheses with little technical overhead, lowering the barrier of entry for biologists by automating data processing steps needed for knowledge extraction from the major repository GEO. AVAILABILITY AND IMPLEMENTATION GEO2Enrichr is an open source tool, freely available for installation as browser extensions at the Chrome Web Store and FireFox Add-ons. Documentation and a browser independent web application can be found at http://amp.pharm.mssm.edu/g2e/. CONTACT [email protected].


Scientific Data | 2017

Clustergrammer, a web-based heatmap visualization and analysis tool for high-dimensional biological data

Nicolas F. Fernandez; Gregory W. Gundersen; Adeeb Rahman; Mark L. Grimes; Klarisa Rikova; Peter Hornbeck; Avi Ma’ayan

Most tools developed to visualize hierarchically clustered heatmaps generate static images. Clustergrammer is a web-based visualization tool with interactive features such as: zooming, panning, filtering, reordering, sharing, performing enrichment analysis, and providing dynamic gene annotations. Clustergrammer can be used to generate shareable interactive visualizations by uploading a data table to a web-site, or by embedding Clustergrammer in Jupyter Notebooks. The Clustergrammer core libraries can also be used as a toolkit by developers to generate visualizations within their own applications. Clustergrammer is demonstrated using gene expression data from the cancer cell line encyclopedia (CCLE), original post-translational modification data collected from lung cancer cells lines by a mass spectrometry approach, and original cytometry by time of flight (CyTOF) single-cell proteomics data from blood. Clustergrammer enables producing interactive web based visualizations for the analysis of diverse biological data.


BMC Bioinformatics | 2016

GEN3VA: aggregation and analysis of gene expression signatures from related studies

Gregory W. Gundersen; Kathleen M. Jagodnik; Holly Woodland; Nicholas F. Fernandez; Kevin Sani; Anders B. Dohlman; Peter M. U. Ung; Caroline D. Monteiro; Avner Schlessinger; Avi Ma’ayan

BackgroundGenome-wide gene expression profiling of mammalian cells is becoming a staple of many published biomedical and biological research studies. Such data is deposited into data repositories such as the Gene Expression Omnibus (GEO) for potential reuse. However, these repositories currently do not provide simple interfaces to systematically analyze collections of related studies.ResultsHere we present GENE Expression and Enrichment Vector Analyzer (GEN3VA), a web-based system that enables the integrative analysis of aggregated collections of tagged gene expression signatures identified and extracted from GEO. Each tagged collection of signatures is presented in a report that consists of heatmaps of the differentially expressed genes; principal component analysis of all signatures; enrichment analysis with several gene set libraries across all signatures, which we term enrichment vector analysis; and global mapping of small molecules that are predicted to reverse or mimic each signature in the aggregate. We demonstrate how GEN3VA can be used to identify common molecular mechanisms of aging by analyzing tagged signatures from 244 studies that compared young vs. old tissues in mammalian systems. In a second case study, we collected 86 signatures from treatment of human cells with dexamethasone, a glucocorticoid receptor (GR) agonist. Our analysis confirms consensus GR target genes and predicts potential drug mimickers.ConclusionsGEN3VA can be used to identify, aggregate, and analyze themed collections of gene expression signatures from diverse but related studies. Such integrative analyses can be used to address concerns about data reproducibility, confirm results across labs, and discover new collective knowledge by data reuse. GEN3VA is an open-source web-based system that is freely available at: http://amp.pharm.mssm.edu/gen3va.


Nucleic Acids Research | 2018

eXpression2Kinases (X2K) Web: linking expression signatures to upstream cell signaling networks

Daniel J B Clarke; Maxim V. Kuleshov; Brian M Schilder; Denis Torre; Mary E Duffy; Alexandra B. Keenan; Alexander Lachmann; Axel S Feldmann; Gregory W. Gundersen; Moshe C. Silverstein; Zichen Wang; Avi Ma’ayan

Abstract While gene expression data at the mRNA level can be globally and accurately measured, profiling the activity of cell signaling pathways is currently much more difficult. eXpression2Kinases (X2K) computationally predicts involvement of upstream cell signaling pathways, given a signature of differentially expressed genes. X2K first computes enrichment for transcription factors likely to regulate the expression of the differentially expressed genes. The next step of X2K connects these enriched transcription factors through known protein–protein interactions (PPIs) to construct a subnetwork. The final step performs kinase enrichment analysis on the members of the subnetwork. X2K Web is a new implementation of the original eXpression2Kinases algorithm with important enhancements. X2K Web includes many new transcription factor and kinase libraries, and PPI networks. For demonstration, thousands of gene expression signatures induced by kinase inhibitors, applied to six breast cancer cell lines, are provided for fetching directly into X2K Web. The results are displayed as interactive downloadable vector graphic network images and bar graphs. Benchmarking various settings via random permutations enabled the identification of an optimal set of parameters to be used as the default settings in X2K Web. X2K Web is freely available from http://X2K.cloud.


Cancer Research | 2015

Abstract B1-28: GEO2Enrichr: A Google Chrome extension to extract gene sets from the Gene Expression Omnibus and analyze these lists for common biological functions

Gregory W. Gundersen; Matthew R. Jones; Avi Ma'ayan

Background: Identification of differentially expressed genes from gene expression profiling studies is a necessary and important step in the analysis of microarray and RNA-seq data. Such data is deposited into the Gene Expression Omnibus (GEO). However, currently, to extract and analyze differentially expressed genes from GEO requires significant computational expertise. Methodology: GEO2Enrichr has a front-end browser extension and back-end server application. The back-end is written in Python and uses the Flask framework for the web application server. The server exposes three API endpoints to: (1) query GEO; (2) identify the differentially expressed genes; and (3) query Enrichr. The API endpoints are chained into a useful data pipeline. The front-end is a Google Chrome Extension written in JavaScript and CSS and programmatically inserts HTML elements into the native GEO interface. The Chrome web browser loads the extension whenever the user navigates to an NCBI GEO webpage. GEO2Enrichr unobtrusively inserts itself onto the page as a single button and checkboxes for easy sample selection. The button opens a modal box that allows users to edit their selected data, choose settings for differential expression identification, insert metadata about the experiments, download their resultant gene lists, and submit their gene lists to Enrichr for further analysis. Differential expression analysis has five options: Characteristic Direction 1 , T-test, limma, SAM, and fold-change. Other options are available to control for the size of the resultant differentially expressed gene lists by setting p-value and FDR cutoffs. Results: GEO2Enrichr is a Google Chrome Extension and Python-based API that adds functionality to GEO by allowing users to pipe Simple Omnibus Format in Text (SOFT) files to a differential expression analysis tool and then pipe the differentially expressed genes for analysis to Enrichr 2 , a popular gene list enrichment analysis web application. GEO2Enrichr addresses a need for improved functionality of GEO by embedding new features into the existing GEO pages, allowing researchers to easily select samples and process them for differential expression by various methods and perform enrichment analyses. GEO2Enrichr is free and available for installation at the Chrome Web Store. Conclusions: GEO2Enrichr can facilitate the more broad reusability of the GEO resource by lowering the point of entry to biologists without computational expertise. The systematic use of GEO2Enrichr can generate a new useful searchable resource. Overall, GEO2Enrichr can lead to improved extraction of knowledge from data 3 . [1] http://maayanlab.net/CD [2] http://amp.pharm.mssm.edu/Enrichr [3] http://bd2k.nih.gov Citation Format: Gregory W. Gundersen, Matthew R. Jones, Avi Ma9ayan. GEO2Enrichr: A Google Chrome extension to extract gene sets from the Gene Expression Omnibus and analyze these lists for common biological functions. [abstract]. In: Proceedings of the AACR Special Conference on Computational and Systems Biology of Cancer; Feb 8-11 2015; San Francisco, CA. Philadelphia (PA): AACR; Cancer Res 2015;75(22 Suppl 2):Abstract nr B1-28.


F1000Research | 2016

Extraction and analysis of signatures from the Gene Expression Omnibus by the crowd

Zichen Wang; Caroline D. Monteiro; Kathleen M. Jagodnik; Nicolas F. Fernandez; Gregory W. Gundersen; Andrew D. Rouillard; Sherry L. Jenkins; Axel S Feldmann; Kevin Hu; Michael G. McDermott; Qiaonan Duan; Neil R. Clark; Matthew R. Jones; Yan Kou; Troy Goff; Coursera Nasb students; Avi Ma'ayan


F1000Research | 2016

Clustergrammer: web-based heatmap visualization and analysis tool for high-dimensional biological data

Nicolas F. Fernandez; Gregory W. Gundersen; Qiaonan Duan; Matthew R. Jones; Andrew D. Rouillard; Michael G. McDermott; Adeeb Rahman; Avi Ma'ayan

Collaboration


Dive into the Gregory W. Gundersen's collaboration.

Top Co-Authors

Avatar

Andrew D. Rouillard

Icahn School of Medicine at Mount Sinai

View shared research outputs
Top Co-Authors

Avatar

Caroline D. Monteiro

Icahn School of Medicine at Mount Sinai

View shared research outputs
Top Co-Authors

Avatar

Matthew R. Jones

Icahn School of Medicine at Mount Sinai

View shared research outputs
Top Co-Authors

Avatar

Nicolas F. Fernandez

Icahn School of Medicine at Mount Sinai

View shared research outputs
Top Co-Authors

Avatar

Avi Ma’ayan

Icahn School of Medicine at Mount Sinai

View shared research outputs
Top Co-Authors

Avatar

Michael G. McDermott

Icahn School of Medicine at Mount Sinai

View shared research outputs
Top Co-Authors

Avatar

Zichen Wang

Icahn School of Medicine at Mount Sinai

View shared research outputs
Top Co-Authors

Avatar

Avi Ma'ayan

Icahn School of Medicine at Mount Sinai

View shared research outputs
Top Co-Authors

Avatar

Axel S Feldmann

Icahn School of Medicine at Mount Sinai

View shared research outputs
Top Co-Authors

Avatar

Kathleen M. Jagodnik

Icahn School of Medicine at Mount Sinai

View shared research outputs
Researchain Logo
Decentralizing Knowledge