Arthur Liberzon
Broad Institute
Network
Latest external collaboration on country level. Dive into details by clicking on the dots.
Publication
Featured researches published by Arthur Liberzon.
Bioinformatics | 2011
Arthur Liberzon; Aravind Subramanian; Reid Pinchback; Helga Thorvaldsdottir; Pablo Tamayo; Jill P. Mesirov
MOTIVATION Well-annotated gene sets representing the universe of the biological processes are critical for meaningful and insightful interpretation of large-scale genomic data. The Molecular Signatures Database (MSigDB) is one of the most widely used repositories of such sets. RESULTS We report the availability of a new version of the database, MSigDB 3.0, with over 6700 gene sets, a complete revision of the collection of canonical pathways and experimental signatures from publications, enhanced annotations and upgrades to the web site. AVAILABILITY AND IMPLEMENTATION MSigDB is freely available for non-commercial use at http://www.broadinstitute.org/msigdb.
Gastroenterology | 2011
Augusto Villanueva; Yujin Hoshida; Carlo Battiston; Victoria Tovar; Daniela Sia; Clara Alsinet; Helena Cornella; Arthur Liberzon; Masahiro Kobayashi; Swan N. Thung; Jordi Bruix; Philippa Newell; Craig April; Jian Bing Fan; Sasan Roayaie; Vincenzo Mazzaferro; Myron Schwartz; Josep M. Llovet
BACKGROUND & AIMS In approximately 70% of patients with hepatocellular carcinoma (HCC) treated by resection or ablation, disease recurs within 5 years. Although gene expression signatures have been associated with outcome, there is no method to predict recurrence based on combined clinical, pathology, and genomic data (from tumor and cirrhotic tissue). We evaluated gene expression signatures associated with outcome in a large cohort of patients with early stage (Barcelona-Clinic Liver Cancer 0/A), single-nodule HCC and heterogeneity of signatures within tumor tissues. METHODS We assessed 287 HCC patients undergoing resection and tested genome-wide expression platforms using tumor (n = 287) and adjacent nontumor, cirrhotic tissue (n = 226). We evaluated gene expression signatures with reported prognostic ability generated from tumor or cirrhotic tissue in 18 and 4 reports, respectively. In 15 additional patients, we profiled samples from the center and periphery of the tumor, to determine stability of signatures. Data analysis included Cox modeling and random survival forests to identify independent predictors of tumor recurrence. RESULTS Gene expression signatures that were associated with aggressive HCC were clustered, as well as those associated with tumors of progenitor cell origin and those from nontumor, adjacent, cirrhotic tissues. On multivariate analysis, the tumor-associated signature G3-proliferation (hazard ratio [HR], 1.75; P = .003) and an adjacent poor-survival signature (HR, 1.74; P = .004) were independent predictors of HCC recurrence, along with satellites (HR, 1.66; P = .04). Samples from different sites in the same tumor nodule were reproducibly classified. CONCLUSIONS We developed a composite prognostic model for HCC recurrence, based on gene expression patterns in tumor and adjacent tissues. These signatures predict early and overall recurrence in patients with HCC, and complement findings from clinical and pathology analyses.
Current protocols in human genetics | 2008
Heidi Kuehn; Arthur Liberzon; Michael Reich; Jill P. Mesirov
The abundance of genomic data now available in biomedical research has stimulated the development of sophisticated statistical methods for interpreting the data, and of special visualization tools for displaying the results in a concise and meaningful manner. However, biologists often find these methods and tools difficult to understand and use correctly. GenePattern is a freely available software package that addresses this issue by providing more than 100 analysis and visualization tools for genomic research in a comprehensive user‐friendly environment for users at all levels of computational experience and sophistication. This unit demonstrates how to prepare and analyze microarray data in GenePattern. Curr. Protoc. Bioinform. 22:7.12.1–7.12.39.
Immunity | 2016
Jernej Godec; Yan Tan; Arthur Liberzon; Pablo Tamayo; Sanchita Bhattacharya; Atul J. Butte; Jill P. Mesirov; W. Nicholas Haining
Gene-expression profiling has become a mainstay in immunology, but subtle changes in gene networks related to biological processes are hard to discern when comparing various datasets. For instance, conservation of the transcriptional response to sepsis in mouse models and human disease remains controversial. To improve transcriptional analysis in immunology, we created ImmuneSigDB: a manually annotated compendium of ∼5,000 gene-sets from diverse cell states, experimental manipulations, and genetic perturbations in immunology. Analysis using ImmuneSigDB identified signatures induced in activated myeloid cells and differentiating lymphocytes that were highly conserved between humans and mice. Sepsis triggered conserved patterns of gene expression in humans and mouse models. However, we also identified species-specific biological processes in the sepsis transcriptional response: although both species upregulated phagocytosis-related genes, a mitosis signature was specific to humans. ImmuneSigDB enables granular analysis of transcriptomic data to improve biological understanding of immune processes of the human and mouse immune systems.
Statistical Methods in Medical Research | 2016
Pablo Tamayo; George Steinhardt; Arthur Liberzon; Jill P. Mesirov
Since its first publication in 2003, the Gene Set Enrichment Analysis method, based on the Kolmogorov-Smirnov statistic, has been heavily used, modified, and also questioned. Recently a simplified approach using a one-sample t-test score to assess enrichment and ignoring gene-gene correlations was proposed by Irizarry et al. 2009 as a serious contender. The argument criticizes Gene Set Enrichment Analysis’s nonparametric nature and its use of an empirical null distribution as unnecessary and hard to compute. We refute these claims by careful consideration of the assumptions of the simplified method and its results, including a comparison with Gene Set Enrichment Analysis’s on a large benchmark set of 50 datasets. Our results provide strong empirical evidence that gene–gene correlations cannot be ignored due to the significant variance inflation they produced on the enrichment scores and should be taken into account when estimating gene set enrichment significance. In addition, we discuss the challenges that the complex correlation structure and multi-modality of gene sets pose more generally for gene set enrichment methods.
Nature Biotechnology | 2016
Jong Wook Kim; Olga Botvinnik; Omar Abudayyeh; Chet Birger; Joseph Rosenbluh; Yashaswi Shrestha; M. Abazeed; Peter S. Hammerman; Daniel DiCara; David J. Konieczkowski; Cory M. Johannessen; Arthur Liberzon; Amir Reza Alizad-Rahvar; Gabriela Alexe; Andrew J. Aguirre; Mahmoud Ghandi; Heidi Greulich; Francisca Vazquez; Barbara A. Weir; Eliezer M. Van Allen; Aviad Tsherniak; Diane D. Shao; Travis I. Zack; Michael S. Noble; Gad Getz; Rameen Beroukhim; Levi A. Garraway; Masoud Ardakani; Chiara Romualdi; Gabriele Sales
Systematic efforts to sequence the cancer genome have identified large numbers of mutations and copy number alterations in human cancers. However, elucidating the functional consequences of these variants, and their interactions to drive or maintain oncogenic states, remains a challenge in cancer research. We developed REVEALER, a computational method that identifies combinations of mutually exclusive genomic alterations correlated with functional phenotypes, such as the activation or gene dependency of oncogenic pathways or sensitivity to a drug treatment. We used REVEALER to uncover complementary genomic alterations associated with the transcriptional activation of β-catenin and NRF2, MEK-inhibitor sensitivity, and KRAS dependency. REVEALER successfully identified both known and new associations, demonstrating the power of combining functional profiles with extensive characterization of genomic alterations in cancer genomes.
Bioinformatics | 2012
Liming Lai; Arthur Liberzon; Jason Hennessey; Gaixin Jiang; Jianli Qi; Jill P. Mesirov; Steven Xijin Ge
UNLABELLED Studying plants using high-throughput genomics technologies is becoming routine, but interpretation of genome-wide expression data in terms of biological pathways remains a challenge, partly due to the lack of pathway databases. To create a knowledgebase for plant pathway analysis, we collected 1683 lists of differentially expressed genes from 397 gene-expression studies, which constitute a molecular signature database of various genetic and environmental perturbations of Arabidopsis. In addition, we extracted 1909 gene sets from various sources such as Gene Ontology, KEGG, AraCyc, Plant Ontology, predicted target genes of microRNAs and transcription factors, and computational gene clusters defined by meta-analysis. With this knowledgebase, we applied Gene Set Enrichment Analysis to an expression profile of cold acclimation and identified expected functional categories and pathways. Our results suggest that the AraPath database can be used to generate specific, testable hypotheses regarding plant molecular pathways from gene expression data. AVAILABILITY http://bioinformatics.sdstate.edu/arapath/.
Cell Cycle | 2013
Meital Cohen; Manuela Vecsler; Arthur Liberzon; Meirav Noach; Eitan Zlotorynski; Amit Tzur
Different types of mature B-cell lymphocytes are overall highly similar. Nevertheless, some B cells proliferate intensively, while others rarely do. Here, we demonstrate that a simple binary classification of gene expression in proliferating vs. resting B cells can identify, with remarkable selectivity, global in vivo regulators of the mammalian cell cycle, many of which are also post-translationally regulated by the APC/C E3 ligase. Consequently, we discover a novel regulatory network between the APC/C and the E2F transcription factors and discuss its potential impact on the G1–S transition of the cell cycle. In addition, by focusing on genes whose expression inversely correlates with proliferation, we demonstrate the inherent ability of our approach to also identify in vivo regulators of cell differentiation, cell survival, and other antiproliferative processes. Relying on data sets of wt, non-transgenic animals, our approach can be applied to other cell lineages and human data sets.
Nature Methods | 2018
Taibo Li; April Kim; Joseph Rosenbluh; Heiko Horn; Liraz Greenfeld; David An; Andrew Zimmer; Arthur Liberzon; Jon Bistline; Ted Natoli; Yang Li; Aviad Tsherniak; Rajiv Narayan; Aravind Subramanian; Ted Liefeld; Bang Wong; Dawn Anne Thompson; Sarah E. Calvo; Steve Carr; Jesse S. Boehm; Jake Jaffe; Jill P. Mesirov; Nir Hacohen; Aviv Regev; Kasper Lage
Functional genomics networks are widely used to identify unexpected pathway relationships in large genomic datasets. However, it is challenging to compare the signal-to-noise ratios of different networks and to identify the optimal network with which to interpret a particular genetic dataset. We present GeNets, a platform in which users can train a machine-learning model (Quack) to carry out these comparisons and execute, store, and share analyses of genetic and RNA-sequencing datasets.The GeNets web platform can identify the most informative network, as well as execute, store and share network-based analyses of RNA-seq or genomic datasets.
bioRxiv | 2017
Taibo Li; April Kim; Johnathan Mercer; Joseph Rosenbluh; Heiko Horn; Liraz Greenfeld; David An; Andrew Zimmer; Arthur Liberzon; Jon Bistline; Ted Natoli; Yang Li; Aviad Tsherniak; Rajiv Narayan; Aravind Subramanian; Ted Liefeld; Bang Wong; Dawn Anne Thompson; Sarah E. Calvo; Steve Carr; Jesse S. Boehm; Jake Jaffe; Jill P. Mesirov; Nir Hacohen; Aviv Regev; Kasper Lage
A major bottleneck in network-based analyses of genomic data is quantitatively comparing biological signal in different networks and to identifying the optimal network dataset to answer a particular biological question. Towards these aims, we developed a unified web platform 9Broad Institute Web Platform for Genome Networks (GeNets)9, where users can compare biological signal of networks, and execute, store, and share network analyses. We designed a machine learningmachine-learning algorithm (Quack) which), which uses topological features to can quantify the overall and pathway-specific biological signals in networks, thus enabling users to choose the optimal network dataset for their analyses. We illustrated a typical workflow using GeNets to identify interesting autism candidate genes in the network that, when compared to four other networks, best recapitulates established neurodevelopmental pathway information. GeNets is a scalable, general and uniquely enabling computational framework for analyzing, managing and sharing analyses of genetic datasets using heterogeneous functional genomics networks, for example, from single-cell transcriptional analyses.