Network


Latest external collaboration on country level. Dive into details by clicking on the dots.

Hotspot


Dive into the research topics where Zhidong Tu is active.

Publication


Featured researches published by Zhidong Tu.


Nature | 2016

Proteogenomics connects somatic mutations to signalling in breast cancer

Philipp Mertins; D. R. Mani; Kelly V. Ruggles; Michael A. Gillette; Karl R. Clauser; Pei Wang; Xianlong Wang; Jana W. Qiao; Song Cao; Francesca Petralia; Emily Kawaler; Filip Mundt; Karsten Krug; Zhidong Tu; Jonathan T. Lei; Michael L. Gatza; Matthew D. Wilkerson; Charles M. Perou; Venkata Yellapantula; Kuan Lin Huang; Chenwei Lin; Michael D. McLellan; Ping Yan; Sherri R. Davies; R. Reid Townsend; Steven J. Skates; Jing Wang; Bing Zhang; Christopher R. Kinsinger; Mehdi Mesri

Summary Somatic mutations have been extensively characterized in breast cancer, but the effects of these genetic alterations on the proteomic landscape remain poorly understood. We describe quantitative mass spectrometry-based proteomic and phosphoproteomic analyses of 105 genomically annotated breast cancers of which 77 provided high-quality data. Integrated analyses allowed insights into the somatic cancer genome including the consequences of chromosomal loss, such as the 5q deletion characteristic of basal-like breast cancer. The 5q trans effects were interrogated against the Library of Integrated Network-based Cellular Signatures, thereby connecting CETN3 and SKP1 loss to elevated expression of EGFR, and SKP1 loss also to increased SRC. Global proteomic data confirmed a stromal-enriched group in addition to basal and luminal clusters and pathway analysis of the phosphoproteome identified a G Protein-coupled receptor cluster that was not readily identified at the mRNA level. Besides ERBB2, other amplicon-associated, highly phosphorylated kinases were identified, including CDK12, PAK1, PTK2, RIPK2 and TLK2. We demonstrate that proteogenomic analysis of breast cancer elucidates functional consequences of somatic mutations, narrows candidate nominations for driver genes within large deletions and amplified regions, and identifies therapeutic targets.


Bioinformatics | 2004

Mapping gene ontology to proteins based on protein--protein interaction data

Minghua Deng; Zhidong Tu; Fengzhu Sun; Ting Chen

MOTIVATION Gene Ontology (GO) consortium provides structural description of protein function that is used as a common language for gene annotation in many organisms. Large-scale techniques have generated many valuable protein-protein interaction datasets that are useful for the study of protein function. Combining both GO and protein-protein interaction data allows the prediction of function for unknown proteins. RESULT We apply a Markov random field method to the prediction of yeast protein function based on multiple protein-protein interaction datasets. We assign function to unknown proteins with a probability representing the confidence of this prediction. The functions are based on three general categories of cellular component, molecular function and biological process defined in GO. The yeast proteins are defined in the Saccharomyces Genome Database (SGD). The protein-protein interaction datasets are obtained from the Munich Information Center for Protein Sequences (MIPS), including physical interactions and genetic interactions. The efficiency of our prediction is measured by applying the leave-one-out validation procedure to a functional path matching scheme, which compares the prediction with the GO description of a proteins function from the abstract level to the detailed level along the GO structure. For biological process, the leave-one-out validation procedure shows 52% precision and recall of our method, much better than that of the simple guilty-by-association methods.


PLOS Biology | 2012

Stitching together multiple data dimensions reveals interacting metabolomic and transcriptomic networks that modulate cell regulation.

Jun Zhu; Pavel Sova; Qiuwei Xu; Kenneth M. Dombek; Ethan Yixun Xu; Heather Vu; Zhidong Tu; Rachel B. Brem; Roger E. Bumgarner; Eric E. Schadt

DNA variation can be used as a systematic source of perturbation in segregating populations as a way to infer regulatory networks via the integration of large-scale, high-dimensional molecular profiling data.


intelligent systems in molecular biology | 2006

An integrative approach for causal gene identification and gene regulatory pathway inference

Zhidong Tu; Li Wang; Michelle N. Arbeitman; Ting Chen; Fengzhu Sun

MOTIVATION Gene expression variation can often be linked to certain chromosomal regions and are tightly associated with phenotypic variation such as disease conditions. Inferring the causal genes for the expression variation is of great importance but rather challenging as the linked region generally contains multiple genes. Even when a single candidate gene is proposed, the underlying biological mechanism by which the regulation is enforced remains unknown. Novel approaches are needed to both infer the causal genes and generate hypothesis on the underlying regulatory mechanisms. RESULTS We propose a new approach which aims at achieving the above objectives by integrating genotype information, gene expression, protein-protein interaction, protein phosphorylation, and transcription factor (TF)-DNA binding information. A network based stochastic algorithm is designed to infer the causal genes and identify the underlying regulatory pathways. We first quantitatively verified our method by a test using data generated by yeast knock-out experiments. Over 40% of inferred causal genes are correct, which is significantly better than 10% by random guess. We then applied our method to a recent genome-wide expression variation study in yeast. We show that our method can correctly identify the causal genes and effectively output experimentally verified pathways. New potential gene regulatory pathways are generated and presented as a global network. AVAILABILITY Source code is available upon request.


Genome Research | 2009

Integrating siRNA and protein–protein interaction data to identify an expanded insulin signaling network

Zhidong Tu; Carmen A. Argmann; Kenny K. Wong; Lyndon J. Mitnaul; Stephen Edwards; Iliana C. Sach; Jun Zhu; Eric E. Schadt

Insulin resistance is one of the dominant symptoms of type 2 diabetes (T2D). Although the molecular mechanisms leading to this resistance are largely unknown, experimental data support that the insulin signaling pathway is impaired in patients who are insulin resistant. To identify novel components/modulators of the insulin signaling pathway, we designed siRNAs targeting over 300 genes and tested the effects of knocking down these genes in an insulin-dependent, anti-lipolysis assay in 3T3-L1 adipocytes. For 126 genes, significant changes in free fatty acid release were observed. However, due to off-target effects (in addition to other limitations), high-throughput RNAi-based screens in cell-based systems generate significant amounts of noise. Therefore, to obtain a more reliable set of genes from the siRNA hits in our screen, we developed and applied a novel network-based approach that elucidates the mechanisms of action for the true positive siRNA hits. Our analysis results in the identification of a core network underlying the insulin signaling pathway that is more significantly enriched for genes previously associated with insulin resistance than the set of genes annotated in the KEGG database as belonging to the insulin signaling pathway. We experimentally validated one of the predictions, S1pr2, as a novel candidate gene for T2D.


PLOS Genetics | 2012

Integrative analysis of a cross-loci regulation network identifies App as a gene regulating insulin secretion from pancreatic islets.

Zhidong Tu; Mark P. Keller; Chunsheng Zhang; Mary E. Rabaglia; Danielle M. Greenawalt; Xia Yang; I-Ming Wang; Hongyue Dai; Matthew D. Bruss; Pek Yee Lum; Yun-Ping Zhou; Daniel M. Kemp; Christina Kendziorski; Brian S. Yandell; Alan D. Attie; Eric E. Schadt; Jun Zhu

Complex diseases result from molecular changes induced by multiple genetic factors and the environment. To derive a systems view of how genetic loci interact in the context of tissue-specific molecular networks, we constructed an F2 intercross comprised of >500 mice from diabetes-resistant (B6) and diabetes-susceptible (BTBR) mouse strains made genetically obese by the Leptinob/ob mutation (Lepob). High-density genotypes, diabetes-related clinical traits, and whole-transcriptome expression profiling in five tissues (white adipose, liver, pancreatic islets, hypothalamus, and gastrocnemius muscle) were determined for all mice. We performed an integrative analysis to investigate the inter-relationship among genetic factors, expression traits, and plasma insulin, a hallmark diabetes trait. Among five tissues under study, there are extensive protein–protein interactions between genes responding to different loci in adipose and pancreatic islets that potentially jointly participated in the regulation of plasma insulin. We developed a novel ranking scheme based on cross-loci protein-protein network topology and gene expression to assess each genes potential to regulate plasma insulin. Unique candidate genes were identified in adipose tissue and islets. In islets, the Alzheimers gene App was identified as a top candidate regulator. Islets from 17-week-old, but not 10-week-old, App knockout mice showed increased insulin secretion in response to glucose or a membrane-permeant cAMP analog, in agreement with the predictions of the network model. Our result provides a novel hypothesis on the mechanism for the connection between two aging-related diseases: Alzheimers disease and type 2 diabetes.


Bioinformatics | 2015

Integrative random forest for gene regulatory network inference.

Francesca Petralia; Pei Wang; Jialiang Yang; Zhidong Tu

Motivation: Gene regulatory network (GRN) inference based on genomic data is one of the most actively pursued computational biological problems. Because different types of biological data usually provide complementary information regarding the underlying GRN, a model that integrates big data of diverse types is expected to increase both the power and accuracy of GRN inference. Towards this goal, we propose a novel algorithm named iRafNet: integrative random forest for gene regulatory network inference. Results: iRafNet is a flexible, unified integrative framework that allows information from heterogeneous data, such as protein–protein interactions, transcription factor (TF)-DNA-binding, gene knock-down, to be jointly considered for GRN inference. Using test data from the DREAM4 and DREAM5 challenges, we demonstrate that iRafNet outperforms the original random forest based network inference algorithm (GENIE3), and is highly comparable to the community learning approach. We apply iRafNet to construct GRN in Saccharomyces cerevisiae and demonstrate that it improves the performance in predicting TF-target gene regulations and provides additional functional insights to the predicted gene regulations. Availability and implementation: The R code of iRafNet implementation and a tutorial are available at: http://research.mssm.edu/tulab/software/irafnet.html Contact: [email protected] Supplementary information: Supplementary data are available at Bioinformatics online.


PLOS Genetics | 2015

Integrative Analysis of DNA Methylation and Gene Expression Data Identifies EPAS1 as a Key Regulator of COPD

Seungyeul Yoo; Sachiko Takikawa; Patrick Geraghty; Carmen A. Argmann; Joshua D. Campbell; Luan Lin; Tao Huang; Zhidong Tu; Robert Feronjy; Avrum Spira; Eric E. Schadt; Charles A. Powell; Jun Zhu

Chronic Obstructive Pulmonary Disease (COPD) is a complex disease. Genetic, epigenetic, and environmental factors are known to contribute to COPD risk and disease progression. Therefore we developed a systematic approach to identify key regulators of COPD that integrates genome-wide DNA methylation, gene expression, and phenotype data in lung tissue from COPD and control samples. Our integrative analysis identified 126 key regulators of COPD. We identified EPAS1 as the only key regulator whose downstream genes significantly overlapped with multiple genes sets associated with COPD disease severity. EPAS1 is distinct in comparison with other key regulators in terms of methylation profile and downstream target genes. Genes predicted to be regulated by EPAS1 were enriched for biological processes including signaling, cell communications, and system development. We confirmed that EPAS1 protein levels are lower in human COPD lung tissue compared to non-disease controls and that Epas1 gene expression is reduced in mice chronically exposed to cigarette smoke. As EPAS1 downstream genes were significantly enriched for hypoxia responsive genes in endothelial cells, we tested EPAS1 function in human endothelial cells. EPAS1 knockdown by siRNA in endothelial cells impacted genes that significantly overlapped with EPAS1 downstream genes in lung tissue including hypoxia responsive genes, and genes associated with emphysema severity. Our first integrative analysis of genome-wide DNA methylation and gene expression profiles illustrates that not only does DNA methylation play a ‘causal’ role in the molecular pathophysiology of COPD, but it can be leveraged to directly identify novel key mediators of this pathophysiology.


Cell Metabolism | 2017

Human Pancreatic β Cell lncRNAs Control Cell-Specific Regulatory Networks

Ildem Akerman; Zhidong Tu; Anthony Beucher; Delphine M.Y. Rolando; Claire Sauty-Colace; Marion Benazra; Nikolina Nakic; Jialiang Yang; Huan L. Wang; Lorenzo Pasquali; Ignasi Moran; Javier García-Hurtado; Natalia Castro; Roser Gonzalez-Franco; Andrew F. Stewart; Caroline Bonner; Lorenzo Piemonti; Thierry Berney; Leif Groop; Julie Kerr-Conte; François Pattou; Carmen A. Argmann; Eric E. Schadt; Philippe Ravassard; Jorge Ferrer

Summary Recent studies have uncovered thousands of long non-coding RNAs (lncRNAs) in human pancreatic β cells. β cell lncRNAs are often cell type specific and exhibit dynamic regulation during differentiation or upon changing glucose concentrations. Although these features hint at a role of lncRNAs in β cell gene regulation and diabetes, the function of β cell lncRNAs remains largely unknown. In this study, we investigated the function of β cell-specific lncRNAs and transcription factors using transcript knockdowns and co-expression network analysis. This revealed lncRNAs that function in concert with transcription factors to regulate β cell-specific transcriptional networks. We further demonstrate that the lncRNA PLUTO affects local 3D chromatin structure and transcription of PDX1, encoding a key β cell transcription factor, and that both PLUTO and PDX1 are downregulated in islets from donors with type 2 diabetes or impaired glucose tolerance. These results implicate lncRNAs in the regulation of β cell-specific transcription factor networks.


BMC Genomics | 2009

A network-based integrative approach to prioritize reliable hits from multiple genome-wide RNAi screens in Drosophila

Li Wang; Zhidong Tu; Fengzhu Sun

BackgroundThe recently developed RNA interference (RNAi) technology has created an unprecedented opportunity which allows the function of individual genes in whole organisms or cell lines to be interrogated at genome-wide scale. However, multiple issues, such as off-target effects or low efficacies in knocking down certain genes, have produced RNAi screening results that are often noisy and that potentially yield both high rates of false positives and false negatives. Therefore, integrating RNAi screening results with other information, such as protein-protein interaction (PPI), may help to address these issues.ResultsBy analyzing 24 genome-wide RNAi screens interrogating various biological processes in Drosophila, we found that RNAi positive hits were significantly more connected to each other when analyzed within a protein-protein interaction network, as opposed to random cases, for nearly all screens. Based on this finding, we developed a network-based approach to identify false positives (FPs) and false negatives (FNs) in these screening results. This approach relied on a scoring function, which we termed NePhe, to integrate information obtained from both PPI network and RNAi screening results. Using a novel rank-based test, we compared the performance of different NePhe scoring functions and found that diffusion kernel-based methods generally outperformed others, such as direct neighbor-based methods. Using two genome-wide RNAi screens as examples, we validated our approach extensively from multiple aspects. We prioritized hits in the original screens that were more likely to be reproduced by the validation screen and recovered potential FNs whose involvements in the biological process were suggested by previous knowledge and mutant phenotypes. Finally, we demonstrated that the NePhe scoring system helped to biologically interpret RNAi results at the module level.ConclusionBy comprehensively analyzing multiple genome-wide RNAi screens, we conclude that network information can be effectively integrated with RNAi results to produce suggestive FPs and FNs, and to bring biological insight to the screening results.

Collaboration


Dive into the Zhidong Tu's collaboration.

Top Co-Authors

Avatar
Top Co-Authors

Avatar

Eric E. Schadt

Icahn School of Medicine at Mount Sinai

View shared research outputs
Top Co-Authors

Avatar

Fengzhu Sun

University of Southern California

View shared research outputs
Top Co-Authors

Avatar

Francesca Petralia

Icahn School of Medicine at Mount Sinai

View shared research outputs
Top Co-Authors

Avatar

Jialiang Yang

Icahn School of Medicine at Mount Sinai

View shared research outputs
Top Co-Authors

Avatar

Tao Huang

Icahn School of Medicine at Mount Sinai

View shared research outputs
Top Co-Authors

Avatar
Top Co-Authors

Avatar

Bin Zhang

Icahn School of Medicine at Mount Sinai

View shared research outputs
Top Co-Authors

Avatar

Carmen A. Argmann

Icahn School of Medicine at Mount Sinai

View shared research outputs
Researchain Logo
Decentralizing Knowledge