Network


Latest external collaboration on country level. Dive into details by clicking on the dots.

Hotspot


Dive into the research topics where Yunyan Gu is active.

Publication


Featured researches published by Yunyan Gu.


Briefings in Bioinformatics | 2012

GO-function: deriving biologically relevant functions from statistically significant functions

Jing Wang; Xianxiao Zhou; Jing Zhu; Yunyan Gu; Wenyuan Zhao; Jinfeng Zou; Zheng Guo

In high-throughput studies of diseases, terms enriched with disease-related genes based on Gene Ontology (GO) are routinely found. However, most current algorithms used to find significant GO terms cannot handle the redundancy that results from the dependencies of GO terms. Simply based on some numerical considerations, current algorithms developed for reducing this redundancy may produce results that do not account for biologically interesting cases. In this article, we present several rules used to design a tool called GO-function for extracting biologically relevant terms from statistically significant GO terms for a disease. Using one gene expression profile for colorectal cancer, we compared GO-function with four algorithms designed to treat redundancy. Then, we validated results obtained in this data set by GO-function using another data set for colorectal cancer. Our analysis showed that GO-function can identify disease-related terms that are more statistically and biologically meaningful than those found by the other four algorithms.


Cell Death and Disease | 2014

Integrated analyses identify the involvement of microRNA-26a in epithelial-mesenchymal transition during idiopathic pulmonary fibrosis.

Haihai Liang; Yunyan Gu; T Li; Y. Zhang; Longtao Huangfu; M Hu; D Zhao; Yingzhun Chen; S Liu; Y Dong; Xuelian Li; Y Lu; Baofeng Yang; Hongli Shan

Idiopathic Pulmonary Fibrosis (IPF) is a chronic, progressive, and highly lethal fibrotic lung disease with poor treatment and unknown etiology. Emerging evidence suggests that epithelial–mesenchymal transition (EMT) has an important role in repair and scar formation following epithelial injury during pulmonary fibrosis. Although some miRNAs have been shown to be dysregulated in the pathophysiological processes of IPF, limited studies have payed attention on the participation of miRNAs in EMT in lung fibrosis. In our study, we identified and constructed a regulation network of differentially expressed IPF miRNAs and EMT genes. Additionally, we found the downregulation of miR-26a in mice with experimental pulmonary fibrosis. Further studies showed that miR-26a regulated HMGA2, which is a key factor in the process of EMT and had the maximum number of regulating miRNAs in the regulation network. More importantly, inhibition of miR-26a resulted in lung epithelial cells transforming into myofibroblasts in vitro and in vivo, whereas forced expression of miR-26a alleviated TGF-β1- and BLM-induced EMT in A549 cells and in mice, respectively. Taken together, our study deciphered the essential role of miR-26a in the pathogenesis of EMT in pulmonary fibrosis, and suggests that miR-26a may be a potential therapeutic target for IPF.


BMC Bioinformatics | 2010

Extracting consistent knowledge from highly inconsistent cancer gene data sources

Xue Gong; Ruihong Wu; Yuannv Zhang; Wenyuan Zhao; Lixin Cheng; Yunyan Gu; Lin Zhang; Jing Wang; Jing Zhu; Zheng Guo

BackgroundHundreds of genes that are causally implicated in oncogenesis have been found and collected in various databases. For efficient application of these abundant but diverse data sources, it is of fundamental importance to evaluate their consistency.ResultsFirst, we showed that the lists of cancer genes from some major data sources were highly inconsistent in terms of overlapping genes. In particular, most cancer genes accumulated in previous small-scale studies could not be rediscovered in current high-throughput genome screening studies. Then, based on a metric proposed in this study, we showed that most cancer gene lists from different data sources were highly functionally consistent. Finally, we extracted functionally consistent cancer genes from various data sources and collected them in our database F-Census.ConclusionsAlthough they have very low gene overlapping, most cancer gene data sources are highly consistent at the functional level, which indicates that they can separately capture partial genes in a few key pathways associated with cancer. Our results suggest that the sample sizes currently used for cancer studies might be inadequate for consistently capturing individual cancer genes, but could be sufficient for finding a number of cancer genes that could represent functionally most cancer genes. The F-Census database provides biologists with a useful tool for browsing and extracting functionally consistent cancer genes from various data sources.


Briefings in Bioinformatics | 2016

Critical limitations of prognostic signatures based on risk scores summarized from gene expression levels: a case study for resected stage I non-small-cell lung cancer

Lishuang Qi; Libin Chen; Yang Li; Yuan Qin; Rufei Pan; Wenyuan Zhao; Yunyan Gu; Hongwei Wang; Ruiping Wang; Xiangqi Chen; Zheng Guo

Most of current gene expression signatures for cancer prognosis are based on risk scores, usually calculated as some summaries of expression levels of the signature genes, whose applications require presetting risk score thresholds and data normalization. In this study, we demonstrate the critical limitations of such type of signatures that the risk scores of samples will change greatly when they are normalized together with different samples, which would induce spurious risk classification and difficulty in clinical settings, and the risk scores of independent samples are incomparable if data normalization is not adopted. To overcome these limitations, we propose a rank-based method to extract a prognostic gene pair signature for overall survival of stage I non-small-cell lung cancer. The prognostic gene pair signature is verified in three integrated data sets detected by different laboratories with different microarray platforms. We conclude that, different from the type of signatures based on risk scores summarized from gene expression levels, the rank-based signatures could be robustly applied at the individualized level to independent clinical samples assessed in different laboratories.


Genomics | 2009

Finding disease-specific coordinated functions by multi-function genes: Insight into the coordination mechanisms in diseases

Wencai Ma; Da Yang; Yunyan Gu; Xinwu Guo; Wenyuan Zhao; Zheng Guo

We developed an approach using multi-function disease genes to find function pairs whose co-deregulation might induce a disease. Analyzing cancer genes, we found many cancer-specific coordinated function pairs co-deregulated by dysfunction of multi-function genes and other molecular changes in cancer. Studying two subtypes of cardiomyopathy, we found they show certain consistency at the functional coordination level. Our approach can also provide important information for finding novel disease genes as well as their mechanisms in diseases.


Computational Biology and Chemistry | 2011

Extensive increase of microarray signals in cancers calls for novel normalization assumptions

D. Wang; Lixin Cheng; Mingyue Wang; Ruihong Wu; Pengfei Li; Bin Li; Yuannv Zhang; Yunyan Gu; Wenyuan Zhao; Chenguang Wang; Zheng Guo

When using microarray data for studying a complex disease such as cancer, it is a common practice to normalize data to force all arrays to have the same distribution of probe intensities regardless of the biological groups of samples. The assumption underlying such normalization is that in a disease the majority of genes are not differentially expressed genes (DE genes) and the numbers of up- and down-regulated genes are roughly equal. However, accumulated evidences suggest gene expressions could be widely altered in cancer, so we need to evaluate the sensitivities of biological discoveries to violation of the normalization assumption. Here, we analyzed 7 large Affymetrix datasets of pair-matched normal and cancer samples for cancers collected in the NCBI GEO database. We showed that in 6 of these 7 datasets, the medians of perfect match (PM) probe intensities increased in cancer state and the increases were significant in three datasets, suggesting the assumption that all arrays have the same median probe intensities regardless of the biological groups of samples might be misleading. Then, we evaluated the effects of three currently most widely used normalization algorithms (RMA, MAS5.0 and dChip) on the selection of DE genes by comparing them with LVS which relies less on the above-mentioned assumption. The results showed using RMA, MAS5.0 and dChip may produce lots of false results of down-regulated DE genes while missing many up-regulated DE genes. At least for cancer study, normalizing all arrays to have the same distribution of probe intensities regardless of the biological groups of samples might be misleading. Thus, most current normalizations based on unreliable assumptions may distort biological differences between normal and cancer samples. The LVS algorithm might perform relatively well due to that it relies less on the above-mentioned assumption. Also, our results indicate that genes may be widely up-regulated in most human cancer.


Briefings in Bioinformatics | 2016

Individualized identification of disease-associated pathways with disrupted coordination of gene expression

Hongwei Wang; Hao Cai; Lu Ao; Haidan Yan; Wenyuan Zhao; Lishuang Qi; Yunyan Gu; Zheng Guo

Current pathway analysis approaches are primarily dedicated to capturing deregulated pathways at the population level and cannot provide patient-specific pathway deregulation information. In this article, the authors present a simple approach, called individPath, to detect pathways with significantly disrupted intra-pathway relative expression orderings for each disease sample compared with the stable, normal intra-pathway relative expression orderings pre-determined in previously accumulated normal samples. Through the analysis of multiple microarray data sets for lung and breast cancer, the authors demonstrate individPaths effectiveness for detecting cancer-associated pathways with disrupted relative expression orderings at the individual level and dissecting the heterogeneity of pathway deregulation among different patients. The portable use of this simple approach in clinical contexts is exemplified by the identification of prognostic intra-pathway gene pair signatures to predict overall survival of resected early-stage lung adenocarcinoma patients and signatures to predict relapse-free survival of estrogen receptor-positive breast cancer patients after tamoxifen treatment.


Molecular Cancer Therapeutics | 2010

Systematic Interpretation of Comutated Genes in Large-Scale Cancer Mutation Profiles

Yunyan Gu; Da Yang; Jinfeng Zou; Wencai Ma; Ruihong Wu; Wenyuan Zhao; Yuannv Zhang; Hui Xiao; Xue Gong; Min Zhang; Jing Zhu; Zheng Guo

By high-throughput screens of somatic mutations of genes in cancer genomes, hundreds of cancer genes are being rapidly identified, providing us abundant information for systematically deciphering the genetic changes underlying cancer mechanism. However, the functional collaboration of mutated genes is often neglected in current studies. Here, using four genome-wide somatic mutation data sets and pathways defined in various databases, we showed that gene pairs significantly comutated in cancer samples tend to distribute between pathways rather than within pathways. At the basic functional level of motifs in the human protein-protein interaction network, we also found that comutated gene pairs were overrepresented between motifs but extremely depleted within motifs. Specifically, we showed that based on Gene Ontology that describes gene functions at various specific levels, we could tackle the pathway definition problem to some degree and study the functional collaboration of gene mutations in cancer genomes more efficiently. Then, by defining pairs of pathways frequently linked by comutated gene pairs as the between-pathway models, we showed they are also likely to be codisrupted by mutations of the interpathway hubs of the coupled pathways, suggesting new hints for understanding the heterogeneous mechanisms of cancers. Finally, we showed some between-pathway models consisting of important pathways such as cell cycle checkpoint and cell proliferation were codisrupted in most cancer samples under this study, suggesting that their codisruptions might be functionally essential in inducing these cancers. All together, our results would provide a channel to detangle the complex collaboration of the molecular processes underlying cancer mechanism. Mol Cancer Ther; 9(8); 2186–95. ©2010 AACR.


Molecular Carcinogenesis | 2016

Autophagy-related prognostic signature for breast cancer.

Yunyan Gu; Pengfei Li; Fuduan Peng; Mengmeng Zhang; Yuanyuan Zhang; Haihai Liang; Wenyuan Zhao; Lishuang Qi; Hongwei Wang; Chenguang Wang; Zheng Guo

Autophagy is a process that degrades intracellular constituents, such as long‐lived or damaged proteins and organelles, to buffer metabolic stress under starvation conditions. Deregulation of autophagy is involved in the progression of cancer. However, the predictive value of autophagy for breast cancer prognosis remains unclear. First, based on gene expression profiling, we found that autophagy genes were implicated in breast cancer. Then, using the Cox proportional hazard regression model, we detected autophagy prognostic signature for breast cancer in a training dataset. We identified a set of eight autophagy genes (BCL2, BIRC5, EIF4EBP1, ERO1L, FOS, GAPDH, ITPR1 and VEGFA) that were significantly associated with overall survival in breast cancer. The eight autophagy genes were assigned as a autophagy‐related prognostic signature for breast cancer. Based on the autophagy‐related signature, the training dataset GSE21653 could be classified into high‐risk and low‐risk subgroups with significantly different survival times (HR = 2.72, 95% CI = (1.91, 3.87); P = 1.37 × 10−5). Inactivation of autophagy was associated with shortened survival of breast cancer patients. The prognostic value of the autophagy‐related signature was confirmed in the testing dataset GSE3494 (HR = 2.12, 95% CI = (1.48, 3.03); P = 1.65 × 10−3) and GSE7390 (HR = 1.76, 95% CI = (1.22, 2.54); P = 9.95 × 10−4). Further analysis revealed that the prognostic value of the autophagy signature was independent of known clinical prognostic factors, including age, tumor size, grade, estrogen receptor status, progesterone receptor status, ERBB2 status, lymph node status and TP53 mutation status. Finally, we demonstrated that the autophagy signature could also predict distant metastasis‐free survival for breast cancer.


Molecular Cancer Therapeutics | 2011

Evaluating the consistency of differential expression of microRNA detected in human cancers

Xue Gong; Ruihong Wu; Hongwei Wang; Xinwu Guo; D. Wang; Yunyan Gu; Yuannv Zhang; Wenyuan Zhao; Lixin Cheng; Chenguang Wang; Zheng Guo

Differential expression of microRNA (miRNA) is involved in many human diseases and could potentially be used as a biomarker for disease diagnosis, prognosis, and therapy. However, inconsistency has often been found among differentially expressed miRNAs identified in various studies when using miRNA arrays for a particular disease such as a cancer. Before broadly applying miRNA arrays in a clinical setting, it is critical to evaluate inconsistent discoveries in a rational way. Thus, using data sets from 2 types of cancers, our study shows that the differentially expressed miRNAs detected from multiple experiments for each cancer exhibit stable regulation direction. This result also indicates that miRNA arrays could be used to reliably capture the signals of the regulation direction of differentially expressed miRNAs in cancer. We then assumed that 2 differentially expressed miRNAs with the same regulation direction in a particular cancer play similar functional roles if they regulate the same set of cancer-associated genes. On the basis of this hypothesis, we proposed a score to assess the functional consistency between differentially expressed miRNAs separately extracted from multiple studies for a particular cancer. We showed although lists of differentially expressed miRNAs identified from different studies for each cancer were highly variable, they were rather consistent at the level of function. Thus, the detection of differentially expressed miRNAs in various experiments for a certain disease tends to be functionally reproducible and capture functionally related differential expression of miRNAs in the disease. Mol Cancer Ther; 10(5); 752–60. ©2011 AACR.

Collaboration


Dive into the Yunyan Gu's collaboration.

Top Co-Authors

Avatar

Zheng Guo

Fujian Medical University

View shared research outputs
Top Co-Authors

Avatar

Wenyuan Zhao

Harbin Medical University

View shared research outputs
Top Co-Authors

Avatar

Lishuang Qi

Harbin Medical University

View shared research outputs
Top Co-Authors

Avatar

Chenguang Wang

Harbin Medical University

View shared research outputs
Top Co-Authors

Avatar

Haihai Liang

Harbin Medical University

View shared research outputs
Top Co-Authors

Avatar

Yuannv Zhang

Harbin Medical University

View shared research outputs
Top Co-Authors

Avatar

Hongwei Wang

Harbin Medical University

View shared research outputs
Top Co-Authors

Avatar

Ruiping Wang

Harbin Medical University

View shared research outputs
Top Co-Authors

Avatar

Ruihong Wu

Harbin Medical University

View shared research outputs
Top Co-Authors

Avatar

Wenbin Zhou

Harbin Medical University

View shared research outputs
Researchain Logo
Decentralizing Knowledge