Network


Latest external collaboration on country level. Dive into details by clicking on the dots.

Hotspot


Dive into the research topics where Ji-Gang Zhang is active.

Publication


Featured researches published by Ji-Gang Zhang.


PLOS ONE | 2013

Comparative Studies of Copy Number Variation Detection Methods for Next-Generation Sequencing Technologies

Junbo Duan; Ji-Gang Zhang; Hong-Wen Deng; Yu-Ping Wang

Copy number variation (CNV) has played an important role in studies of susceptibility or resistance to complex diseases. Traditional methods such as fluorescence in situ hybridization (FISH) and array comparative genomic hybridization (aCGH) suffer from low resolution of genomic regions. Following the emergence of next generation sequencing (NGS) technologies, CNV detection methods based on the short read data have recently been developed. However, due to the relatively young age of the procedures, their performance is not fully understood. To help investigators choose suitable methods to detect CNVs, comparative studies are needed. We compared six publicly available CNV detection methods: CNV-seq, FREEC, readDepth, CNVnator, SegSeq and event-wise testing (EWT). They are evaluated both on simulated and real data with different experiment settings. The receiver operating characteristic (ROC) curve is employed to demonstrate the detection performance in terms of sensitivity and specificity, box plot is employed to compare their performances in terms of breakpoint and copy number estimation, Venn diagram is employed to show the consistency among these methods, and F-score is employed to show the overlapping quality of detected CNVs. The computational demands are also studied. The results of our work provide a comprehensive evaluation on the performances of the selected CNV detection methods, which will help biological investigators choose the best possible method.


BMC Bioinformatics | 2013

Group sparse canonical correlation analysis for genomic data integration

Dongdong Lin; Ji-Gang Zhang; Jingyao Li; Vince D. Calhoun; Hong-Wen Deng; Yu-Ping Wang

BackgroundThe emergence of high-throughput genomic datasets from different sources and platforms (e.g., gene expression, single nucleotide polymorphisms (SNP), and copy number variation (CNV)) has greatly enhanced our understandings of the interplay of these genomic factors as well as their influences on the complex diseases. It is challenging to explore the relationship between these different types of genomic data sets. In this paper, we focus on a multivariate statistical method, canonical correlation analysis (CCA) method for this problem. Conventional CCA method does not work effectively if the number of data samples is significantly less than that of biomarkers, which is a typical case for genomic data (e.g., SNPs). Sparse CCA (sCCA) methods were introduced to overcome such difficulty, mostly using penalizations with l-1 norm (CCA-l1) or the combination of l-1and l-2 norm (CCA-elastic net). However, they overlook the structural or group effect within genomic data in the analysis, which often exist and are important (e.g., SNPs spanning a gene interact and work together as a group).ResultsWe propose a new group sparse CCA method (CCA-sparse group) along with an effective numerical algorithm to study the mutual relationship between two different types of genomic data (i.e., SNP and gene expression). We then extend the model to a more general formulation that can include the existing sCCA models. We apply the model to feature/variable selection from two data sets and compare our group sparse CCA method with existing sCCA methods on both simulation and two real datasets (human gliomas data and NCI60 data). We use a graphical representation of the samples with a pair of canonical variates to demonstrate the discriminating characteristic of the selected features. Pathway analysis is further performed for biological interpretation of those features.ConclusionsThe CCA-sparse group method incorporates group effects of features into the correlation analysis while performs individual feature selection simultaneously. It outperforms the two sCCA methods (CCA-l1 and CCA-group) by identifying the correlated features with more true positives while controlling total discordance at a lower level on the simulated data, even if the group effect does not exist or there are irrelevant features grouped with true correlated features. Compared with our proposed CCA-group sparse models, CCA-l1 tends to select less true correlated features while CCA-group inclines to select more redundant features.


PLOS ONE | 2013

Comprehensive Characterization of Human Genome Variation by High Coverage Whole-Genome Sequencing of Forty Four Caucasians

Hui Shen; Jian Li; Ji-Gang Zhang; Chao Xu; Yan Jiang; Zikai Wu; Fuping Zhao; Li Liao; Jun Chen; Yong Lin; Qing Tian; Christopher J. Papasian; Hong-Wen Deng

Whole genome sequencing studies are essential to obtain a comprehensive understanding of the vast pattern of human genomic variations. Here we report the results of a high-coverage whole genome sequencing study for 44 unrelated healthy Caucasian adults, each sequenced to over 50-fold coverage (averaging 65.8×). We identified approximately 11 million single nucleotide polymorphisms (SNPs), 2.8 million short insertions and deletions, and over 500,000 block substitutions. We showed that, although previous studies, including the 1000 Genomes Project Phase 1 study, have catalogued the vast majority of common SNPs, many of the low-frequency and rare variants remain undiscovered. For instance, approximately 1.4 million SNPs and 1.3 million short indels that we found were novel to both the dbSNP and the 1000 Genomes Project Phase 1 data sets, and the majority of which (∼96%) have a minor allele frequency less than 5%. On average, each individual genome carried ∼3.3 million SNPs and ∼492,000 indels/block substitutions, including approximately 179 variants that were predicted to cause loss of function of the gene products. Moreover, each individual genome carried an average of 44 such loss-of-function variants in a homozygous state, which would completely “knock out” the corresponding genes. Across all the 44 genomes, a total of 182 genes were “knocked-out” in at least one individual genome, among which 46 genes were “knocked out” in over 30% of our samples, suggesting that a number of genes are commonly “knocked-out” in general populations. Gene ontology analysis suggested that these commonly “knocked-out” genes are enriched in biological process related to antigen processing and immune response. Our results contribute towards a comprehensive characterization of human genomic variation, especially for less-common and rare variants, and provide an invaluable resource for future genetic studies of human variation and diseases.


PLOS ONE | 2014

Replication of 6 Obesity Genes in a Meta-Analysis of Genome-Wide Association Studies from Diverse Ancestries

Li-Jun Tan; Hu Zhu; Hao He; Ke-Hao Wu; Jian Li; Xiang-Ding Chen; Ji-Gang Zhang; Hui Shen; Qing Tian; Marie Krousel-Wood; Christopher J. Papasian; Claude Bouchard; Louis Pérusse; Hong-Wen Deng

Obesity is a major public health problem with a significant genetic component. Multiple DNA polymorphisms/genes have been shown to be strongly associated with obesity, typically in populations of European descent. The aim of this study was to verify the extent to which 6 confirmed obesity genes (FTO, CTNNBL1, ADRB2, LEPR, PPARG and UCP2 genes) could be replicated in 8 different samples (n = 11,161) and to explore whether the same genes contribute to obesity-susceptibility in populations of different ancestries (five Caucasian, one Chinese, one African-American and one Hispanic population). GWAS-based data sets with 1000 G imputed variants were tested for association with obesity phenotypes individually in each population, and subsequently combined in a meta-analysis. Multiple variants at the FTO locus showed significant associations with BMI, fat mass (FM) and percentage of body fat (PBF) in meta-analysis. The strongest association was detected at rs7185735 (P-value = 1.01×10−7 for BMI, 1.80×10−6 for FM, and 5.29×10−4 for PBF). Variants at the CTNNBL1, LEPR and PPARG loci demonstrated nominal association with obesity phenotypes (meta-analysis P-values ranging from 1.15×10−3 to 4.94×10−2). There was no evidence of association with variants at ADRB2 and UCP2 genes. When stratified by sex and ethnicity, FTO variants showed sex-specific and ethnic-specific effects on obesity traits. Thus, it is likely that FTO has an important role in the sex- and ethnic-specific risk of obesity. Our data confirmed the role of FTO, CTNNBL1, LEPR and PPARG in obesity predisposition. These findings enhanced our knowledge of genetic associations between these genes and obesity-related phenotypes, and provided further justification for pursuing functional studies of these genes in the pathophysiology of obesity. Sex and ethnic differences in genetic susceptibility across populations of diverse ancestries may contribute to a more targeted prevention and customized treatment of obesity.


BMC Bioinformatics | 2013

CNV-TV: A robust method to discover copy number variation from short sequencing reads

Junbo Duan; Ji-Gang Zhang; Hong-Wen Deng; Yu-Ping Wang

BackgroundCopy number variation (CNV) is an important structural variation (SV) in human genome. Various studies have shown that CNVs are associated with complex diseases. Traditional CNV detection methods such as fluorescence in situ hybridization (FISH) and array comparative genomic hybridization (aCGH) suffer from low resolution. The next generation sequencing (NGS) technique promises a higher resolution detection of CNVs and several methods were recently proposed for realizing such a promise. However, the performances of these methods are not robust under some conditions, e.g., some of them may fail to detect CNVs of short sizes. There has been a strong demand for reliable detection of CNVs from high resolution NGS data.ResultsA novel and robust method to detect CNV from short sequencing reads is proposed in this study. The detection of CNV is modeled as a change-point detection from the read depth (RD) signal derived from the NGS, which is fitted with a total variation (TV) penalized least squares model. The performance (e.g., sensitivity and specificity) of the proposed approach are evaluated by comparison with several recently published methods on both simulated and real data from the 1000 Genomes Project.ConclusionThe experimental results showed that both the true positive rate and false positive rate of the proposed detection method do not change significantly for CNVs with different copy numbers and lengthes, when compared with several existing methods. Therefore, our proposed approach results in a more reliable detection of CNVs than the existing methods.


Network Modeling Analysis in Health Informatics and BioInformatics | 2015

MicroRNA–mRNA interaction analysis to detect potential dysregulation in complex diseases

Wenlong Tang; Chao Xu; Yu-Ping Wang; Hong-Wen Deng; Ji-Gang Zhang

It is now recognized that genetic interactions (epistasis) are important sources of the hidden genetic variations and may play an important role in complex diseases. Identifying genetic interactions not only helps to explain part of the heritability of complex diseases, but also provides the clue to understand the underlying pathogenesis of complex diseases. Advances in high-throughput technologies enable simultaneous measurements of multiple genomic features from the same samples on a genome-wide scale, and different omics features are not acting in isolation but interact/crosstalk at multiple (within and across individual omics features) levels in complex networks. Therefore, genetic interaction needs to be accounted for across different omics features, potentially allowing an explanation of phenotype variation that single omics data cannot capture. In this study, we propose an analysis framework to detect the miRNA–mRNA interaction enrichment by incorporating principal components analysis and canonical correlation analysis. We demonstrate the advantages of our method by applying to miRNA and mRNA data on glioblastoma (GBM) generated by The Cancer Genome Atlas project. The results show that there are enrichments of the interactions between co-expressed miRNAs and gene pathways which are associated with GBM status. The biological functions of those identified genes and miRNAs have been confirmed to be associated with glioblastoma by independent studies. The proposed approach provides new insights in the regulatory mechanisms and an example for detecting interactions of multi-omics data on complex diseases.


Bone | 2017

Increased detection of genetic loci associated with risk predictors of osteoporotic fracture using a pleiotropic cFDR method

Jonathan Greenbaum; Ke-Hao Wu; Lan Zhang; Hui Shen; Ji-Gang Zhang; Hong-Wen Deng

Although GWAS have been successful in identifying some osteoporosis associated loci, the findings explain only a small fraction of the total genetic variance. In this study we use a recently developed novel pleiotropic conditional false discovery rate (cFDR) method to identify novel genetic loci associated with two risk traits for osteoporotic fracture (the clinical outcome and end result of osteoporosis), Height (HT) and Femoral Neck (FNK) BMD. The cFDR method allows us to improve the detection of associated variants by incorporating any potentially shared genetic mechanisms between the two associated traits. We analyzed the summary statistics from two GWAS meta-analyses for single nucleotide polymorphisms (SNPs) that are associated with HT and FNK BMD. Using the cFDR method, we show enrichment in the identification of SNPs associated with each trait conditioned on their strength of association with the second trait. The findings revealed 18 SNPs that are associated with both HT and FNK BMD, 4 of which had not previously been reported to play a role in bone health. The novel SNPs located at KIF1B and the intergenic region between FERD3L and TWISTNB are noteworthy as these genes may be associated with processes that are functionally important in bone metabolism. By leveraging GWAS results from related phenotypes we identified several novel loci that may contribute to the proportion of variability explained for each trait, although we cannot speculate about these potential contributions to heritability based on this analysis alone.


Proteomics | 2016

Network-based proteomic analysis for postmenopausal osteoporosis in Caucasian females.

Lan Zhang; Yao-Zhong Liu; Yong Zeng; Wei Zhu; Ying-Chun Zhao; Ji-Gang Zhang; Jia-Qiang Zhu; Hao He; Hui Shen; Qing Tian; Fei-Yan Deng; Christopher J. Papasian; Hong-Wen Deng

Menopause is one of the crucial physiological events during the life of a woman. Transition of menopause status is accompanied by increased risks of various health problems such as osteoporosis. Peripheral blood monocytes can differentiate into osteoclasts and produce cytokines important for osteoclast activity. With quantitative proteomics LC‐nano‐ESI‐MSE (where MSE is elevated‐energy MS), we performed protein expression profiling of peripheral blood monocytes in 42 postmenopausal women with discordant bone mineral density (BMD) levels. Traditional comparative analysis showed proteins encoded by four genes (LOC654188, PPIA, TAGLN2, YWHAB) and three genes (LMNB1, ANXA2P2, ANXA2) were significantly down‐ and upregulated, respectively, in extremely low‐ versus high‐BMD subjects. To study functionally orchestrating groups of detected proteins in the form of networks, we performed weighted gene coexpression network analysis and gene set enrichment analysis. Weighted gene coexpression network analysis showed that the module including the annexin gene family was most significantly correlated with low BMD, and the lipid‐binding related GO terms were enriched in this identified module. Gene set enrichment analysis revealed that two significantly enriched gene sets may be involved in postmenopausal BMD variation by regulating pro‐inflammatory cytokines activities. To gain more insights into the proteomics data generated, we performed integrative analyses of the datasets available to us at the genome (DNA level), transcriptome (RNA level), and proteome levels jointly.


PLOS ONE | 2015

Attenuated Monocyte Apoptosis, a New Mechanism for Osteoporosis Suggested by a Transcriptome-Wide Expression Study of Monocytes

Yao-Zhong Liu; Yu Zhou; Lei Zhang; Jian Li; Qing Tian; Ji-Gang Zhang; Hong-Wen Deng

Background Osteoporosis is caused by excessive bone resorption (by osteoclasts) over bone formation (by osteoblasts). Monocytes are important to osteoporosis by serving as progenitors of osteoclasts and produce cytokines for osteoclastogenesis. Aim To identify osteoporosis-related genes, we performed microarray analyses of monocytes using Affymetrix 1.0 ST arrays in 42 (including 16 pre- and 26 postmenopausal) high hip BMD (bone mineral density) vs. 31 (including 15 pre- and 16 postmenopausal) low hip BMD Caucasian female subjects. Here, high vs. low BMD is defined as belonging to top vs. bottom 30% of BMD values in population. Method Differential gene expression analysis in high vs. low BMD subjects was conducted in the total cohort as well as pre- and post-menopausal subjects. Focusing on the top differentially expressed genes identified in the total, the pre- and the postmenopausal subjects (with a p <5E-03), we performed replication of the findings in 3 independent datasets of microarray analyses of monocytes (total N = 125). Results We identified (in the 73 subjects) and successfully replicated in all the 3 independent datasets 2 genes, DAXX and PLK3. Interestingly, both genes are apoptosis induction genes and both down-regulated in the low BMD subjects. Moreover, using the top 200 genes identified in the meta-analysis across all of the 4 microarray datasets, GO term enrichment analysis identified a number of terms related to induction of apoptosis, for which the majority of component genes are also down-regulated in the low BMD subjects. Overall, our result may suggest that there might be a decreased apoptosis activity of monocytes in the low BMD subjects. Conclusion Our study for the first time suggested a decreased apoptosis rate (hence an increased survival) of monocytes, an important osteoclastogenic cell, as a novel mechanism for osteoporosis.


PLOS ONE | 2015

Integrative Analysis of Transcriptomic and Epigenomic Data to Reveal Regulation Patterns for BMD Variation.

Ji-Gang Zhang; Li-Jun Tan; Chao Xu; Hao He; Qing Tian; Yu Zhou; Chuan Qiu; Xiang-Ding Chen; Hong-Wen Deng

Integration of multiple profiling data and construction of functional gene networks may provide additional insights into the molecular mechanisms of complex diseases. Osteoporosis is a worldwide public health problem, but the complex gene-gene interactions, post-transcriptional modifications and regulation of functional networks are still unclear. To gain a comprehensive understanding of osteoporosis etiology, transcriptome gene expression microarray, epigenomic miRNA microarray and methylome sequencing were performed simultaneously in 5 high hip BMD (Bone Mineral Density) subjects and 5 low hip BMD subjects. SPIA (Signaling Pathway Impact Analysis) and PCST (Prize Collecting Steiner Tree) algorithm were used to perform pathway-enrichment analysis and construct the interaction networks. Through integrating the transcriptomic and epigenomic data, firstly we identified 3 genes (FAM50A, ZNF473 and TMEM55B) and one miRNA (hsa-mir-4291) which showed the consistent association evidence from both gene expression and methylation data; secondly in network analysis we identified an interaction network module with 12 genes and 11 miRNAs including AKT1, STAT3, STAT5A, FLT3, hsa-mir-141 and hsa-mir-34a which have been associated with BMD in previous studies. This module revealed the crosstalk among miRNAs, mRNAs and DNA methylation and showed four potential regulatory patterns of gene expression to influence the BMD status. In conclusion, the integration of multiple layers of omics can yield in-depth results than analysis of individual omics data respectively. Integrative analysis from transcriptomics and epigenomic data improves our ability to identify causal genetic factors, and more importantly uncover functional regulation pattern of multi-omics for osteoporosis etiology.

Collaboration


Dive into the Ji-Gang Zhang's collaboration.

Top Co-Authors

Avatar
Top Co-Authors

Avatar
Top Co-Authors

Avatar
Top Co-Authors

Avatar
Top Co-Authors

Avatar
Top Co-Authors

Avatar

Dongdong Lin

The Mind Research Network

View shared research outputs
Top Co-Authors

Avatar
Top Co-Authors

Avatar
Top Co-Authors

Avatar
Top Co-Authors

Avatar
Researchain Logo
Decentralizing Knowledge