Network


Latest external collaboration on country level. Dive into details by clicking on the dots.

Hotspot


Dive into the research topics where Futao Zhang is active.

Publication


Featured researches published by Futao Zhang.


Nature Genetics | 2016

Integration of summary data from GWAS and eQTL studies predicts complex trait gene targets

Zhihong Zhu; Futao Zhang; Han Hu; Andrew Bakshi; Matthew R. Robinson; Joseph E. Powell; Grant W. Montgomery; Michael E. Goddard; Naomi R. Wray; Peter M. Visscher; Jian Yang

Genome-wide association studies (GWAS) have identified thousands of genetic variants associated with human complex traits. However, the genes or functional DNA elements through which these variants exert their effects on the traits are often unknown. We propose a method (called SMR) that integrates summary-level data from GWAS with data from expression quantitative trait locus (eQTL) studies to identify genes whose expression levels are associated with a complex trait because of pleiotropy. We apply the method to five human complex traits using GWAS data on up to 339,224 individuals and eQTL data on 5,311 individuals, and we prioritize 126 genes (for example, TRAF1 and ANKRD55 for rheumatoid arthritis and SNX19 and NMRAL1 for schizophrenia), of which 25 genes are new candidates; 77 genes are not the nearest annotated gene to the top associated GWAS SNP. These genes provide important leads to design future functional studies to understand the mechanism whereby DNA variation leads to complex trait variation.


Nature Communications | 2018

Causal associations between risk factors and common diseases inferred from GWAS summary data

Zhihong Zhu; Zhili Zheng; Futao Zhang; Yang Wu; Maciej Trzaskowski; Robert Maier; Matthew R. Robinson; John J. McGrath; Peter M. Visscher; Naomi R. Wray; Jian Yang

Health risk factors such as body mass index (BMI) and serum cholesterol are associated with many common diseases. It often remains unclear whether the risk factors are cause or consequence of disease, or whether the associations are the result of confounding. We develop and apply a method (called GSMR) that performs a multi-SNP Mendelian randomization analysis using summary-level data from genome-wide association studies to test the causal associations of BMI, waist-to-hip ratio, serum cholesterols, blood pressures, height, and years of schooling (EduYears) with common diseases (sample sizes of up to 405,072). We identify a number of causal associations including a protective effect of LDL-cholesterol against type-2 diabetes (T2D) that might explain the side effects of statins on T2D, a protective effect of EduYears against Alzheimer’s disease, and bidirectional associations with opposite effects (e.g., higher BMI increases the risk of T2D but the effect of T2D on BMI is negative).Genetic methods are useful to test whether risk factors are causal for or consequence of disease. Here, Zhu et al. develop a generalized summary-based Mendelian Randomization (GSMR) method which uses summary-level data from GWAS to test for causal associations of health risk factors with common diseases.


Scientific Reports | 2015

Mixed Linear Model Approaches of Association Mapping for Complex Traits Based on Omics Variants.

Futao Zhang; Zhihong Zhu; Xiaoran Tong; Zhi-Xiang Zhu; Ting Qi; Jun Zhu

Precise prediction for genetic architecture of complex traits is impeded by the limited understanding on genetic effects of complex traits, especially on gene-by-gene (GxG) and gene-by-environment (GxE) interaction. In the past decades, an explosion of high throughput technologies enables omics studies at multiple levels (such as genomics, transcriptomics, proteomics, and metabolomics). The analyses of large omics data, especially two-loci interaction analysis, are very time intensive. Integrating the diverse omics data and environmental effects in the analyses also remain challenges. We proposed mixed linear model approaches using GPU (Graphic Processing Unit) computation to simultaneously dissect various genetic effects. Analyses can be performed for estimating genetic main effects, GxG epistasis effects, and GxE environment interaction effects on large-scale omics data for complex traits, and for estimating heritability of specific genetic effects. Both mouse data analyses and Monte Carlo simulations demonstrated that genetic effects and environment interaction effects could be unbiasedly estimated with high statistical power by using the proposed approaches.


Genome Research | 2014

Epistasis analysis for quantitative traits by functional regression model

Futao Zhang; Eric Boerwinkle; Momiao Xiong

The critical barrier in interaction analysis for rare variants is that most traditional statistical methods for testing interactions were originally designed for testing the interaction between common variants and are difficult to apply to rare variants because of their prohibitive computational time and poor ability. The great challenges for successful detection of interactions with next-generation sequencing (NGS) data are (1) lack of methods for interaction analysis with rare variants, (2) severe multiple testing, and (3) time-consuming computations. To meet these challenges, we shift the paradigm of interaction analysis between two loci to interaction analysis between two sets of loci or genomic regions and collectively test interactions between all possible pairs of SNPs within two genomic regions. In other words, we take a genome region as a basic unit of interaction analysis and use high-dimensional data reduction and functional data analysis techniques to develop a novel functional regression model to collectively test interactions between all possible pairs of single nucleotide polymorphisms (SNPs) within two genome regions. By intensive simulations, we demonstrate that the functional regression models for interaction analysis of the quantitative trait have the correct type 1 error rates and a much better ability to detect interactions than the current pairwise interaction analysis. The proposed method was applied to exome sequence data from the NHLBIs Exome Sequencing Project (ESP) and CHARGE-S study. We discovered 27 pairs of genes showing significant interactions after applying the Bonferroni correction (P-values < 4.58 × 10(-10)) in the ESP, and 11 were replicated in the CHARGE-S study.


PLOS ONE | 2015

FastGCN: A GPU Accelerated Tool for Fast Gene Co-Expression Networks

Meimei Liang; Futao Zhang; Gulei Jin; Jun Zhu

Gene co-expression networks comprise one type of valuable biological networks. Many methods and tools have been published to construct gene co-expression networks; however, most of these tools and methods are inconvenient and time consuming for large datasets. We have developed a user-friendly, accelerated and optimized tool for constructing gene co-expression networks that can fully harness the parallel nature of GPU (Graphic Processing Unit) architectures. Genetic entropies were exploited to filter out genes with no or small expression changes in the raw data preprocessing step. Pearson correlation coefficients were then calculated. After that, we normalized these coefficients and employed the False Discovery Rate to control the multiple tests. At last, modules identification was conducted to construct the co-expression networks. All of these calculations were implemented on a GPU. We also compressed the coefficient matrix to save space. We compared the performance of the GPU implementation with those of multi-core CPU implementations with 16 CPU threads, single-thread C/C++ implementation and single-thread R implementation. Our results show that GPU implementation largely outperforms single-thread C/C++ implementation and single-thread R implementation, and GPU implementation outperforms multi-core CPU implementation when the number of genes increases. With the test dataset containing 16,000 genes and 590 individuals, we can achieve greater than 63 times the speed using a GPU implementation compared with a single-thread R implementation when 50 percent of genes were filtered out and about 80 times the speed when no genes were filtered out.


Addiction Biology | 2017

An association study revealed substantial effects of dominance, epistasis and substance dependence co-morbidity on alcohol dependence symptom count

Gang Chen; Futao Zhang; Wenda Xue; Ruyan Wu; Haiming Xu; Kesheng Wang; Jun Zhu

Alcohol dependence is a complex disease involving polygenes, environment and their interactions. Inadequate consideration of these interactions may have hampered the progress on genome‐wide association studies of alcohol dependence. By using the dataset of the Study of Addiction: Genetics and Environment with 3838 subjects, we conducted a genome‐wide association studies of alcohol dependence symptom count (ADSC) with a full genetic model considering additive, dominance, epistasis and their interactions with ethnicity, as well as conditions of co‐morbid substance dependence. Twenty quantitative trait single nucleotide polymorphisms (QTSs) showed highly significant associations with ADSC, including four previously reported genes (ADH1C, PKNOX2, CPE and KCNB2) and the reported intergenic rs1363605, supporting the overall validity of the analysis. Two QTSs within or near ADH1C showed very strong association in a dominance inheritance mode and increased the phenotype value of ADSC when the effect of co‐morbid opiate or marijuana dependence was controlled. Highly significant association was also identified in variants within four novel genes (RGS6, FMN1, NRM and BPTF), two non‐coding RNA and two epistasis loci. QTS rs7616413, located near PTPRG encoding a protein tyrosine phosphatase receptor, interacted with rs10090742 within ANGPT1 encoding a protein tyrosine phosphatase in an additive × additive or dominance × additive manner. The detected QTSs contributed to about 20 percent of total heritability, in which dominance and epistasis effects accounted for over 50 percent. These results demonstrated that perturbations arising from gene–gene interaction and conditions of co‐morbidity substantially influence the genetic architecture of complex trait.


Nature Communications | 2018

Integrative analysis of omics summary data reveals putative mechanisms underlying complex traits

Yang Wu; Jian Zeng; Futao Zhang; Zhihong Zhu; Ting Qi; Zhili Zheng; Luke R. Lloyd-Jones; Riccardo E. Marioni; Nicholas G. Martin; Grant W. Montgomery; Ian J. Deary; Naomi R. Wray; Peter M. Visscher; Allan F. McRae; Jian Yang

The identification of genes and regulatory elements underlying the associations discovered by GWAS is essential to understanding the aetiology of complex traits (including diseases). Here, we demonstrate an analytical paradigm of prioritizing genes and regulatory elements at GWAS loci for follow-up functional studies. We perform an integrative analysis that uses summary-level SNP data from multi-omics studies to detect DNA methylation (DNAm) sites associated with gene expression and phenotype through shared genetic effects (i.e., pleiotropy). We identify pleiotropic associations between 7858 DNAm sites and 2733 genes. These DNAm sites are enriched in enhancers and promoters, and >40% of them are mapped to distal genes. Further pleiotropic association analyses, which link both the methylome and transcriptome to 12 complex traits, identify 149 DNAm sites and 66 genes, indicating a plausible mechanism whereby the effect of a genetic variant on phenotype is mediated by genetic regulation of transcription through DNAm.The identification of the causal gene at a GWAS locus remains to be a challenging task. Here, using the SMR & HEIDI method to integrate GWAS, eQTL and mQTL data, Wu et al. map DNA methylation sites to the transcriptome and thereby prioritize functionally relevant genes for 12 human complex traits.


PLOS Genetics | 2016

Functional Regression Models for Epistasis Analysis of Multiple Quantitative Traits.

Futao Zhang; Dan Xie; Meimei Liang; Momiao Xiong

To date, most genetic analyses of phenotypes have focused on analyzing single traits or analyzing each phenotype independently. However, joint epistasis analysis of multiple complementary traits will increase statistical power and improve our understanding of the complicated genetic structure of the complex diseases. Despite their importance in uncovering the genetic structure of complex traits, the statistical methods for identifying epistasis in multiple phenotypes remains fundamentally unexplored. To fill this gap, we formulate a test for interaction between two genes in multiple quantitative trait analysis as a multiple functional regression (MFRG) in which the genotype functions (genetic variant profiles) are defined as a function of the genomic position of the genetic variants. We use large-scale simulations to calculate Type I error rates for testing interaction between two genes with multiple phenotypes and to compare the power with multivariate pairwise interaction analysis and single trait interaction analysis by a single variate functional regression model. To further evaluate performance, the MFRG for epistasis analysis is applied to five phenotypes of exome sequence data from the NHLBI’s Exome Sequencing Project (ESP) to detect pleiotropic epistasis. A total of 267 pairs of genes that formed a genetic interaction network showed significant evidence of epistasis influencing five traits. The results demonstrate that the joint interaction analysis of multiple phenotypes has a much higher power to detect interaction than the interaction analysis of a single trait and may open a new direction to fully uncovering the genetic structure of multiple phenotypes.


Nature Communications | 2018

Genome-wide association analyses identify 143 risk variants and putative regulatory mechanisms for type 2 diabetes

Angli Xue; Yang Wu; Zhihong Zhu; Futao Zhang; Kathryn E. Kemper; Zhili Zheng; Loic Yengo; Luke R. Lloyd-Jones; Julia Sidorenko; Yeda Wu; Allan F. McRae; Peter M. Visscher; Jian Zeng; Jian Yang

Type 2 diabetes (T2D) is a very common disease in humans. Here we conduct a meta-analysis of genome-wide association studies (GWAS) with ~16 million genetic variants in 62,892 T2D cases and 596,424 controls of European ancestry. We identify 139 common and 4 rare variants associated with T2D, 42 of which (39 common and 3 rare variants) are independent of the known variants. Integration of the gene expression data from blood (n = 14,115 and 2765) with the GWAS results identifies 33 putative functional genes for T2D, 3 of which were targeted by approved drugs. A further integration of DNA methylation (n = 1980) and epigenomic annotation data highlight 3 genes (CAMK1D, TP53INP1, and ATP5G1) with plausible regulatory mechanisms, whereby a genetic variant exerts an effect on T2D through epigenetic regulation of gene expression. Our study uncovers additional loci, proposes putative genetic regulatory mechanisms for T2D, and provides evidence of purifying selection for T2D-associated variants.GWAS have so far identified 129 loci associated with type 2 diabetes (T2D). Here, the authors meta-analyse three large T2D GWA studies which uncovers 42 additional loci, further prioritize 33 functional genes using eQTL and mQTL data and propose regulatory mechanisms for three putative T2D genes.


bioRxiv | 2018

Novel susceptibility loci and genetic regulation mechanisms for type 2 diabetes

Angli Xue; Yang Wu; Zhihong Zhu; Futao Zhang; Kathryn E. Kemper; Zhili Zheng; Loic Yengo; Luke R. Lloyd-Jones; Julia Sidorenko; Yeda Wu; Allan F. McRae; Peter M. Visscher; Jian Zeng; Jian Yang

We conducted a meta-analysis of genome-wide association studies (GWAS) with ∼16 million genotyped/imputed genetic variants in 62,892 type 2 diabetes (T2D) cases and 596,424 controls of European ancestry. We identified 139 common and 4 rare (minor allele frequency < 0.01) variants associated with T2D, 42 of which (39 common and 3 rare variants) were independent of the known variants. Integration of the gene expression data from blood (n = 14,115 and 2,765) and other T2D-relevant tissues (n = up to 385) with the GWAS results identified 33 putative functional genes for T2D, three of which were targeted by approved drugs. A further integration of DNA methylation (n = 1,980) and epigenomic annotations data highlighted three putative T2D genes (CAMK1D, TP53INP1 and ATP5G1) with plausible regulatory mechanisms whereby a genetic variant exerts an effect on T2D through epigenetic regulation of gene expression. We further found evidence that the T2D-associated loci have been under purifying selection.

Collaboration


Dive into the Futao Zhang's collaboration.

Top Co-Authors

Avatar

Jian Yang

University of Queensland

View shared research outputs
Top Co-Authors

Avatar

Zhihong Zhu

University of Queensland

View shared research outputs
Top Co-Authors

Avatar
Top Co-Authors

Avatar

Allan F. McRae

University of Queensland

View shared research outputs
Top Co-Authors

Avatar

Naomi R. Wray

University of Queensland

View shared research outputs
Top Co-Authors

Avatar

Yang Wu

University of Queensland

View shared research outputs
Top Co-Authors

Avatar

Jian Zeng

University of Queensland

View shared research outputs
Top Co-Authors

Avatar

Zhili Zheng

Wenzhou Medical College

View shared research outputs
Top Co-Authors

Avatar
Top Co-Authors

Avatar

Angli Xue

University of Queensland

View shared research outputs
Researchain Logo
Decentralizing Knowledge