Kaixiong Ye
Cornell University
Network
Latest external collaboration on country level. Dive into details by clicking on the dots.
Publication
Featured researches published by Kaixiong Ye.
Proceedings of the National Academy of Sciences of the United States of America | 2014
Kaixiong Ye; Jian Lu; Fei Ma; Alon Keinan; Zhenglong Gu
Significance There are hundreds to thousands of copies of mitochondrial DNA (mtDNA) in each human cell in contrast to only two copies of nuclear DNA. High-frequency pathogenic mtDNA mutations have been found in patients with classic mitochondrial diseases, premature aging, cancers, and neurodegenerative diseases. In this study we investigated the distribution of heteroplasmic mutations, their pathogenic potential, and their underlying evolutionary forces using genome sequence data from the 1000 Genomes Project. Our results demonstrated the prevalence of low-frequency high-pathogenic-potential mtDNA mutations in healthy human individuals. These deleterious mtDNA mutations, when reaching high frequency, could provide a likely source of mitochondrial dysfunction. Managing the expansion of deleterious mtDNA mutations could be a promising means of preventing disease progression. A majority of mitochondrial DNA (mtDNA) mutations reported to be implicated in diseases are heteroplasmic, a status with coexisting mtDNA variants in a single cell. Quantifying the prevalence of mitochondrial heteroplasmy and its pathogenic effect in healthy individuals could further our understanding of its possible roles in various diseases. A total of 1,085 human individuals from 14 global populations have been sequenced by the 1000 Genomes Project to a mean coverage of ∼2,000× on mtDNA. Using a combination of stringent thresholds and a maximum-likelihood method to define heteroplasmy, we demonstrated that ∼90% of the individuals carry at least one heteroplasmy. At least 20% of individuals harbor heteroplasmies reported to be implicated in disease. Mitochondrial heteroplasmy tend to show high pathogenicity, and is significantly overrepresented in disease-associated loci. Consistent with their deleterious effect, heteroplasmies with derived allele frequency larger than 60% within an individual show a significant reduction in pathogenicity, indicating the action of purifying selection. Purifying selection on heteroplasmies can also be inferred from nonsynonymous and synonymous heteroplasmy comparison and the unfolded site frequency spectra for different functional sites in mtDNA. Nevertheless, in comparison with population polymorphic mtDNA mutations, the purifying selection is much less efficient in removing heteroplasmic mutations. The prevalence of mitochondrial heteroplasmy with high pathogenic potential in healthy individuals, along with the possibility of these mutations drifting to high frequency inside a subpopulation of cells across lifespan, emphasizes the importance of managing mitochondrial heteroplasmy to prevent disease progression.
Molecular Biology and Evolution | 2016
Kumar S.D. Kothapalli; Kaixiong Ye; Maithili S. Gadgil; Susan E. Carlson; Kimberly O. O’Brien; Ji Yao Zhang; Hui Gyu Park; Kinsley Ojukwu; James Zou; Stephanie S. Hyon; Kalpana Joshi; Zhenglong Gu; Alon Keinan; J. Thomas Brenna
Long chain polyunsaturated fatty acids (LCPUFA) are bioactive components of membrane phospholipids and serve as substrates for signaling molecules. LCPUFA can be obtained directly from animal foods or synthesized endogenously from 18 carbon precursors via the FADS2 coded enzyme. Vegans rely almost exclusively on endogenous synthesis to generate LCPUFA and we hypothesized that an adaptive genetic polymorphism would confer advantage. The rs66698963 polymorphism, a 22-bp insertion–deletion within FADS2, is associated with basal FADS1 expression, and coordinated induction of FADS1 and FADS2 in vitro. Here, we determined rs66698963 genotype frequencies from 234 individuals of a primarily vegetarian Indian population and 311 individuals from the US. A much higher I/I genotype frequency was found in Indians (68%) than in the US (18%). Analysis using 1000 Genomes Project data confirmed our observation, revealing a global I/I genotype of 70% in South Asians, 53% in Africans, 29% in East Asians, and 17% in Europeans. Tests based on population divergence, site frequency spectrum, and long-range haplotype consistently point to positive selection encompassing rs66698963 in South Asian, African, and some East Asian populations. Basal plasma phospholipid arachidonic acid (ARA) status was 8% greater in I/I compared with D/D individuals. The biochemical pathway product–precursor difference, ARA minus linoleic acid, was 31% and 13% greater for I/I and I/D compared with D/D, respectively. This study is consistent with previous in vitro data suggesting that the insertion allele enhances n-6 LCPUFA synthesis and may confer an adaptive advantage in South Asians because of the traditional plant-based diet practice.
Journal of Biological Chemistry | 2011
Zhen Xue; Yin He; Kaixiong Ye; Zhenglong Gu; Yuxin Mao; Ling Qi
Inositol-requiring enzyme 1α (IRE1α), an endoplasmic reticulum-resident sensor for mammalian unfolded protein response, is a bifunctional enzyme containing kinase and RNase domains critical for trans-autophosphorylation and Xbp1 mRNA splicing, respectively, in response to endoplasmic reticulum stress. However, the amino acid residues important for its function and activation remain largely unexplored. Here, through analysis of IRE1α mutants associated with human somatic cancers, we have identified a highly conserved proline residue at position 830 (Pro830) that is critical for its structural integrity and hence, the activation of both kinase and RNase domains. Structural analysis revealed that Pro830 may form a highly conserved structural linker with adjacent tryptophan and tyrosine residues at positions 833 and 945 (Trp833 and Tyr945), thereby bridging the kinase and RNase domains. Indeed, mutation of Pro830 to leucine (P830L) completely abolished the kinase and RNase activities, significantly decreased protein stability, and prevented oligomerization of IRE1α upon ER stress; similar observations were made for mutations of Trp833 to alanine (W833A) and to a lesser extent for Y945A. Our finding may facilitate the identification of small molecules to compromise IRE1α function specifically.
Advances in Nutrition | 2011
Kaixiong Ye; Zhenglong Gu
Dietary transitions in human history have been suggested to play important roles in the evolution of mankind. Genetic variations caused by adaptation to diet during human evolution could have important health consequences in current society. The advance of sequencing technologies and the rapid accumulation of genome information provide an unprecedented opportunity to comprehensively characterize genetic variations in human populations and unravel the genetic basis of human evolution. Series of selection detection methods, based on various theoretical models and exploiting different aspects of selection signatures, have been developed. Their applications at the species and population levels have respectively led to the identification of human specific selection events that distinguish human from nonhuman primates and local adaptation events that contribute to human diversity. Scrutiny of candidate genes has revealed paradigms of adaptations to specific nutritional components and genome-wide selection scans have verified the prevalence of diet-related selection events and provided many more candidates awaiting further investigation. Understanding the role of diet in human evolution is fundamental for the development of evidence-based, genome-informed nutritional practices in the era of personal genomics.
Gene | 2015
Rex Lei; Kaixiong Ye; Zhenglong Gu; Xuepeng Sun
RNA-seq is increasingly used to study gene expression of various organisms. While it provides a great opportunity to explore genome-scale transcriptional patterns with tremendous depth, it comes with prohibitive costs. Establishing a minimal sequencing depth for required accuracy will guide cost-effective experimental design and promote the routine application of RNA-seq. To address this issue, we selected 36 RNA-seq datasets, each with more than 20 million reads from six widely-used model organisms: Saccharomyces cerevisiae, Homo sapiens, Drosophila melanogaster, Caenorhabditis elegans, Mus musculus, and Arabidopsis thaliana, and investigated statistical correlations between the sequencing depth and the outcome accuracy. To achieve this, we randomly chose reads from each dataset, mapped them to the reference genomes, and analyzed the accuracy achieved with varying coverage. Our results indicated that as low as one million reads can provide the same sequencing accuracy in transcript abundance (r=0.99) as >30 million reads for highly-expressed genes in all six species. Because many metabolically and pathologically-relevant genes are highly expressed, our findings might be instructive for cost-effective experimental designs in NGS-based research and also provide useful guidance to similar research for other organisms.
Journal of Translational Medicine | 2016
Paul Billing-Ross; Arnaud Germain; Kaixiong Ye; Alon Keinan; Zhenglong Gu; Maureen R. Hanson
BackgroundMitochondrial dysfunction has been hypothesized to occur in Myalgic Encephalomyelitis/Chronic Fatigue Syndrome (ME/CFS), a disease characterized by fatigue, cognitive difficulties, pain, malaise, and exercise intolerance. We investigated whether haplogroup, single nucleotide polymorphisms (SNPs), or heteroplasmy of mitochondrial DNA (mtDNA) were associated with health status and/or symptoms.MethodsIllumina sequencing of PCR-amplified mtDNA was performed to analyze sequence and extent of heteroplasmy of mtDNAs of 193 cases and 196 age- and gender-matched controls from DNA samples collected by the Chronic Fatigue Initiative. Association testing was carried out to examine possible correlations of mitochondrial sequences with case/control status and symptom constellation and severity as reported by subjects on Short Form-36 and DePaul Symptom Questionnaires.ResultsNo ME/CFS subject exhibited known disease-causing mtDNA mutations. Extent of heteroplasmy was low in all subjects. Although no association between mtDNA SNPs and ME/CFS vs. healthy status was observed, haplogroups J, U and H as well as eight SNPs in ME/CFS cases were significantly associated with individual symptoms, symptom clusters, or symptom severity.ConclusionsAnalysis of mitochondrial genomes in ME/CFS cases indicates that individuals of a certain haplogroup or carrying specific SNPs are more likely to exhibit certain neurological, inflammatory, and/or gastrointestinal symptoms. No increase in susceptibility to ME/CFS of individuals carrying particular mitochondrial genomes or SNPs was observed.
Nature Ecology and Evolution | 2017
Kaixiong Ye; Feng Gao; David Wang; Ofer Bar-Yosef; Alon Keinan
Fatty acid desaturase (FADS) genes encode rate-limiting enzymes for the biosynthesis of omega-6 and omega-3 long chain polyunsaturated fatty acids (LCPUFAs). This biosynthesis is essential for individuals subsisting on LCPUFAs-poor diets (e.g. plant-based). Positive selection on FADS genes has been reported in multiple populations, but its presence and pattern in Europeans remain elusive. Here, using ancient and modern DNA, we demonstrate that positive selection acted on the same FADS variants both before and after the advent of farming in Europe, but on opposite (i.e. alternative) alleles. Selection in recent farmers also varied geographically, with the strongest signal in Southern Europe. These varying selection patterns concur with anthropological evidence of varying diets, and with the association of farming-adaptive alleles with higher FADS1 expression and thus enhanced LCPUFAs biosynthesis. Genome-wide association studies reveal that farming-adaptive alleles not only increase LCPUFAs, but also affect other lipid levels and protect against several inflammatory diseases.
Genome Biology and Evolution | 2013
Kaixiong Ye; Jian Lu; Srilakshmi Raj; Zhenglong Gu
Expression quantitative trait loci (eQTLs) have been found to be enriched in trait-associated single-nucleotide polymorphisms (SNPs). However, whether eQTLs are adaptive to different environmental factors and its relative evolutionary significance compared with nonsynonymous SNPs (NS SNPs) are still elusive. Compiling environmental correlation data from three studies for more than 500,000 SNPs and 42 environmental factors, including climate, subsistence, pathogens, and dietary patterns, we performed a systematic examination of the adaptive patterns of eQTLs to local environment. Compared with intergenic SNPs, eQTLs are significantly enriched in the lower tail of a transformed rank statistic in the environmental correlation analysis, indicating possible adaptation of eQTLs to the majority of 42 environmental factors. The mean enrichment of eQTLs across 42 environmental factors is as great as, if not greater than, that of NS SNPs. The enrichment of eQTLs, although significant across all levels of recombination rate, is inversely correlated with recombination rate, suggesting the presence of selective sweep or background selection. Further pathway enrichment analysis identified a number of pathways with possible environmental adaption from eQTLs. These pathways are mostly related with immune function and metabolism. Our results indicate that eQTLs might have played an important role in recent and ongoing human adaptation and are of special importance for some environmental factors and biological pathways.
Proceedings of the National Academy of Sciences of the United States of America | 2014
Kaixiong Ye; Jian Lu; Alon Keinan; Zhenglong Gu
In their comment on our report (1), Just et al. suggest that sample contamination explained mitochondrial DNA (mtDNA) heteroplasmy identified in some individuals (2). The authors further question the validity of our conclusions and the reliability of using massively parallel sequencing (MPS) to detect low-frequency heteroplasmy. We systematically evaluated the presence and impact of contamination and found that it only affects a small fraction of all individuals, leaving our original conclusions unchanged.
bioRxiv | 2017
Kaixiong Ye; Feng Gao; David Wang; Ofer Bar-Yosef; Alon Keinan
Fatty acid desaturase (FADS) genes encode rate-limiting enzymes for the biosynthesis of omega-6 and omega-3 long chain polyunsaturated fatty acids (LCPUFAs). This biosynthesis is essential for individuals subsisting on LCPUFAs-poor, plant-based diets. Positive selection on FADS genes has been reported in multiple populations, but its presence and pattern in Europeans remain elusive. Here, with analyses of ancient and modern DNA, we demonstrated that positive selection acted on the same FADS variants both before and after the advent of farming in Europe, but on opposite alleles. Selection in recent farmers also varied geographically, with the strongest signal in Southern Europe. These varying selection patterns concur with anthropological evidence of differences in diets, and with the association of recently-adaptive alleles with higher FADS1 expression and enhanced LCPUFAs biosynthesis. Genome-wide association studies revealed associations of recently-adaptive alleles with not only LCPUFAs, but also other lipids and decreased risk of several inflammation-related diseases.The FADS gene family encodes rate-limiting enzymes for the biosynthesis of omega-6 and omega-3 long chain polyunsaturated fatty acids (LCPUFAs), which is essential for individuals subsisting on LCPUFAs-poor diets (e.g. plant-based). Positive selection on FADS genes has been reported in multiple populations, but its presence and pattern in Europeans remain elusive. Here, with analyses of ancient and modern DNA, we demonstrated positive selection acted on variants centered on FADS1 and FADS2 both before and after the advent of farming in Europe, but adaptive alleles in these two periods are opposite. Selection signals in recent history also vary geographically, with the strongest in Southern Europe. We showed that adaptive alleles in recent farmers are associated with expression of FADS genes, enhanced LCPUFAs biosynthesis and reduced risk of inflammatory bowel diseases. Thus, the adaptation of FADS genes in Europe varies across time and geography, probably due to varying diet and subsistence.