Yu S. Huang
University of Southern California
Network
Latest external collaboration on country level. Dive into details by clicking on the dots.
Publication
Featured researches published by Yu S. Huang.
Nature | 2010
Susanna Atwell; Yu S. Huang; Bjarni J. Vilhjálmsson; Glenda Willems; Matthew Horton; Yan Li; Dazhe Meng; Alexander Platt; Aaron M. Tarone; Tina T. Hu; Rong Jiang; N. Wayan Muliyati; Xu Zhang; Muhammad Ali Amer; Ivan Baxter; Benjamin Brachi; Joanne Chory; Caroline Dean; Marilyne Debieu; Juliette de Meaux; Joseph R. Ecker; Nathalie Faure; Joel M. Kniskern; Jonathan D. G. Jones; Todd P. Michael; Adnane Nemri; Fabrice Roux; David E. Salt; Chunlao Tang; Marco Todesco
Although pioneered by human geneticists as a potential solution to the challenging problem of finding the genetic basis of common human diseases, genome-wide association (GWA) studies have, owing to advances in genotyping and sequencing technology, become an obvious general approach for studying the genetics of natural variation and traits of agricultural importance. They are particularly useful when inbred lines are available, because once these lines have been genotyped they can be phenotyped multiple times, making it possible (as well as extremely cost effective) to study many different traits in many different environments, while replicating the phenotypic measurements to reduce environmental noise. Here we demonstrate the power of this approach by carrying out a GWA study of 107 phenotypes in Arabidopsis thaliana, a widely distributed, predominantly self-fertilizing model plant known to harbour considerable genetic variation for many adaptively important traits. Our results are dramatically different from those of human GWA studies, in that we identify many common alleles of major effect, but they are also, in many cases, harder to interpret because confounding by complex genetics and population structure make it difficult to distinguish true associations from false. However, a-priori candidates are significantly over-represented among these associations as well, making many of them excellent candidates for follow-up experiments. Our study demonstrates the feasibility of GWA studies in A. thaliana and suggests that the approach will be appropriate for many other organisms.
Nature Genetics | 2012
Matthew Horton; Angela M. Hancock; Yu S. Huang; Christopher Toomajian; Susanna Atwell; Adam Auton; N. Wayan Muliyati; Alexander Platt; F. Gianluca Sperone; Bjarni J. Vilhjálmsson; Magnus Nordborg; Justin O. Borevitz; Joy Bergelson
Arabidopsis thaliana is native to Eurasia and is naturalized across the world. Its ability to be easily propagated and its high phenotypic variability make it an ideal model system for functional, ecological and evolutionary genetics. To date, analyses of the natural genetic variation of A. thaliana have involved small numbers of individual plants or genetic markers. Here we genotype 1,307 worldwide accessions, including several regional samples, using a 250K SNP chip. This allowed us to produce a high-resolution description of the global pattern of genetic variation. We applied three complementary selection tests and identified new targets of selection. Further, we characterized the pattern of historical recombination in A. thaliana and observed an enrichment of hotspots in its intergenic regions and repetitive DNA, which is consistent with the pattern that is observed for humans but which is strikingly different from that observed in other plant species. We have made the seeds we used to produce this Regional Mapping (RegMap) panel publicly available. This panel comprises one of the largest genomic mapping resources currently available for global natural isolates of a non-human species.
intelligent systems in molecular biology | 2005
Haiyan Hu; Xifeng Yan; Yu S. Huang; Jiawei Han; Xianghong Jasmine Zhou
MOTIVATION The rapid accumulation of biological network data translates into an urgent need for computational methods for graph pattern mining. One important problem is to identify recurrent patterns across multiple networks to discover biological modules. However, existing algorithms for frequent pattern mining become very costly in time and space as the pattern sizes and network numbers increase. Currently, no efficient algorithm is available for mining recurrent patterns across large collections of genome-wide networks. RESULTS We developed a novel algorithm, CODENSE, to efficiently mine frequent coherent dense subgraphs across large numbers of massive graphs. Compared with previous methods, our approach is scalable in the number and size of the input graphs and adjustable in terms of exact or approximate pattern mining. Applying CODENSE to 39 co-expression networks derived from microarray datasets, we discovered a large number of functionally homogeneous clusters and made functional predictions for 169 uncharacterized yeast genes. AVAILABILITY http://zhoulab.usc.edu/CODENSE/
PLOS Genetics | 2010
Alexander Platt; Matthew Horton; Yu S. Huang; Yan Li; Alison E. Anastasio; Ni Wayan Mulyati; Jon Ågren; Oliver Bossdorf; Diane L. Byers; Kathleen Donohue; Megan Dunning; Eric B. Holub; Andrew Hudson; Valérie Le Corre; Olivier Loudet; Fabrice Roux; Norman Warthmann; Detlef Weigel; Luz Rivero; Randy Scholl; Magnus Nordborg; Joy Bergelson; Justin O. Borevitz
The population structure of an organism reflects its evolutionary history and influences its evolutionary trajectory. It constrains the combination of genetic diversity and reveals patterns of past gene flow. Understanding it is a prerequisite for detecting genomic regions under selection, predicting the effect of population disturbances, or modeling gene flow. This paper examines the detailed global population structure of Arabidopsis thaliana. Using a set of 5,707 plants collected from around the globe and genotyped at 149 SNPs, we show that while A. thaliana as a species self-fertilizes 97% of the time, there is considerable variation among local groups. This level of outcrossing greatly limits observed heterozygosity but is sufficient to generate considerable local haplotypic diversity. We also find that in its native Eurasian range A. thaliana exhibits continuous isolation by distance at every geographic scale without natural breaks corresponding to classical notions of populations. By contrast, in North America, where it exists as an exotic species, A. thaliana exhibits little or no population structure at a continental scale but local isolation by distance that extends hundreds of km. This suggests a pattern for the development of isolation by distance that can establish itself shortly after an organism fills a new habitat range. It also raises questions about the general applicability of many standard population genetics models. Any model based on discrete clusters of interchangeable individuals will be an uneasy fit to organisms like A. thaliana which exhibit continuous isolation by distance on many scales.
Nature | 2010
Marco Todesco; Sureshkumar Balasubramanian; Tina T. Hu; M. Brian Traw; Matthew Horton; Petra Epple; Christine Kuhns; Sridevi Sureshkumar; Christopher J. Schwartz; Christa Lanz; Roosa A. E. Laitinen; Yu S. Huang; Joanne Chory; Volker Lipka; Justin O. Borevitz; Jeffery L. Dangl; Joy Bergelson; Magnus Nordborg; Detlef Weigel
Plants can defend themselves against a wide array of enemies, from microbes to large animals, yet there is great variability in the effectiveness of such defences, both within and between species. Some of this variation can be explained by conflicting pressures from pathogens with different modes of attack. A second explanation comes from an evolutionary ‘tug of war’, in which pathogens adapt to evade detection, until the plant has evolved new recognition capabilities for pathogen invasion. If selection is, however, sufficiently strong, susceptible hosts should remain rare. That this is not the case is best explained by costs incurred from constitutive defences in a pest-free environment. Using a combination of forward genetics and genome-wide association analyses, we demonstrate that allelic diversity at a single locus, ACCELERATED CELL DEATH 6 (ACD6), underpins marked pleiotropic differences in both vegetative growth and resistance to microbial infection and herbivory among natural Arabidopsis thaliana strains. A hyperactive ACD6 allele, compared to the reference allele, strongly enhances resistance to a broad range of pathogens from different phyla, but at the same time slows the production of new leaves and greatly reduces the biomass of mature leaves. This allele segregates at intermediate frequency both throughout the worldwide range of A. thaliana and within local populations, consistent with this allele providing substantial fitness benefits despite its marked impact on growth.
PLOS Genetics | 2010
Ivan Baxter; Jessica N. Brazelton; Danni Yu; Yu S. Huang; Brett Lahner; Elena Yakubova; Yan Li; Joy Bergelson; Justin O. Borevitz; Magnus Nordborg; Olga Vitek; David E. Salt
The genetic model plant Arabidopsis thaliana, like many plant species, experiences a range of edaphic conditions across its natural habitat. Such heterogeneity may drive local adaptation, though the molecular genetic basis remains elusive. Here, we describe a study in which we used genome-wide association mapping, genetic complementation, and gene expression studies to identify cis-regulatory expression level polymorphisms at the AtHKT1;1 locus, encoding a known sodium (Na+) transporter, as being a major factor controlling natural variation in leaf Na+ accumulation capacity across the global A. thaliana population. A weak allele of AtHKT1;1 that drives elevated leaf Na+ in this population has been previously linked to elevated salinity tolerance. Inspection of the geographical distribution of this allele revealed its significant enrichment in populations associated with the coast and saline soils in Europe. The fixation of this weak AtHKT1;1 allele in these populations is genetic evidence supporting local adaptation to these potentially saline impacted environments.
Proceedings of the National Academy of Sciences of the United States of America | 2010
Yan Li; Yu S. Huang; Joy Bergelson; Magnus Nordborg; Justin O. Borevitz
Flowering time (FT) is the developmental transition coupling an internal genetic program with external local and seasonal climate cues. The genetic loci sensitive to predictable environmental signals underlie local adaptation. We dissected natural variation in FT across a new global diversity set of 473 unique accessions, with >12,000 plants across two seasonal plantings in each of two simulated local climates, Spain and Sweden. Genome-wide association mapping was carried out with 213,497 SNPs. A total of 12 FT candidate quantitative trait loci (QTL) were fine-mapped in two independent studies, including 4 located within ±10 kb of previously cloned FT alleles and 8 novel loci. All QTL show sensitivity to planting season and/or simulated location in a multi-QTL mixed model. Alleles at four QTL were significantly correlated with latitude of origin, implying past selection for faster flowering in southern locations. Finally, maximum seed yield was observed at an optimal FT unique to each season and location, with four FT QTL directly controlling yield. Our results suggest that these major, environmentally sensitive FT QTL play an important role in spatial and temporal adaptation.
intelligent systems in molecular biology | 2007
Yu S. Huang; Haifeng Li; Haiyan Hu; Xifeng Yan; Michael S. Waterman; Haiyan Huang; Xianghong Jasmine Zhou
MOTIVATION The rapid accumulation of microarray datasets provides unique opportunities to perform systematic functional characterization of the human genome. We designed a graph-based approach to integrate cross-platform microarray data, and extract recurrent expression patterns. A series of microarray datasets can be modeled as a series of co-expression networks, in which we search for frequently occurring network patterns. The integrative approach provides three major advantages over the commonly used microarray analysis methods: (1) enhance signal to noise separation (2) identify functionally related genes without co-expression and (3) provide a way to predict gene functions in a context-specific way. RESULTS We integrate 65 human microarray datasets, comprising 1105 experiments and over 11 million expression measurements. We develop a data mining procedure based on frequent itemset mining and biclustering to systematically discover network patterns that recur in at least five datasets. This resulted in 143,401 potential functional modules. Subsequently, we design a network topology statistic based on graph random walk that effectively captures characteristics of a genes local functional environment. Function annotations based on this statistic are then subject to the assessment using the random forest method, combining six other attributes of the network modules. We assign 1126 functions to 895 genes, 779 known and 116 unknown, with a validation accuracy of 70%. Among our assignments, 20% genes are assigned with multiple functions based on different network environments. AVAILABILITY http://zhoulab.usc.edu/ContextAnnotation.
PLOS Genetics | 2012
Dai-Yin Chao; Adriano Silva; Ivan Baxter; Yu S. Huang; Magnus Nordborg; John Danku; Brett Lahner; Elena Yakubova; David E. Salt
Understanding the mechanism of cadmium (Cd) accumulation in plants is important to help reduce its potential toxicity to both plants and humans through dietary and environmental exposure. Here, we report on a study to uncover the genetic basis underlying natural variation in Cd accumulation in a world-wide collection of 349 wild collected Arabidopsis thaliana accessions. We identified a 4-fold variation (0.5–2 µg Cd g−1 dry weight) in leaf Cd accumulation when these accessions were grown in a controlled common garden. By combining genome-wide association mapping, linkage mapping in an experimental F2 population, and transgenic complementation, we reveal that HMA3 is the sole major locus responsible for the variation in leaf Cd accumulation we observe in this diverse population of A. thaliana accessions. Analysis of the predicted amino acid sequence of HMA3 from 149 A. thaliana accessions reveals the existence of 10 major natural protein haplotypes. Association of these haplotypes with leaf Cd accumulation and genetics complementation experiments indicate that 5 of these haplotypes are active and 5 are inactive, and that elevated leaf Cd accumulation is associated with the reduced function of HMA3 caused by a nonsense mutation and polymorphisms that change two specific amino acids.
Proceedings of the National Academy of Sciences of the United States of America | 2010
Adnane Nemri; Susanna Atwell; Aaron M. Tarone; Yu S. Huang; Keyan Zhao; David J. Studholme; Magnus Nordborg; Jonathan D. G. Jones
The model plant Arabidopsis thaliana exhibits extensive natural variation in resistance to parasites. Immunity is often conferred by resistance (R) genes that permit recognition of specific races of a disease. The number of such R genes and their distribution are poorly understood. In this study, we investigated the basis for resistance to the downy mildew agent Hyaloperonospora arabidopsidis ex parasitica (Hpa) in a global sample of A. thaliana. We implemented a combined genome-wide mapping of resistance using populations of recombinant inbred lines and a collection of wild A. thaliana accessions. We tested the interaction between 96 host genotypes collected worldwide and five strains of Hpa. Then, a fraction of the species-wide resistance was genetically dissected using six recently constructed populations of recombinant inbred lines. We found that resistance is usually governed by single dominant R genes that are concentrated in four genomic regions only. We show that association genetics of resistance to diseases such as downy mildew enables increased mapping resolution from quantitative trait loci interval to candidate gene level. Association patterns in quantitative trait loci intervals indicate that the pool of A. thaliana resistance sources against the tested Hpa isolates may be predominantly confined to six RPP (Resistance to Hpa) loci isolated in previous studies. Our results suggest that combining association and linkage mapping could accelerate resistance gene discovery in plants.