Yue-Yu Hang
Chinese Academy of Sciences
Network
Latest external collaboration on country level. Dive into details by clicking on the dots.
Publication
Featured researches published by Yue-Yu Hang.
Plant Physiology | 2014
Zhu-Qing Shao; Yan-Mei Zhang; Yue-Yu Hang; Jia-Yu Xue; Guang-Can Zhou; Ping Wu; Xiao-Yi Wu; Xun-Zong Wu; Qiang Wang; Bin Wang; Jian-Qun Chen
During the past 54 million years of evolution, 94% of ancestral legume leucine-rich-repeat gene lineages have experienced deletions or expansions, while 6% have been maintained in a conservative manner. Proper utilization of plant disease resistance genes requires a good understanding of their short- and long-term evolution. Here we present a comprehensive study of the long-term evolutionary history of nucleotide-binding site (NBS)-leucine-rich repeat (LRR) genes within and beyond the legume family. The small group of NBS-LRR genes with an amino-terminal RESISTANCE TO POWDERY MILDEW8 (RPW8)-like domain (referred to as RNL) was first revealed as a basal clade sister to both coiled-coil-NBS-LRR (CNL) and Toll/Interleukin1 receptor-NBS-LRR (TNL) clades. Using Arabidopsis (Arabidopsis thaliana) as an outgroup, this study explicitly recovered 31 ancestral NBS lineages (two RNL, 21 CNL, and eight TNL) that had existed in the rosid common ancestor and 119 ancestral lineages (nine RNL, 55 CNL, and 55 TNL) that had diverged in the legume common ancestor. It was shown that, during their evolution in the past 54 million years, approximately 94% (112 of 119) of the ancestral legume NBS lineages experienced deletions or significant expansions, while seven original lineages were maintained in a conservative manner. The NBS gene duplication pattern was further examined. The local tandem duplications dominated NBS gene gains in the total number of genes (more than 75%), which was not surprising. However, it was interesting from our study that ectopic duplications had created many novel NBS gene loci in individual legume genomes, which occurred at a significant frequency of 8% to 20% in different legume lineages. Finally, by surveying the legume microRNAs that can potentially regulate NBS genes, we found that the microRNA-NBS gene interaction also exhibited a gain-and-loss pattern during the legume evolution.
Plant Physiology | 2016
Zhu-Qing Shao; Jia-Yu Xue; Ping Wu; Yan-Mei Zhang; Yue Wu; Yue-Yu Hang; Bin Wang; Jian-Qun Chen
Three ancient classes of genes encoding leucine-rich repeat proteins diverged into at least 23 lineages in the common ancestor of angiosperm, from which all current gene repertoires evolved through dynamic expansion. Nucleotide-binding site-leucine-rich repeat (NBS-LRR) genes make up the largest plant disease resistance gene family (R genes), with hundreds of copies occurring in individual angiosperm genomes. However, the expansion history of NBS-LRR genes during angiosperm evolution is largely unknown. By identifying more than 6,000 NBS-LRR genes in 22 representative angiosperms and reconstructing their phylogenies, we present a potential framework of NBS-LRR gene evolution in the angiosperm. Three anciently diverged NBS-LRR classes (TNLs, CNLs, and RNLs) were distinguished with unique exon-intron structures and DNA motif sequences. A total of seven ancient TNL, 14 CNL, and two RNL lineages were discovered in the ancestral angiosperm, from which all current NBS-LRR gene repertoires were evolved. A pattern of gradual expansion during the first 100 million years of evolution of the angiosperm clade was observed for CNLs. TNL numbers remained stable during this period but were eventually deleted in three divergent angiosperm lineages. We inferred that an intense expansion of both TNL and CNL genes started from the Cretaceous-Paleogene boundary. Because dramatic environmental changes and an explosion in fungal diversity occurred during this period, the observed expansions of R genes probably reflect convergent adaptive responses of various angiosperm families. An ancient whole-genome duplication event that occurred in an angiosperm ancestor resulted in two RNL lineages, which were conservatively evolved and acted as scaffold proteins for defense signal transduction. Overall, the reconstructed framework of angiosperm NBS-LRR gene evolution in this study may serve as a fundamental reference for better understanding angiosperm NBS-LRR genes.
PLOS ONE | 2012
Xiao-Qin Sun; Yingjie Zhu; Jianlin Guo; Bin Peng; Ming-Ming Bai; Yue-Yu Hang
Background Dioscorea is an important plant genus in terms of food supply and pharmaceutical applications. However, its classification and identification are controversial. DNA barcoding is a recent aid to taxonomic identification and uses a short standardized DNA region to discriminate plant species. In this study, the applicability of three candidate DNA barcodes (rbcL, matK, and psbA-trnH) to identify species within Dioscorea was tested. Methodology/Principal Findings One-hundred and forty-eight individual plant samples of Dioscorea, encompassing 38 species, seven varieties and one subspecies, representing majority species distributed in China of this genus, were collected from its main distributing areas. Samples were assessed by PCR amplification, sequence quality, extent of specific genetic divergence, DNA barcoding gap, and the ability to discriminate between species. matK successfully identified 23.26% of all species, compared with 9.30% for rbcL and 11.63% for psbA-trnH. Therefore, matK is recommended as the best DNA barcoding candidate. We found that the combination of two or three loci achieved a higher success rate of species discrimination than one locus alone. However, experimental cost would be much higher if two or three loci, rather than a single locus, were assessed. Conclusions We conclude that matK is a strong, although not perfect, candidate as a DNA barcode for Dioscorea identification. This assessment takes into account both its ability for species discrimination and the cost of experiments.
PLOS ONE | 2011
Bin Wang; Zhu-Qing Shao; Ying Xu; Jing Liu; Yuan Liu; Yue-Yu Hang; Jian-Qun Chen
A correlation method was recently adopted to identify selection-favored ‘optimal’ codons from 675 bacterial genomes. Surprisingly, the identities of these optimal codons were found to track the bacterial GC content, leading to a conclusion that selection would generally shape the codon usages to the same direction as the overall mutation does. Raising several concerns, here we report a thorough comparative study on 203 well-selected bacterial species, which strongly suggest that the previous conclusion is likely an illusion. Firstly, the previous study did not preclude species that are suffering weak or no selection pressures on their codon usages. For these species, as showed in this study, the optimal codon identities are prone to be incorrect and follow GC content. Secondly, the previous study only adopted the correlation method, without considering another method to test the reliability of inferred optimal codons. Actually by definition, optimal codons can also be identified by simply comparing codon usages between high- and low-expression genes. After using both methods to identify optimal codons for the selected species, we obtained highly conflicting results, suggesting at least one method is misleading. Further we found a critical problem of correlation method at the step of calculating gene bias level. Due to a failure of accurately defining the background mutation, the problem would result in wrong optimal codon identities. In other words, partial mutational effects on codon choices were mistakenly regarded as selective influences, leading to incorrect and biased optimal codon identities. Finally, considering the translational dynamics, optimal codons identified by comparison method can be well-explained by tRNA compositions, whereas optimal codons identified by correlation method can not be. For all above reasons, we conclude that real optimal codons actually do not track the genomic GC content, and correlation method is misleading in identifying optimal codons and better be avoided.
Journal of Integrative Plant Biology | 2016
Yan-Mei Zhang; Zhu-Qing Shao; Qiang Wang; Yue-Yu Hang; Jia-Yu Xue; Bin Wang; Jian-Qun Chen
Plant genomes harbor dozens to hundreds of nucleotide-binding site-leucine-rich repeat (NBS-LRR) genes; however, the long-term evolutionary history of these resistance genes has not been fully understood. This study focuses on five Brassicaceae genomes and the Carica papaya genome to explore changes in NBS-LRR genes that have taken place in this Rosid II lineage during the past 72 million years. Various numbers of NBS-LRR genes were identified from Arabidopsis lyrata (198), A. thaliana (165), Brassica rapa (204), Capsella rubella (127), Thellungiella salsuginea (88), and C. papaya (51). In each genome, the identified NBS-LRR genes were found to be unevenly distributed among chromosomes and most of them were clustered together. Phylogenetic analysis revealed that, before and after Brassicaceae speciation events, both toll/interleukin-1 receptor-NBS-LRR (TNL) genes and non-toll/interleukin-1 receptor-NBS-LRR (nTNL) genes exhibited a pattern of first expansion and then contraction, suggesting that both subclasses of NBS-LRR genes were responding to pathogen pressures synchronically. Further, by examining the gain/loss of TNL and nTNL genes at different evolutionary nodes, this study revealed that both events often occurred more drastically in TNL genes. Finally, the phylogeny of nTNL genes suggested that this NBS-LRR subclass is composed of two separate ancient gene types: RPW8-NBS-LRR and Coiled-coil-NBS-LRR.
Gene | 2014
Ping Wu; Zhu-Qing Shao; Xun-Zong Wu; Qiang Wang; Bin Wang; Jian-Qun Chen; Yue-Yu Hang; Jia-Yu Xue
A genome triplication took place in the ancestor of Brassiceae species after the split of the Arabidopsis lineage. The postfragmentation and shuffling of the genome turned the ancestral hexaploid back to diploids and caused the radiation of Brassiceae species. The course of speciation was accompanied by the loss of duplicate genes and also influenced the evolution of retained genes. Of all the genes, those encoding NBS domains are typical R genes that confer resistance to invading pathogens. In this study, using the genome of Arabidopsis thaliana as a reference, we examined the loss/retention of orthologous NBS-encoding loci in the tripled Brassica rapa genome and discovered differential loss/retention frequencies. Further analysis indicated that loci of different retention ratios showed different evolutionary patterns. The loci of classesII and III (maintaining two and three syntenic loci, respectively, multi-loci) show sharper expansions by tandem duplications, have faster evolutionary rates and have more potential to be associated with novel gene functions. On the other hand, the loci that are retained at the minimal rate (keeping only one locus, class I, single locus) showed opposite patterns. Phylogenetic analysis indicated that recombination and translocation events were common among multi-loci in B. rapa, and differential evolutionary patterns between multi- and single-loci are likely the consequence of recombination. Investigations towards other gene families demonstrated different evolutionary characteristics between different gene families. The evolution of genes is more likely determined by the property of each gene family, and the whole genome triplication provided only a specific condition.
Frontiers in Plant Science | 2016
Ping Wu; Yue Wu; Cheng-Chen Liu; Li-Wei Liu; Fang-Fang Ma; Xiao-Yi Wu; Mian Wu; Yue-Yu Hang; Jian-Qun Chen; Zhu-Qing Shao; Bin Wang
A majority of land plants can form symbiosis with arbuscular mycorrhizal (AM) fungi. MicroRNAs (miRNAs) have been implicated to regulate this process in legumes, but their involvement in non-legume species is largely unknown. In this study, by performing deep sequencing of sRNA libraries in tomato roots and comparing with tomato genome, a total of 700 potential miRNAs were predicted, among them, 187 are known plant miRNAs that have been previously deposited in miRBase. Unlike the profiles in other plants such as rice and Arabidopsis, a large proportion of predicted tomato miRNAs was 24 nt in length. A similar pattern was observed in the potato genome but not in tobacco, indicating a Solanum genus-specific expansion of 24-nt miRNAs. About 40% identified tomato miRNAs showed significantly altered expressions upon Rhizophagus irregularis inoculation, suggesting the potential roles of these novel miRNAs in AM symbiosis. The differential expression of five known and six novel miRNAs were further validated using qPCR analysis. Interestingly, three up-regulated known tomato miRNAs belong to a known miR171 family, a member of which has been reported in Medicago truncatula to regulate AM symbiosis. Thus, the miR171 family likely regulates AM symbiosis conservatively across different plant lineages. More than 1000 genes targeted by potential AM-responsive miRNAs were provided and their roles in AM symbiosis are worth further exploring.
Molecular Genetics and Genomics | 2016
Haisong Guo; Yan-Mei Zhang; Xiao-Qin Sun; Mi-Mi Li; Yue-Yu Hang; Jia-Yu Xue
Very long-chain fatty acids (VLCFAs) play an important role in the survival and development of plants, and VLCFA synthesis is regulated by β-ketoacyl-CoA synthases (KCSs), which catalyze the condensation of an acyl-CoA with malonyl-CoA. Here, we present a genome-wide survey of the genes encoding these enzymes, KCS genes, in 28 species (26 genomes and two transcriptomes), which represents a large phylogenetic scale, and also reconstruct the evolutionary history of this gene family. KCS genes were initially single-copy genes in the green plant lineage; duplication resulted in five ancestral copies in land plants, forming five fundamental monophyletic groups in the phylogenetic tree. Subsequently, KCS genes duplicated to generate 11 genes of angiosperm origin, expanding up to 20–30 members in further-diverged angiosperm species. During this process, tandem duplications had only a small contribution, whereas polyploidy events and large-scale segmental duplications appear to be the main driving force. Accompanying this expansion were variations that led to the sub- and neofunctionalization of different members, resulting in specificity that is likely determined by the 3-D protein structure. Novel functions involved in other physiological processes emerged as well, though redundancy is also observed, largely among recent duplications. Conserved sites and variable sites of KCS proteins are also identified by statistical analysis. The variable sites are likely to be involved in the emergence of product specificity and catalytic power, and conserved sites are possibly responsible for the preservation of fundamental function.
PLOS ONE | 2013
Xiao-Qin Sun; Hui Pang; Mi-Mi Li; Bin Peng; Haisong Guo; Qinqin Yan; Yue-Yu Hang
The fatty acid elongase 1 (FAE1) gene catalyzes the initial condensation step in the elongation pathway of VLCFA (very long chain fatty acid) biosynthesis and is thus a key gene in erucic acid biosynthesis. Based on a worldwide collection of 62 accessions representing 14 tribes, 31 genera, 51 species, 4 subspecies and 7 varieties, we conducted a phylogenetic reconstruction and correlation analysis between genetic variations in the FAE1 gene and the erucic acid trait, attempting to gain insight into the evolutionary patterns and the correlations between genetic variations in FAE1 and trait variations. The five clear, deeply diverged clades detected in the phylogenetic reconstruction are largely congruent with a previous multiple gene-derived phylogeny. The Ka/Ks ratio (<1) and overall low level of nucleotide diversity in the FAE1 gene suggest that purifying selection is the major evolutionary force acting on this gene. Sequence variations in FAE1 show a strong correlation with the content of erucic acid in seeds, suggesting a causal link between the two. Furthermore, we detected 16 mutations that were fixed between the low and high phenotypes of the FAE1 gene, which constitute candidate active sites in this gene for altering the content of erucic acid in seeds. Our findings begin to shed light on the evolutionary pattern of this important gene and represent the first step in elucidating how the sequence variations impact the production of erucic acid in plants.
Virus Research | 2014
Guang-Can Zhou; Xiao-Yi Wu; Yan-Mei Zhang; Ping Wu; Xun-Zong Wu; Li-Wei Liu; Qiang Wang; Yue-Yu Hang; Jia-Yin Yang; Zhu-Qing Shao; Bin Wang; Jian-Qun Chen
Widely known as a severe pathogen of bean plants, the bean common mosaic virus (BCMV) has been reported to infect soybeans only sporadically and the involved strains were all found in China regions. To explore variations among soybean-infecting BCMV strains, hundreds of soybean mosaic leave samples were collected throughout China, with a total of 30 BCMV isolates detected and their genomes sequenced. These newly obtained genomes, together with 16 other BCMV genomes available in GenBank were examined from multiple aspects to characterize BCMV evolutionary processes. Phylogenetic analysis showed that both soybean-infecting BCMVs (group I) and peanut-infecting BCMVs (group II) are distantly related to other BCMVs, suggesting ancestral differentiation and host adaptation. Genetic variation analysis showed that P1, P3 and 6K2 genes and the beginning portion of CP gene showed higher levels of variation relative to other genes. Moreover, selection analyses further confirmed that a number of sites within the P1 and P3 genes have suffered positive selection. These obtained BCMV sequences also exhibit high recombination frequencies, indicating a more dynamic evolutionary history. Finally, 12 different soybean cultivars were challenged with two BCMV isolates (DXH015 and HZZB011), with most of the cultivars successfully infected. These findings suggest that BCMV is indeed a potential threat to soybean production.