Network


Latest external collaboration on country level. Dive into details by clicking on the dots.

Hotspot


Dive into the research topics where Yingjie Zhu is active.

Publication


Featured researches published by Yingjie Zhu.


PLOS ONE | 2010

Validation of the ITS2 Region as a Novel DNA Barcode for Identifying Medicinal Plant Species

Shilin Chen; Hui Yao; Jianping Han; Chang Liu; Jingyuan Song; Linchun Shi; Yingjie Zhu; Xinye Ma; Ting Gao; Kun Luo; Ying Li; Xiwen Li; Xiaocheng Jia; Yulin Lin; Christine Leon

Background The plant working group of the Consortium for the Barcode of Life recommended the two-locus combination of rbcL + matK as the plant barcode, yet the combination was shown to successfully discriminate among 907 samples from 550 species at the species level with a probability of 72%. The group admits that the two-locus barcode is far from perfect due to the low identification rate, and the search is not over. Methodology/Principal Findings Here, we compared seven candidate DNA barcodes (psbA-trnH, matK, rbcL, rpoC1, ycf5, ITS2, and ITS) from medicinal plant species. Our ranking criteria included PCR amplification efficiency, differential intra- and inter-specific divergences, and the DNA barcoding gap. Our data suggest that the second internal transcribed spacer (ITS2) of nuclear ribosomal DNA represents the most suitable region for DNA barcoding applications. Furthermore, we tested the discrimination ability of ITS2 in more than 6600 plant samples belonging to 4800 species from 753 distinct genera and found that the rate of successful identification with the ITS2 was 92.7% at the species level. Conclusions The ITS2 region can be potentially used as a standard DNA barcode to identify medicinal plants and their closely related species. We also propose that ITS2 can serve as a novel universal barcode for the identification of a broader range of plant taxa.


Proceedings of the National Academy of Sciences of the United States of America | 2011

Comparative analysis of a large dataset indicates that internal transcribed spacer (ITS) should be incorporated into the core barcode for seed plants

De-Zhu Li; Lian-Ming Gao; Hong-Tao Li; Hong Wang; Xue-Jun Ge; Jianquan Liu; Zhi-Duan Chen; Shiliang Zhou; Shilin Chen; Jun-Bo Yang; Cheng-Xin Fu; Chun-Xia Zeng; Hai-Fei Yan; Yingjie Zhu; Yongshuai Sun; Si-Yun Chen; Lei Zhao; Kun Wang; Tuo Yang; Guang-Wen Duan

A two-marker combination of plastid rbcL and matK has previously been recommended as the core plant barcode, to be supplemented with additional markers such as plastid trnH–psbA and nuclear ribosomal internal transcribed spacer (ITS). To assess the effectiveness and universality of these barcode markers in seed plants, we sampled 6,286 individuals representing 1,757 species in 141 genera of 75 families (42 orders) by using four different methods of data analysis. These analyses indicate that (i) the three plastid markers showed high levels of universality (87.1–92.7%), whereas ITS performed relatively well (79%) in angiosperms but not so well in gymnosperms; (ii) in taxonomic groups for which direct sequencing of the marker is possible, ITS showed the highest discriminatory power of the four markers, and a combination of ITS and any plastid DNA marker was able to discriminate 69.9–79.1% of species, compared with only 49.7% with rbcL + matK; and (iii) where multiple individuals of a single species were tested, ascriptions based on ITS and plastid DNA barcodes were incongruent in some samples for 45.2% of the sampled genera (for genera with more than one species sampled). This finding highlights the importance of both sampling multiple individuals and using markers with different modes of inheritance. In cases where it is difficult to amplify and directly sequence ITS in its entirety, just using ITS2 is a useful backup because it is easier to amplify and sequence this subset of the marker. We therefore propose that ITS/ITS2 should be incorporated into the core barcode for seed plants.


Nature Communications | 2012

Genome sequence of the model medicinal mushroom Ganoderma lucidum

Chen Sl; Jiang Xu; Chang Liu; Yingjie Zhu; David R. Nelson; Shiguo Zhou; Chunfang Li; Lizhi Wang; Xu Guo; Yongzhen Sun; Hongmei Luo; Ying Li; Jingyuan Song; Bernard Henrissat; Anthony Levasseur; Jun Qian; Jianqin Li; Xiang Luo; Linchun Shi; Liu He; Li Xiang; Xiaolan Xu; Yunyun Niu; Qiushi Li; Mira V. Han; Haixia Yan; Jin Zhang; Haimei Chen; Aiping Lv; Zhen Wang

Ganoderma lucidum is a widely used medicinal macrofungus in traditional Chinese medicine that creates a diverse set of bioactive compounds. Here we report its 43.3-Mb genome, encoding 16,113 predicted genes, obtained using next-generation sequencing and optical mapping approaches. The sequence analysis reveals an impressive array of genes encoding cytochrome P450s (CYPs), transporters and regulatory proteins that cooperate in secondary metabolism. The genome also encodes one of the richest sets of wood degradation enzymes among all of the sequenced basidiomycetes. In all, 24 physical CYP gene clusters are identified. Moreover, 78 CYP genes are coexpressed with lanosterol synthase, and 16 of these show high similarity to fungal CYPs that specifically hydroxylate testosterone, suggesting their possible roles in triterpenoid biosynthesis. The elucidation of the G. lucidum genome makes this organism a potential model system for the study of secondary metabolic pathways and their regulation in medicinal fungi.


BMC Genomics | 2012

CpGAVAS, an integrated web server for the annotation, visualization, analysis, and GenBank submission of completely sequenced chloroplast genome sequences

Chang Liu; Linchun Shi; Yingjie Zhu; Haimei Chen; Jianhui Zhang; Xiaohan Lin; Xiaojun Guan

BackgroundThe complete sequences of chloroplast genomes provide wealthy information regarding the evolutionary history of species. With the advance of next-generation sequencing technology, the number of completely sequenced chloroplast genomes is expected to increase exponentially, powerful computational tools annotating the genome sequences are in urgent need.ResultsWe have developed a web server CPGAVAS. The server accepts a complete chloroplast genome sequence as input. First, it predicts protein-coding and rRNA genes based on the identification and mapping of the most similar, full-length protein, cDNA and rRNA sequences by integrating results from Blastx, Blastn, protein2genome and est2genome programs. Second, tRNA genes and inverted repeats (IR) are identified using tRNAscan, ARAGORN and vmatch respectively. Third, it calculates the summary statistics for the annotated genome. Fourth, it generates a circular map ready for publication. Fifth, it can create a Sequin file for GenBank submission. Last, it allows the extractions of protein and mRNA sequences for given list of genes and species. The annotation results in GFF3 format can be edited using any compatible annotation editing tools. The edited annotations can then be uploaded to CPGAVAS for update and re-analyses repeatedly. Using known chloroplast genome sequences as test set, we show that CPGAVAS performs comparably to another application DOGMA, while having several superior functionalities.ConclusionsCPGAVAS allows the semi-automatic and complete annotation of a chloroplast genome sequence, and the visualization, editing and analysis of the annotation results. It will become an indispensible tool for researchers studying chloroplast genomes. The software is freely accessible fromhttp://www.herbalgenomics.org/cpgavas.


PLOS ONE | 2013

The Complete Chloroplast Genome Sequence of the Medicinal Plant Salvia miltiorrhiza

Jun Qian; Jingyuan Song; Huanhuan Gao; Yingjie Zhu; Jiang Xu; Hui Yao; Chao Sun; Xian’en Li; Chuyuan Li; Juyan Liu; Haibin Xu; Shilin Chen

Salvia miltiorrhiza is an important medicinal plant with great economic and medicinal value. The complete chloroplast (cp) genome sequence of Salvia miltiorrhiza, the first sequenced member of the Lamiaceae family, is reported here. The genome is 151,328 bp in length and exhibits a typical quadripartite structure of the large (LSC, 82,695 bp) and small (SSC, 17,555 bp) single-copy regions, separated by a pair of inverted repeats (IRs, 25,539 bp). It contains 114 unique genes, including 80 protein-coding genes, 30 tRNAs and four rRNAs. The genome structure, gene order, GC content and codon usage are similar to the typical angiosperm cp genomes. Four forward, three inverted and seven tandem repeats were detected in the Salvia miltiorrhiza cp genome. Simple sequence repeat (SSR) analysis among the 30 asterid cp genomes revealed that most SSRs are AT-rich, which contribute to the overall AT richness of these cp genomes. Additionally, fewer SSRs are distributed in the protein-coding sequences compared to the non-coding regions, indicating an uneven distribution of SSRs within the cp genomes. Entire cp genome comparison of Salvia miltiorrhiza and three other Lamiales cp genomes showed a high degree of sequence similarity and a relatively high divergence of intergenic spacers. Sequence divergence analysis discovered the ten most divergent and ten most conserved genes as well as their length variation, which will be helpful for phylogenetic studies in asterids. Our analysis also supports that both regional and functional constraints affect gene sequence evolution. Further, phylogenetic analysis demonstrated a sister relationship between Salvia miltiorrhiza and Sesamum indicum. The complete cp genome sequence of Salvia miltiorrhiza reported in this paper will facilitate population, phylogenetic and cp genetic engineering studies of this medicinal plant.


BMC Evolutionary Biology | 2010

Evaluating the feasibility of using candidate DNA barcodes in discriminating species of the large Asteraceae family

Ting Gao; Hui Yao; Jingyuan Song; Yingjie Zhu; Chang Liu; Shilin Chen

BackgroundFive DNA regions, namely, rbcL, matK, ITS, ITS2, and psbA-trnH, have been recommended as primary DNA barcodes for plants. Studies evaluating these regions for species identification in the large plant taxon, which includes a large number of closely related species, have rarely been reported.ResultsThe feasibility of using the five proposed DNA regions was tested for discriminating plant species within Asteraceae, the largest family of flowering plants. Among these markers, ITS2 was the most useful in terms of universality, sequence variation, and identification capability in the Asteraceae family. The species discriminating power of ITS2 was also explored in a large pool of 3,490 Asteraceae sequences that represent 2,315 species belonging to 494 different genera. The result shows that ITS2 correctly identified 76.4% and 97.4% of plant samples at the species and genus levels, respectively. In addition, ITS2 displayed a variable ability to discriminate related species within different genera.ConclusionsITS2 is the best DNA barcode for the Asteraceae family. This approach significantly broadens the application of DNA barcoding to resolve classification problems in the family Asteraceae at the genera and species levels.


Cladistics | 2011

Applying plant DNA barcodes for Rosaceae species identification

Jingyuan Song; Yingjie Zhu; Hongxi Xu; Linfang Huang; Shilin Chen

© The Willi Hennig Society 2010.


Plant Cell Reports | 2011

454 EST analysis detects genes putatively involved in ginsenoside biosynthesis in Panax ginseng

Chen Sl; Hongmei Luo; Ying Li; Yongzhen Sun; Q. Wu; Y. Niu; Jingyuan Song; A. Lv; Yingjie Zhu; Chao Sun; A. Steinmetz; Z. Qian

Panax ginseng C.A. Meyer is one of the most highly valued medicinal plants in the world. To analyze the transcriptome of P. ginseng and discover the genes involved in ginsenoside biosynthesis, cDNAs derived from the total RNA of 11-year-old, wood-grown P. ginseng roots were analyzed by 454 sequencing. A total of 217,529 high quality reads (expressed sequence tags, ESTs), with an average length of 409 bases, were generated from a one-quarter run to yield 31,741 unique sequences. The majority (20,198; 63.6%) of the unique sequences were annotated using BLAST similarity searches. A total of 16,810 and 16,577 unique sequences were assigned to functional classifications and biochemical pathways based on Gene Ontology analysis and the Kyoto Encyclopedia of Genes and Genomes assignment, respectively. Nine genes involved in the biosynthesis of ginsenoside skeletons and many candidate genes putatively responsible for modification of the skeletons, including 133 cytochrome P450s and 235 glycosyltransferases, were identified. From these candidates, six transcripts encoding UDP-glycosyltransferases that were most likely to be involved in ginsenoside biosynthesis were selected. These results open a new avenue by which to explore and exploit biosynthetic and biochemical properties that may lead to drug improvement. These 454 ESTs will provide the foundation for further functional genomic research into the traditional herb P. ginseng or its closely related species.


BMC Genomics | 2013

Transcriptome analysis reveals ginsenosides biosynthetic genes, microRNAs and simple sequence repeats in Panax ginseng C. A. Meyer

Chunfang Li; Yingjie Zhu; Xu Guo; Chao Sun; Hongmei Luo; Jingyuan Song; Ying Li; Lizhi Wang; Jun Qian; Shilin Chen

BackgroundPanax ginseng C. A. Meyer is one of the most widely used medicinal plants. Complete genome information for this species remains unavailable due to its large genome size. At present, analysis of expressed sequence tags is still the most powerful tool for large-scale gene discovery. The global expressed sequence tags from P. ginseng tissues, especially those isolated from stems, leaves and flowers, are still limited, hindering in-depth study of P. ginseng.ResultsTwo 454 pyrosequencing runs generated a total of 2,423,076 reads from P. ginseng roots, stems, leaves and flowers. The high-quality reads from each of the tissues were independently assembled into separate and shared contigs. In the separately assembled database, 45,849, 6,172, 4,041 and 3,273 unigenes were only found in the roots, stems, leaves and flowers database, respectively. In the jointly assembled database, 178,145 unigenes were observed, including 86,609 contigs and 91,536 singletons. Among the 178,145 unigenes, 105,522 were identified for the first time, of which 65.6% were identified in the stem, leaf or flower cDNA libraries of P. ginseng. After annotation, we discovered 223 unigenes involved in ginsenoside backbone biosynthesis. Additionally, a total of 326 potential cytochrome P450 and 129 potential UDP-glycosyltransferase sequences were predicted based on the annotation results, some of which may encode enzymes responsible for ginsenoside backbone modification. A BLAST search of the obtained high-quality reads identified 14 potential microRNAs in P. ginseng, which were estimated to target 100 protein-coding genes, including transcription factors, transporters and DNA binding proteins, among others. In addition, a total of 13,044 simple sequence repeats were identified from the 178,145 unigenes.ConclusionsThis study provides global expressed sequence tags for P. ginseng, which will contribute significantly to further genome-wide research and analyses in this species. The novel unigenes identified here enlarge the available P. ginseng gene pool and will facilitate gene discovery. In addition, the identification of microRNAs and the prediction of targets from this study will provide information on gene transcriptional regulation in P. ginseng. Finally, the analysis of simple sequence repeats will provide genetic makers for molecular breeding and genetic applications in this species.


Molecular Plant | 2016

Analysis of the Genome Sequence of the Medicinal Plant Salvia miltiorrhiza

Haibin Xu; Jingyuan Song; Hongmei Luo; Yujun Zhang; Qiushi Li; Yingjie Zhu; Jiang Xu; Ying Li; Chi Song; Bo Wang; Wei Sun; Guoan Shen; Xin Zhang; Jun Qian; Aijia Ji; Zhichao Xu; Xiang Luo; Liu He; Chuyuan Li; Chao Sun; Haixia Yan; Guanghong Cui; Xiwen Li; Xian'en Li; Jianhe Wei; Juyan Liu; Wang Y; A. C. Hayward; David R. Nelson; Zemin Ning

Document S1. Supplemental Methods and Supplemental ReferencesxDownload (.04 MB ) Document S1. Supplemental Methods and Supplemental ReferencesSupplemental Figure 1. Flow Cytometry of S. miltiorrhiza for Genome Size EstimationxDownload (.08 MB ) Supplemental Figure 1. Flow Cytometry of S. miltiorrhiza for Genome Size EstimationSupplemental Figure 2. Evolutionary Relationship of the Putative SmCPS1- and SmCPS2- Gene ClustersxDownload (.04 MB ) Supplemental Figure 2. Evolutionary Relationship of the Putative SmCPS1- and SmCPS2- Gene ClustersSupplemental Figure 3. Identification of SNPs and INDELs between Purple- and White-Flower S. miltiorrhiza VarietiesxDownload (.07 MB ) Supplemental Figure 3. Identification of SNPs and INDELs between Purple- and White-Flower S. miltiorrhiza VarietiesSupplemental Table 1. Statistics of Raw Sequencing Data for S. miltiorrhiza Genome AssemblyxDownload (.03 MB ) Supplemental Table 1. Statistics of Raw Sequencing Data for S. miltiorrhiza Genome AssemblySupplemental Table 2. Summary Statistics of De Novo Genome Assembly from Different DatasetsxDownload (.03 MB ) Supplemental Table 2. Summary Statistics of De Novo Genome Assembly from Different DatasetsSupplemental Table 3. Statistics of SNPs in the S. miltiorrhiza GenomexDownload (.01 MB ) Supplemental Table 3. Statistics of SNPs in the S. miltiorrhiza GenomeSupplemental Table 4. Summary of Gene and Repeat Annotation of the S. miltiorrhiza GenomexDownload (.03 MB ) Supplemental Table 4. Summary of Gene and Repeat Annotation of the S. miltiorrhiza GenomeSupplemental Table 5. List of 1620 Transcription Factors Identified in the S. miltiorrhiza GenomexDownload (.05 MB ) Supplemental Table 5. List of 1620 Transcription Factors Identified in the S. miltiorrhiza GenomeSupplemental Table 6. List of 82 TPS Genes in the S. miltiorrhiza GenomexDownload (.02 MB ) Supplemental Table 6. List of 82 TPS Genes in the S. miltiorrhiza GenomeSupplemental Table 7. List of 437 Putative CYPs in the S. miltiorrhiza GenomexDownload (.03 MB ) Supplemental Table 7. List of 437 Putative CYPs in the S. miltiorrhiza GenomeSupplemental Table 8. Co-Expression Analysis of CYPs with SmCPS1xDownload (.01 MB ) Supplemental Table 8. Co-Expression Analysis of CYPs with SmCPS1Supplemental Table 9. Putative Genes Encoding Tanshinone Biosynthetic Enzymes in S. miltiorrhizaxDownload (.01 MB ) Supplemental Table 9. Putative Genes Encoding Tanshinone Biosynthetic Enzymes in S. miltiorrhiza

Collaboration


Dive into the Yingjie Zhu's collaboration.

Top Co-Authors

Avatar

Jingyuan Song

Peking Union Medical College

View shared research outputs
Top Co-Authors

Avatar

Shilin Chen

Peking Union Medical College

View shared research outputs
Top Co-Authors

Avatar

Chao Sun

Peking Union Medical College

View shared research outputs
Top Co-Authors

Avatar

Hongmei Luo

Peking Union Medical College

View shared research outputs
Top Co-Authors

Avatar

Ying Li

Peking Union Medical College

View shared research outputs
Top Co-Authors

Avatar

Hui Yao

Peking Union Medical College

View shared research outputs
Top Co-Authors

Avatar

Jun Qian

Peking Union Medical College

View shared research outputs
Top Co-Authors

Avatar

Aijia Ji

Peking Union Medical College

View shared research outputs
Top Co-Authors

Avatar

Haibin Xu

Peking Union Medical College

View shared research outputs
Top Co-Authors

Avatar

Linchun Shi

Peking Union Medical College

View shared research outputs
Researchain Logo
Decentralizing Knowledge