Wenjie Li | Researchain

Archive Network Publication Hotspot Collaboration

Network

Latest external collaboration on country level. Dive into details by clicking on the dots.

Explore More

Hotspot

Dive into the research topics where Wenjie Li is active.

Explore More

Publication

Featured researches published by Wenjie Li.

Science | 2002

A draft sequence of the rice genome (Oryza sativa L. ssp indica)

Jun Yu; Songnian Hu; Jun Wang; Gane Ka-Shu Wong; Songgang Li; Bin Liu; Yajun Deng; Yan Zhou; Xiuqing Zhang; Mengliang Cao; Jing Liu; Jiandong Sun; Jiabin Tang; Yanjiong Chen; Xiaobing Huang; Wei Lin; Chen Ye; Wei Tong; Lijuan Cong; Jianing Geng; Yujun Han; Lin Li; Wei Li; Guangqiang Hu; Xiangang Huang; Wenjie Li; Jian Li; Zhanwei Liu; Long Li; Jianping Liu

The genome of the japonica subspecies of rice, an important cereal and model monocot, was sequenced and assembled by whole-genome shotgun sequencing. The assembled sequence covers 93% of the 420-megabase genome. Gene predictions on the assembled sequence suggest that the genome contains 32,000 to 50,000 genes. Homologs of 98% of the known maize, wheat, and barley proteins are found in rice. Synteny and gene homology between rice and the other cereal genomes are extensive, whereas synteny with Arabidopsis is limited. Assignment of candidate rice orthologs to Arabidopsis genes is possible in many cases. The rice genome sequence provides a foundation for the improvement of cereals, our most important crops.

Nature | 2008

The diploid genome sequence of an Asian individual

Jun Wang; Wei Wang; Ruiqiang Li; Yingrui Li; Geng Tian; Laurie Goodman; Wei Fan; Junqing Zhang; Jun Li; Juanbin Zhang; Yiran Guo; Binxiao Feng; Heng Li; Yao Lu; Xiaodong Fang; Huiqing Liang; Z. Du; Dong Li; Yiqing Zhao; Yujie Hu; Zhenzhen Yang; Hancheng Zheng; Ines Hellmann; Michael Inouye; John E. Pool; Xin Yi; Jing Zhao; Jinjie Duan; Yan Zhou; Junjie Qin

Here we present the first diploid genome sequence of an Asian individual. The genome was sequenced to 36-fold average coverage using massively parallel sequencing technology. We aligned the short reads onto the NCBI human reference genome to 99.97% coverage, and guided by the reference genome, we used uniquely mapped reads to assemble a high-quality consensus sequence for 92% of the Asian individual’s genome. We identified approximately 3 million single-nucleotide polymorphisms (SNPs) inside this region, of which 13.6% were not in the dbSNP database. Genotyping analysis showed that SNP identification had high accuracy and consistency, indicating the high sequence quality of this assembly. We also carried out heterozygote phasing and haplotype prediction against HapMap CHB and JPT haplotypes (Chinese and Japanese, respectively), sequence comparison with the two available individual genomes (J. D. Watson and J. C. Venter), and structural variation identification. These variations were considered for their potential biological impact. Our sequence data and analyses demonstrate the potential usefulness of next-generation sequencing technologies for personal genomics.

Proceedings of the National Academy of Sciences of the United States of America | 2015

Extremely high genetic diversity in a single tumor points to prevalence of non-Darwinian cell evolution.

Shaoping Ling; Zheng Hu; Z.F. Yang; Fang Yang; Yawei Li; Pei Lin; Ke Chen; Lili Dong; Lihua Cao; Yong Tao; Lingtong Hao; Qingjian Chen; Qiang Gong; Dafei Wu; Wenjie Li; Wenming Zhao; Xiuyun Tian; Chunyi Hao; Eric A. Hungate; Daniel V.T. Catenacci; Richard R. Hudson; Wen-Hsiung Li; Xuemei Lu; Chung-I Wu

Significance A tumor comprising many cells can be compared to a natural population with many individuals. The amount of genetic diversity reflects how it has evolved and can influence its future evolution. We evaluated a single tumor by sequencing or genotyping nearly 300 regions from the tumor. When the data were analyzed by modern population genetic theory, we estimated more than 100 million coding region mutations in this unexceptional tumor. The extreme genetic diversity implies evolution under the non-Darwinian mode. In contrast, under the prevailing view of Darwinian selection, the genetic diversity would be orders of magnitude lower. Because genetic diversity accrues rapidly, a high probability of drug resistance should be heeded, even in the treatment of microscopic tumors. The prevailing view that the evolution of cells in a tumor is driven by Darwinian selection has never been rigorously tested. Because selection greatly affects the level of intratumor genetic diversity, it is important to assess whether intratumor evolution follows the Darwinian or the non-Darwinian mode of evolution. To provide the statistical power, many regions in a single tumor need to be sampled and analyzed much more extensively than has been attempted in previous intratumor studies. Here, from a hepatocellular carcinoma (HCC) tumor, we evaluated multiregional samples from the tumor, using either whole-exome sequencing (WES) (n = 23 samples) or genotyping (n = 286) under both the infinite-site and infinite-allele models of population genetics. In addition to the many single-nucleotide variations (SNVs) present in all samples, there were 35 “polymorphic” SNVs among samples. High genetic diversity was evident as the 23 WES samples defined 20 unique cell clones. With all 286 samples genotyped, clonal diversity agreed well with the non-Darwinian model with no evidence of positive Darwinian selection. Under the non-Darwinian model, MALL (the number of coding region mutations in the entire tumor) was estimated to be greater than 100 million in this tumor. DNA sequences reveal local diversities in small patches of cells and validate the estimation. In contrast, the genetic diversity under a Darwinian model would generally be orders of magnitude smaller. Because the level of genetic diversity will have implications on therapeutic resistance, non-Darwinian evolution should be heeded in cancer treatments even for microscopic tumors.

Proceedings of the National Academy of Sciences of the United States of America | 2011

Rapid growth of a hepatocellular carcinoma and the driving mutations revealed by cell-population genetic analysis of whole-genome data

Yong Tao; Jue Ruan; Shiou-Hwei Yeh; Xuemei Lu; Yu Wang; Weiwei Zhai; Jun Cai; Shaoping Ling; Qiang Gong; Zecheng Chong; Zhengzhong Qu; Qianqian Li; Jiang Liu; Jin Yang; Caihong Zheng; Changqing Zeng; Hurng-Yi Wang; Jing Zhang; Sheng-Han Wang; Lingtong Hao; Lili Dong; Wenjie Li; Min Sun; Wei Zou; Caixia Yu; Chaohua Li; Guojing Liu; Lan Jiang; Jin Xu; Huanwei Huang

We present the analysis of the evolution of tumors in a case of hepatocellular carcinoma. This case is particularly informative about cancer growth dynamics and the underlying driving mutations. We sampled nine different sections from three tumors and seven more sections from the adjacent nontumor tissues. Selected sections were subjected to exon as well as whole-genome sequencing. Putative somatic mutations were then individually validated across all 9 tumor and 7 nontumor sections. Among the mutations validated, 24 were amino acid changes; in addition, 22 large indels/copy number variants (>1 Mb) were detected. These somatic mutations define four evolutionary lineages among tumor cells. Separate evolution and expansion of these lineages were recent and rapid, each apparently having only one lineage-specific protein-coding mutation. Hence, by using a cell-population genetic definition, this approach identified three coding changes (CCNG1, P62, and an indel/fusion gene) as tumor driver mutations. These three mutations, affecting cell cycle control and apoptosis, are functionally distinct from mutations that accumulated earlier, many of which are involved in inflammation/immunity or cell anchoring. These distinct functions of mutations at different stages may reflect the genetic interactions underlying tumor growth.

Chinese Science Bulletin | 2003

A complete sequence and comparative analysis of a SARS-associated virus (Isolate BJ01)

E’de Qin; Qingyu Zhu; Man Yu; Baochang Fan; Guohui Chang; Bingyin Si; Bao’an Yang; Wenming Peng; Tao Jiang; Bohua Liu; Yong-Qiang Deng; Liu H; Yu Zhang; Cui’e Wang; Y. Li; Yonghua Gan; Xiaoyu Li; Fushuang Lü; Gang Tan; Wuchun Cao; Ruifu Yang; Jian Wang; Wei Li; Z. Y. Xu; Yan Li; Qingfa Wu; Wei Lin; Weijun Chen; Lin Tang; Yajun Deng

The genome sequence of the Severe Acute Respiratory Syndrome (SARS)-associated virus provides essential information for the identification of pathogen(s), exploration of etiology and evolution, interpretation of transmission and pathogenesis, development of diagnostics, prevention by future vaccination, and treatment by developing new drugs. We report the complete genome sequence and comparative analysis of an isolate (BJ01) of the coronavirus that has been recognized as a pathogen for SARS. The genome is 29725 nt in size and has 11 ORFs (Open Reading Frames). It is composed of a stable region encoding an RNA-dependent RNA polymerase (composed of 2 ORFs) and a variable region representing 4 CDSs (coding sequences) for viral structural genes (the S, E, M, N proteins) and 5 PUPs (putative uncharacterized proteins). Its gene order is identical to that of other known coronaviruses. The sequence alignment with all known RNA viruses places this virus as a member in the family of Coronaviridae. Thirty putative substitutions have been identified by comparative analysis of the 5 SARS-associated virus genome sequences in GenBank. Fifteen of them lead to possible amino acid changes (non-synonymous mutations) in the proteins. Three amino acid changes, with predicted alteration of physical and chemical features, have been detected in the S protein that is postulated to be involved in the immunoreactions between the virus and its host. Two amino acid changes have been detected in the M protein, which could be related to viral envelope formation. Phylogenetic analysis suggests the possibility of non-human origin of the SARS-associated viruses but provides no evidence that they are man-made. Further efforts should focus on identifying the etiology of the SARS-associated virus and ruling out conclusively the existence of other possible SARS-related pathogen(s).

Nature plants | 2016

The rubber tree genome reveals new insights into rubber production and species adaptation.

Chaorong Tang; Meng Yang; Yongjun Fang; Yingfeng Luo; Shenghan Gao; Xiaohu Xiao; Zewei An; Binhui Zhou; Bing Zhang; Xinyu Tan; Hoong Yeet Yeang; Yunxia Qin; Jianghua Yang; Qiang Lin; Hailiang Mei; Pascal Montoro; Xiangyu Long; Jiyan Qi; Yuwei Hua; Zilong He; Min Sun; Wenjie Li; Xia Zeng; Han Cheng; Ying Liu; Jin Yang; Weimin Tian; Nansheng Zhuang; Rizhong Zeng; Dejun Li

The Para rubber tree (Hevea brasiliensis) is an economically important tropical tree species that produces natural rubber, an essential industrial raw material. Here we present a high-quality genome assembly of this species (1.37 Gb, scaffold N50 = 1.28 Mb) that covers 93.8% of the genome (1.47 Gb) and harbours 43,792 predicted protein-coding genes. A striking expansion of the REF/SRPP (rubber elongation factor/small rubber particle protein) gene family and its divergence into several laticifer-specific isoforms seem crucial for rubber biosynthesis. The REF/SRPP family has isoforms with sizes similar to or larger than SRPP1 (204 amino acids) in 17 other plants examined, but no isoforms with similar sizes to REF1 (138 amino acids), the predominant molecular variant. A pivotal point in Hevea evolution was the emergence of REF1, which is located on the surface of large rubber particles that account for 93% of rubber in the latex (despite constituting only 6% of total rubber particles, large and small). The stringent control of ethylene synthesis under active ethylene signalling and response in laticifers resolves a longstanding mystery of ethylene stimulation in rubber production. Our study, which includes the re-sequencing of five other Hevea cultivars and extensive RNA-seq data, provides a valuable resource for functional genomics and tools for breeding elite Hevea cultivars.

Chinese Science Bulletin | 2001

A draft sequence of the rice (Oryza sativa ssp.indica) genome

Jun Yu; Songnian Hu; Jun Wang; Songgang Li; Ka-Shu Gane Wong; Bin Liu; Yajun Deng; Li Dai; Yan Zhou; Xiuqing Zhang; Mengliang Cao; Jing Liu; Jiandong Sun; Jiabin Tang; Yanjiong Chen; Xiaobing Huang; Wei Lin; Chen Ye; Wei Tong; Lijuan Cong; Jianing Geng; Yujun Han; Lin Li; Wei Li; Guangqiang Hu; Xiangang Huang; Wenjie Li; Jian Li; Zhanwei Liu; Long Li

The sequence of the rice genome holds fundamental information for its biology, including physiology, genetics, development, and evolution, as well as information on many beneficial phenotypes of economic significance. Using a “whole genome shotgun” approach, we have produced a draft rice genome sequence ofOryza sativa ssp.indica, the major crop rice subspecies in China and many other regions of Asia. The draft genome sequence is constructed from over 4.3 million successful sequencing traces with an accumulative total length of 2214.9 Mb. The initial assembly of the non-redundant sequences reached 409.76 Mb in length, based on 3.30 million successful sequencing traces with a total length of 1797.4 Mb from anindica variant cultivar93-11, giving an estimated coverage of 95.29% of the rice genome with an average base accuracy of higher than 99%. The coverage of the draft sequence, the randomness of the sequence distribution, and the consistency of BIG-ASSEMBLER, a custom-designed software package used for the initial assembly, were verified rigorously by comparisons against finished BAC clone sequences from bothindica andjapanica strains, available from the public databases. Over all, 96.3% of full-length cDNAs, 96.4% of STS, STR, RFLP markers, 94.0% of ESTs and 94.9% unigene clusters were identified from the draft sequence. Our preliminary analysis on the data set shows that our rice draft sequence is consistent with the comman standard accepted by the genome sequencing community. The unconditional release of the draft to the public also undoubtedly provides a fundamental resource to the international scientific communities to facilitate genomic and genetic studies on rice biology.

Genomics, Proteomics & Bioinformatics | 2003

Complete Genome Sequences of the SARS-CoV: the BJ Group (Isolates BJ01-BJ04)

Shengli Bi; E’de Qin; Z. Y. Xu; Wei Li; Jing Wang; Yongwu Hu; Yong Liu; Shumin Duan; Jianfei Hu; Yujun Han; Jing Xu; Yan Li; Yao Yi; Yongdong Zhou; Wei Lin; Jie Wen; Hong Xu; Ruan Li; Zizhang Zhang; Haiyan Sun; Jingui Zhu; Man Yu; Baochang Fan; Qingfa Wu; Lin Tang; Bao’an Yang; Guoqing Li; Wenming Peng; Wenjie Li; Tao Jiang

Beijing has been one of the epicenters attacked most severely by the SARS-CoV (severe acute respiratory syndrome-associated coronavirus) since the first patient was diagnosed in one of the city’s hospitals. We now report complete genome sequences of the BJ Group, including four isolates (Isolates BJ01, BJ02, BJ03, and BJ04) of the SARS-CoV. It is remarkable that all members of the BJ Group share a common haplotype, consisting of seven loci that differentiate the group from other isolates published to date. Among 42 substitutions uniquely identified from the BJ group, 32 are non-synonymous changes at the amino acid level. Rooted phylogenetic trees, proposed on the basis of haplotypes and other sequence variations of SARS-CoV isolates from Canada, USA, Singapore, and China, gave rise to different paradigms but positioned the BJ Group, together with the newly discovered GD01 (GD-Ins29) in the same clade, followed by the H-U Group (from Hong Kong to USA) and the H-T Group (from Hong Kong to Toronto), leaving the SP Group (Singapore) more distant. This result appears to suggest a possible transmission path from Guangdong to Beijing/Hong Kong, then to other countries and regions.

Science | 2004

A Draft Sequence for the Genome of the Domesticated Silkworm (Bombyx mori)

Qingyou Xia; Zeyang Zhou; Cheng Lu; Daojun Cheng; Fangyin Dai; Bin Li; Ping Zhao; Xingfu Zha; Tingcai Cheng; Chunli Chai; Guoqing Pan; Jinshan Xu; Chun Liu; Ying Lin; Jifeng Qian; Yong Hou; Zhengli Wu; Guanrong Li; Minhui Pan; Chunfeng Li; Yihong Shen; Xiqian Lan; Lianwei Yuan; Tian Li; Hanfu Xu; Guangwei Yang; Yongji Wan; Yong Zhu; Maode Yu; Weide Shen

BMC Genomics | 2015