Network


Latest external collaboration on country level. Dive into details by clicking on the dots.

Hotspot


Dive into the research topics where Shengting Li is active.

Publication


Featured researches published by Shengting Li.


Nature | 2010

A human gut microbial gene catalogue established by metagenomic sequencing

Junjie Qin; Ruiqiang Li; Jeroen Raes; Manimozhiyan Arumugam; Kristoffer Sølvsten Burgdorf; Chaysavanh Manichanh; Trine Nielsen; Nicolas Pons; Florence Levenez; Takuji Yamada; Daniel R. Mende; Junhua Li; Junming Xu; Shaochuan Li; Dongfang Li; Jianjun Cao; Bo Wang; Huiqing Liang; Huisong Zheng; Yinlong Xie; Julien Tap; Patricia Lepage; Marcelo Bertalan; Jean-Michel Batto; Torben Hansen; Denis Le Paslier; Allan Linneberg; H. Bjørn Nielsen; Eric Pelletier; Pierre Renault

To understand the impact of gut microbes on human health and well-being it is crucial to assess their genetic potential. Here we describe the Illumina-based metagenomic sequencing, assembly and characterization of 3.3 million non-redundant microbial genes, derived from 576.7 gigabases of sequence, from faecal samples of 124 European individuals. The gene set, ∼150 times larger than the human gene complement, contains an overwhelming majority of the prevalent (more frequent) microbial genes of the cohort and probably includes a large proportion of the prevalent human intestinal microbial genes. The genes are largely shared among individuals of the cohort. Over 99% of the genes are bacterial, indicating that the entire cohort harbours between 1,000 and 1,150 prevalent bacterial species and each individual at least 160 such species, which are also largely shared. We define and describe the minimal gut metagenome and the minimal gut bacterial genome in terms of functions present in all individuals and most bacteria, respectively.


Genome Research | 2010

De novo assembly of human genomes with massively parallel short read sequencing

Ruiqiang Li; Hongmei Zhu; Jue Ruan; Wubin Qian; Xiaodong Fang; Zhongbin Shi; Yingrui Li; Shengting Li; Gao Shan; Karsten Kristiansen; Songgang Li; Huanming Yang; Jian Wang; Jun Wang

Next-generation massively parallel DNA sequencing technologies provide ultrahigh throughput at a substantially lower unit data cost; however, the data are very short read length sequences, making de novo assembly extremely challenging. Here, we describe a novel method for de novo assembly of large genomes from short read sequences. We successfully assembled both the Asian and African human genome sequences, achieving an N50 contig size of 7.4 and 5.9 kilobases (kb) and scaffold of 446.3 and 61.9 kb, respectively. The development of this de novo short read assembly method creates new opportunities for building reference sequences and carrying out accurate analyses of unexplored genomes in a cost-effective way.


Nucleic Acids Research | 2006

WEGO: a web tool for plotting GO annotations

Jia Ye; Lin Fang; Hongkun Zheng; Yong Zhang; Jie Chen; Zengjin Zhang; Jing Wang; Shengting Li; Ruiqiang Li; Lars Bolund; Jun Wang

Unified, structured vocabularies and classifications freely provided by the Gene Ontology (GO) Consortium are widely accepted in most of the large scale gene annotation projects. Consequently, many tools have been created for use with the GO ontologies. WEGO (Web Gene Ontology Annotation Plot) is a simple but useful tool for visualizing, comparing and plotting GO annotation results. Different from other commercial software for creating chart, WEGO is designed to deal with the directed acyclic graph structure of GO to facilitate histogram creation of GO annotation results. WEGO has been used widely in many important biological research projects, such as the rice genome project and the silkworm genome project. It has become one of the daily tools for downstream gene annotation analysis, especially when performing comparative genomics tasks. WEGO, along with the two other tools, namely External to GO Query and GO Archive Query, are freely available for all users at . There are two available mirror sites at and . Any suggestions are welcome at [email protected].


Genome Research | 2013

SOAPindel: Efficient identification of indels from short paired reads

Shengting Li; Ruiqiang Li; Heng Li; Jianliang Lu; Yingrui Li; Lars Bolund; Mikkel H. Schierup; Jun Wang

We present a new approach to indel calling that explicitly exploits that indel differences between a reference and a sequenced sample make the mapping of reads less efficient. We assign all unmapped reads with a mapped partner to their expected genomic positions and then perform extensive de novo assembly on the regions with many unmapped reads to resolve homozygous, heterozygous, and complex indels by exhaustive traversal of the de Bruijn graph. The method is implemented in the software SOAPindel and provides a list of candidate indels with quality scores. We compare SOAPindel to Dindel, Pindel, and GATK on simulated data and find similar or better performance for short indels (<10 bp) and higher sensitivity and specificity for long indels. A validation experiment suggests that SOAPindel has a false-positive rate of ∼10% for long indels (>5 bp), while still providing many more candidate indels than other approaches.


Nature Communications | 2015

Novel variation and de novo mutation rates in population-wide de novo assembled Danish trios

Søren Besenbacher; Siyang Liu; Jose M. G. Izarzugaza; Jakob Grove; Kirstine Belling; Jette Bork-Jensen; Shujia Huang; Thomas Damm Als; Shengting Li; Rachita Yadav; Arcadio Rubio-García; Francesco Lescai; Ditte Demontis; Junhua Rao; Weijian Ye; Thomas Mailund; Rune M. Friborg; Christian N. S. Pedersen; Ruiqi Xu; Jihua Sun; Hao Liu; Ou Wang; Xiaofang Cheng; David Flores; Emil Rydza; Kristoffer Rapacki; John Damm Sørensen; Piotr Jaroslaw Chmura; David Westergaard; Piotr Dworzynski

Building a population-specific catalogue of single nucleotide variants (SNVs), indels and structural variants (SVs) with frequencies, termed a national pan-genome, is critical for further advancing clinical and public health genetics in large cohorts. Here we report a Danish pan-genome obtained from sequencing 10 trios to high depth (50 × ). We report 536k novel SNVs and 283k novel short indels from mapping approaches and develop a population-wide de novo assembly approach to identify 132k novel indels larger than 10 nucleotides with low false discovery rates. We identify a higher proportion of indels and SVs than previous efforts showing the merits of high coverage and de novo assembly approaches. In addition, we use trio information to identify de novo mutations and use a probabilistic method to provide direct estimates of 1.27e−8 and 1.5e−9 per nucleotide per generation for SNVs and indels, respectively.


Aging Cell | 2014

The co-occurrence of mtDNA mutations on different oxidative phosphorylation subunits, not detected by haplogroup analysis, affects human longevity and is population specific

Nicola Raule; Federica Sevini; Shengting Li; Annalaura Barbieri; Federica Tallaro; Laura Lomartire; Dario Vianello; Alberto Montesanto; Jukka S. Moilanen; Vladyslav Bezrukov; Hélène Blanché; Antti Hervonen; Kaare Christensen; Luca Deiana; Efstathios S. Gonos; Thomas B. L. Kirkwood; Peter Kristensen; Alberta Leon; Pier Giuseppe Pelicci; Michel Poulain; Irene Maeve Rea; José Remacle; Jean-Marie Robine; Stefan Schreiber; Ewa Sikora; Peternella Eline Slagboom; Liana Spazzafumo; Maria Antonietta Stazi; Olivier Toussaint; James W. Vaupel

To re‐examine the correlation between mtDNA variability and longevity, we examined mtDNAs from samples obtained from over 2200 ultranonagenarians (and an equal number of controls) collected within the framework of the GEHA EU project. The samples were categorized by high‐resolution classification, while about 1300 mtDNA molecules (650 ultranonagenarians and an equal number of controls) were completely sequenced. Sequences, unlike standard haplogroup analysis, made possible to evaluate for the first time the cumulative effects of specific, concomitant mtDNA mutations, including those that per se have a low, or very low, impact. In particular, the analysis of the mutations occurring in different OXPHOS complex showed a complex scenario with a different mutation burden in 90+ subjects with respect to controls. These findings suggested that mutations in subunits of the OXPHOS complex I had a beneficial effect on longevity, while the simultaneous presence of mutations in complex I and III (which also occurs in J subhaplogroups involved in LHON) and in complex I and V seemed to be detrimental, likely explaining previous contradictory results. On the whole, our study, which goes beyond haplogroup analysis, suggests that mitochondrial DNA variation does affect human longevity, but its effect is heavily influenced by the interaction between mutations concomitantly occurring on different mtDNA genes.


Nature Reviews Genetics | 2003

Vertebrate gene predictions and the problem of large genes

Jun Wang; Shengting Li; Yong Zhang; Hongkun Zheng; Zhao Xu; Jia Ye; Jun Yu; Gane Ka-Shu Wong

To find unknown protein-coding genes, annotation pipelines use a combination of ab initio gene prediction and similarity to experimentally confirmed genes or proteins. Here, we show that although the ab initio predictions have an intrinsically high false-positive rate, they also have a consistently low false-negative rate. The incorporation of similarity information is meant to reduce the false-positive rate, but in doing so it increases the false-negative rate. The crucial variable is gene size (including introns) — genes of the most extreme sizes, especially very large genes, are most likely to be incorrectly predicted.


Nucleic Acids Research | 2004

ChickVD: a sequence variation database for the chicken genome

Jing Wang; Ximiao He; Jue Ruan; Mingtao Dai; Jie Chen; Yong Zhang; Yafeng Hu; Chen Ye; Shengting Li; Lijuan Cong; Lin Fang; Bin Liu; Songgang Li; Jian Wang; David W. Burt; Gane Ka-Shu Wong; Jun Yu; Huanming Yang; Jun Wang

Working in parallel with the efforts to sequence the chicken (Gallus gallus) genome, the Beijing Genomics Institute led an international team of scientists from China, USA, UK, Sweden, The Netherlands and Germany to map extensive DNA sequence variation throughout the chicken genome by sampling DNA from domestic breeds. Using the Red Jungle Fowl genome sequence as a reference, we identified 3.1 million non-redundant DNA sequence variants. To facilitate the application of our data to avian genetics and to provide a foundation for functional and evolutionary studies, we created the ‘Chicken Variation Database’ (ChickVD). A graphical MapView shows variants mapped onto the chicken genome in the context of gene annotations and other features, including genetic markers, trait loci, cDNAs, chicken orthologs of human disease genes and raw sequence traces. ChickVD also stores information on quantitative trait loci using data from collaborating institutions and public resources. Our data can be queried by search engine and homology-based BLAST searches. ChickVD is publicly accessible at http://chicken.genomics.org.cn.


BMC Bioinformatics | 2009

Estimation of tumor heterogeneity using CGH array data

Kai Wang; Jian Li; Shengting Li; Lars Bolund; Carsten Wiuf

BackgroundArray-based comparative genomic hybridization (CGH) is a commonly-used approach to detect DNA copy number variation in whole genome-wide screens. Several statistical methods have been proposed to define genomic segments with different copy numbers in cancer tumors. However, most tumors are heterogeneous and show variation in DNA copy numbers across tumor cells. The challenge is to reveal the copy number profiles of the subpopulations in a tumor and to estimate the percentage of each subpopulation.ResultsWe describe a relation between experimental data and exact DNA copy number and develop a statistical method to reveal the heterogeneity of tumors containing a mixture of different-stage cells. Furthermore, we validate the method on simulated data and apply the method to 29 pairs of breast primary tumors and their matched lymph node metastases.ConclusionWe demonstrate a new method for CGH array analysis that allows a tumor sample to be classified according to its heterogeneity. The method gives an interpretable series of copy number profiles, one for each major subpopulation in a tumor. The profiles facilitate identification of copy number alterations in cancer development.


BMC Genomics | 2009

An improved method for genome wide DNA methylation profiling correlated to transcription and genomic instability in two breast cancer cell lines.

Jian Li; Fei Gao; Ning Li; Shengting Li; Guangliang Yin; Geng Tian; Shangang Jia; Kai Wang; Xiuqing Zhang; Huanming Yang; Anders Lade Nielsen; Lars Bolund

BackgroundDNA methylation is a widely studied epigenetic mechanism known to correlate with gene repression and genomic stability. Development of sensitive methods for global detection of DNA methylation events is of particular importance.ResultsWe here describe a technique, called modified methylation-specific digital karyotyping (MMSDK) based on methylation-specific digital karyotyping (MSDK) with a novel sequencing approach. Briefly, after a tandem digestion of genomic DNA with a methylation-sensitive mapping enzyme and a fragmenting enzyme, short sequence tags are obtained. These tags are amplified, followed by direct, massively parallel sequencing (Solexa 1G Genome Analyzer). This method allows high-throughput and low-cost genome-wide DNA methylation mapping. We applied this method to investigate global DNA methylation profiles for widely used breast cancer cell lines, MCF-7 and MDA-MB-231, which are representatives for luminal-like and mesenchymal-like cancer types, respectively. By comparison, a highly similar overall DNA methylation pattern was revealed for the two cell lines. However a cohort of individual genomic loci with significantly different DNA methylation status between two cell lines was identified. Furthermore, we revealed a genome-wide significant correlation between gene expression and the methylation status of gene promoters with CpG islands (CGIs) in the two cancer cell lines, and a correlation of gene expression and the methylation status of promoters without CGIs in MCF-7 cells.ConclusionThe MMSDK method will be a valuable tool to increase the current knowledge of genome wide DNA methylation profiles.

Collaboration


Dive into the Shengting Li's collaboration.

Top Co-Authors

Avatar
Top Co-Authors

Avatar

R. Yang

Chinese Academy of Sciences

View shared research outputs
Top Co-Authors

Avatar

Huanming Yang

Chinese Academy of Sciences

View shared research outputs
Top Co-Authors

Avatar

Jun Wang

Chinese Academy of Sciences

View shared research outputs
Top Co-Authors

Avatar

Y.L. Hao

Chinese Academy of Sciences

View shared research outputs
Top Co-Authors

Avatar
Top Co-Authors

Avatar
Top Co-Authors

Avatar

H. Wang

Chinese Academy of Sciences

View shared research outputs
Top Co-Authors

Avatar

Jian Wang

Guangzhou Medical University

View shared research outputs
Top Co-Authors

Avatar

Yang Hao

Queen Mary University of London

View shared research outputs
Researchain Logo
Decentralizing Knowledge