Kerstin Lindblad-Toh | Researchain

Archive Network Publication Hotspot Collaboration

Network

Latest external collaboration on country level. Dive into details by clicking on the dots.

Explore More

Hotspot

Dive into the research topics where Kerstin Lindblad-Toh is active.

Explore More

Publication

Featured researches published by Kerstin Lindblad-Toh.

Nature Biotechnology | 2011

Full-length transcriptome assembly from RNA-Seq data without a reference genome

Manfred Grabherr; Brian J. Haas; Moran Yassour; Joshua Z. Levin; Dawn Anne Thompson; Ido Amit; Xian Adiconis; Lin Fan; Raktima Raychowdhury; Qiandong Zeng; Zehua Chen; Evan Mauceli; Nir Hacohen; Andreas Gnirke; Nicholas Rhind; Federica Di Palma; Bruce Birren; Chad Nusbaum; Kerstin Lindblad-Toh; Nir Friedman; Aviv Regev

Massively parallel sequencing of cDNA has enabled deep and efficient probing of transcriptomes. Current approaches for transcript reconstruction from such data often rely on aligning reads to a reference genome, and are thus unsuitable for samples with a partial or missing reference genome. Here we present the Trinity method for de novo assembly of full-length transcripts and evaluate it on samples from fission yeast, mouse and whitefly, whose reference genome is not yet available. By efficiently constructing and analyzing sets of de Bruijn graphs, Trinity fully reconstructs a large fraction of transcripts, including alternatively spliced isoforms and transcripts from recently duplicated genes. Compared with other de novo transcriptome assemblers, Trinity recovers more full-length transcripts across a broad range of expression levels, with a sensitivity similar to methods that rely on genome alignments. Our approach provides a unified solution for transcriptome reconstruction in any sample, especially in the absence of a reference genome.Massively parallel sequencing of cDNA has enabled deep and efficient probing of transcriptomes. Current approaches for transcript reconstruction from such data often rely on aligning reads to a reference genome, and are thus unsuitable for samples with a partial or missing reference genome. Here we present the Trinity method for de novo assembly of full-length transcripts and evaluate it on samples from fission yeast, mouse and whitefly, whose reference genome is not yet available. By efficiently constructing and analyzing sets of de Bruijn graphs, Trinity fully reconstructs a large fraction of transcripts, including alternatively spliced isoforms and transcripts from recently duplicated genes. Compared with other de novo transcriptome assemblers, Trinity recovers more full-length transcripts across a broad range of expression levels, with a sensitivity similar to methods that rely on genome alignments. Our approach provides a unified solution for transcriptome reconstruction in any sample, especially in the absence of a reference genome.

Nature | 2005

Initial sequence of the chimpanzee genome and comparison with the human genome

Tarjei S. Mikkelsen; LaDeana W. Hillier; Evan E. Eichler; Michael C. Zody; David B. Jaffe; Shiaw-Pyng Yang; Wolfgang Enard; Ines Hellmann; Kerstin Lindblad-Toh; Tasha K. Altheide; Nicoletta Archidiacono; Peer Bork; Jonathan Butler; Jean L. Chang; Ze Cheng; Asif T. Chinwalla; Pieter J. de Jong; Kimberley D. Delehaunty; Catrina C. Fronick; Lucinda L. Fulton; Yoav Gilad; Gustavo Glusman; Sante Gnerre; Tina Graves; Toshiyuki Hayakawa; Karen E. Hayden; Xiaoqiu Huang; Hongkai Ji; W. James Kent; Mary Claire King

Here we present a draft genome sequence of the common chimpanzee (Pan troglodytes). Through comparison with the human genome, we have generated a largely complete catalogue of the genetic differences that have accumulated since the human and chimpanzee species diverged from our common ancestor, constituting approximately thirty-five million single-nucleotide changes, five million insertion/deletion events, and various chromosomal rearrangements. We use this catalogue to explore the magnitude and regional variation of mutational forces shaping these two genomes, and the strength of positive and negative selection acting on their genes. In particular, we find that the patterns of evolution in human and chimpanzee protein-coding genes are highly correlated and dominated by the fixation of neutral and slightly deleterious alleles. We also use the chimpanzee genome as an outgroup to investigate human population genetics and identify signatures of selective sweeps in recent human evolution.Here we present a draft genome sequence of the common chimpanzee (Pan troglodytes). Through comparison with the human genome, we have generated a largely complete catalogue of the genetic differences that have accumulated since the human and chimpanzee species diverged from our common ancestor, constituting approximately thirty-five million single-nucleotide changes, five million insertion/deletion events, and various chromosomal rearrangements. We use this catalogue to explore the magnitude and regional variation of mutational forces shaping these two genomes, and the strength of positive and negative selection acting on their genes. In particular, we find that the patterns of evolution in human and chimpanzee protein-coding genes are highly correlated and dominated by the fixation of neutral and slightly deleterious alleles. We also use the chimpanzee genome as an outgroup to investigate human population genetics and identify signatures of selective sweeps in recent human evolution.

Nature | 2005

Systematic discovery of regulatory motifs in human promoters and 3' UTRs by comparison of several mammals.

Xiaohui Xie; Jun Lu; Edward J. Kulbokas; Todd R. Golub; Vamsi K. Mootha; Kerstin Lindblad-Toh; Eric S. Lander; Manolis Kellis

Comprehensive identification of all functional elements encoded in the human genome is a fundamental need in biomedical research. Here, we present a comparative analysis of the human, mouse, rat and dog genomes to create a systematic catalogue of common regulatory motifs in promoters and 3′ untranslated regions (3′ UTRs). The promoter analysis yields 174 candidate motifs, including most previously known transcription-factor binding sites and 105 new motifs. The 3′-UTR analysis yields 106 motifs likely to be involved in post-transcriptional regulation. Nearly one-half are associated with microRNAs (miRNAs), leading to the discovery of many new miRNA genes and their likely target genes. Our results suggest that previous estimates of the number of human miRNA genes were low, and that miRNAs regulate at least 20% of human genes. The overall results provide a systematic view of gene regulation in the human, which will be refined as additional mammalian genomes become available.

Nature | 2004

Genome duplication in the teleost fish Tetraodon nigroviridis reveals the early vertebrate proto-karyotype

Olivier Jaillon; Jean-Marc Aury; Frédéric Brunet; Jean-Louis Petit; Nicole Stange-Thomann; Evan Mauceli; Laurence Bouneau; Cécile Fischer; Catherine Ozouf-Costaz; Alain Bernot; Sophie Nicaud; David B. Jaffe; Sheila Fisher; Georges Lutfalla; Carole Dossat; Béatrice Segurens; Corinne Dasilva; Marcel Salanoubat; Michael Levy; Nathalie Boudet; Sergi Castellano; Véronique Anthouard; Claire Jubin; Vanina Castelli; Michael Katinka; Benoit Vacherie; Christian Biémont; Zineb Skalli; Laurence Cattolico; Julie Poulain

Tetraodon nigroviridis is a freshwater puffer fish with the smallest known vertebrate genome. Here, we report a draft genome sequence with long-range linkage and substantial anchoring to the 21 Tetraodon chromosomes. Genome analysis provides a greatly improved fish gene catalogue, including identifying key genes previously thought to be absent in fish. Comparison with other vertebrates and a urochordate indicates that fish proteins have diverged markedly faster than their mammalian homologues. Comparison with the human genome suggests ∼900 previously unannotated human genes. Analysis of the Tetraodon and human genomes shows that whole-genome duplication occurred in the teleost fish lineage, subsequent to its divergence from mammals. The analysis also makes it possible to infer the basic structure of the ancestral bony vertebrate genome, which was composed of 12 chromosomes, and to reconstruct much of the evolutionary history of ancient and recent chromosome rearrangements leading to the modern human karyotype.

Cell | 2005

Genomic Maps and Comparative Analysis of Histone Modifications in Human and Mouse

Bradley E. Bernstein; Michael Kamal; Kerstin Lindblad-Toh; Stefan Bekiranov; Dione K. Bailey; Dana J. Huebert; Scott McMahon; Elinor K. Karlsson; Edward J. Kulbokas; Thomas R. Gingeras; Stuart L. Schreiber; Eric S. Lander

We mapped histone H3 lysine 4 di- and trimethylation and lysine 9/14 acetylation across the nonrepetitive portions of human chromosomes 21 and 22 and compared patterns of lysine 4 dimethylation for several orthologous human and mouse loci. Both chromosomes show punctate sites enriched for modified histones. Sites showing trimethylation correlate with transcription starts, while those showing mainly dimethylation occur elsewhere in the vicinity of active genes. Punctate methylation patterns are also evident at the cytokine and IL-4 receptor loci. The Hox clusters present a strikingly different picture, with broad lysine 4-methylated regions that overlay multiple active genes. We suggest these regions represent active chromatin domains required for the maintenance of Hox gene expression. Methylation patterns at orthologous loci are strongly conserved between human and mouse even though many methylated sites do not show sequence conservation notably higher than background. This suggests that the DNA elements that direct the methylation represent only a small fraction of the region or lie at some distance from the site.

Nature | 2012

The genomic basis of adaptive evolution in threespine sticklebacks

Felicity C. Jones; Manfred Grabherr; Yingguang Frank Chan; Pamela Russell; Evan Mauceli; Jeremy A. Johnson; Ross Swofford; Mono Pirun; Michael C. Zody; Simon D. M. White; Ewan Birney; Stephen M. J. Searle; Jeremy Schmutz; Jane Grimwood; Mark Dickson; Richard M. Myers; Craig T. Miller; Brian R. Summers; Anne K. Knecht; Shannon D. Brady; Haili Zhang; Alex A. Pollen; Timothy R. Howes; Chris T. Amemiya; Eric S. Lander; Federica Di Palma; Kerstin Lindblad-Toh; David M. Kingsley

Marine stickleback fish have colonized and adapted to thousands of streams and lakes formed since the last ice age, providing an exceptional opportunity to characterize genomic mechanisms underlying repeated ecological adaptation in nature. Here we develop a high-quality reference genome assembly for threespine sticklebacks. By sequencing the genomes of twenty additional individuals from a global set of marine and freshwater populations, we identify a genome-wide set of loci that are consistently associated with marine–freshwater divergence. Our results indicate that reuse of globally shared standing genetic variation, including chromosomal inversions, has an important role in repeated evolution of distinct marine and freshwater sticklebacks, and in the maintenance of divergent ecotypes during early stages of reproductive isolation. Both coding and regulatory changes occur in the set of loci underlying marine–freshwater evolution, but regulatory changes appear to predominate in this well known example of repeated adaptive evolution in nature.

Nature | 2011

A high-resolution map of human evolutionary constraint using 29 mammals

Kerstin Lindblad-Toh; Manuel Garber; Or Zuk; Michael F. Lin; Brian J. Parker; Stefan Washietl; Pouya Kheradpour; Jason Ernst; Gregory Jordan; Evan Mauceli; Lucas D. Ward; Craig B. Lowe; Alisha K. Holloway; Michele Clamp; Sante Gnerre; Jessica Alföldi; Kathryn Beal; Jean Chang; Hiram Clawson; James Cuff; Federica Di Palma; Stephen Fitzgerald; Paul Flicek; Mitchell Guttman; Melissa J. Hubisz; David B. Jaffe; Irwin Jungreis; W. James Kent; Dennis Kostka; Marcia Lara

The comparison of related genomes has emerged as a powerful lens for genome interpretation. Here we report the sequencing and comparative analysis of 29 eutherian genomes. We confirm that at least 5.5% of the human genome has undergone purifying selection, and locate constrained elements covering ∼4.2% of the genome. We use evolutionary signatures and comparisons with experimental data sets to suggest candidate functions for ∼60% of constrained bases. These elements reveal a small number of new coding exons, candidate stop codon readthrough events and over 10,000 regions of overlapping synonymous constraint within protein-coding exons. We find 220 candidate RNA structural families, and nearly a million elements overlapping potential promoter, enhancer and insulator regions. We report specific amino acid residues that have undergone positive selection, 280,000 non-coding elements exapted from mobile elements and more than 1,000 primate- and human-accelerated elements. Overlap with disease-associated variants indicates that our findings will be relevant for studies of human biology, health and disease.

Nature | 2010

Whole-genome resequencing reveals loci under selection during chicken domestication

Carl-Johan Rubin; Michael C. Zody; Jonas Eriksson; Jennifer R. S. Meadows; Ellen Sherwood; Matthew T. Webster; Lin Jiang; Max Ingman; Ted Sharpe; Sojeong Ka; Finn Hallböök; Francois Besnier; Örjan Carlborg; Bertrand Bed’hom; Michèle Tixier-Boichard; Per Jensen; P. B. Siegel; Kerstin Lindblad-Toh; Leif Andersson

Domestic animals are excellent models for genetic studies of phenotypic evolution. They have evolved genetic adaptations to a new environment, the farm, and have been subjected to strong human-driven selection leading to remarkable phenotypic changes in morphology, physiology and behaviour. Identifying the genetic changes underlying these developments provides new insight into general mechanisms by which genetic variation shapes phenotypic diversity. Here we describe the use of massively parallel sequencing to identify selective sweeps of favourable alleles and candidate mutations that have had a prominent role in the domestication of chickens (Gallus gallus domesticus) and their subsequent specialization into broiler (meat-producing) and layer (egg-producing) chickens. We have generated 44.5-fold coverage of the chicken genome using pools of genomic DNA representing eight different populations of domestic chickens as well as red jungle fowl (Gallus gallus), the major wild ancestor. We report more than 7,000,000 single nucleotide polymorphisms, almost 1,300 deletions and a number of putative selective sweeps. One of the most striking selective sweeps found in all domestic chickens occurred at the locus for thyroid stimulating hormone receptor (TSHR), which has a pivotal role in metabolic regulation and photoperiod control of reproduction in vertebrates. Several of the selective sweeps detected in broilers overlapped genes associated with growth, appetite and metabolic regulation. We found little evidence that selection for loss-of-function mutations had a prominent role in chicken domestication, but we detected two deletions in coding sequences that we suggest are functionally important. This study has direct application to animal breeding and enhances the importance of the domestic chicken as a model organism for biomedical research.

Science | 2009

Genome Sequence, Comparative Analysis, and Population Genetics of the Domestic Horse

Claire M. Wade; Elena Giulotto; Snaevar Sigurdsson; Monica Zoli; Sante Gnerre; Freyja Imsland; Teri L. Lear; David L. Adelson; Ernest Bailey; Rebecca R. Bellone; Helmut Blöcker; Ottmar Distl; R.C. Edgar; Manuel Garber; Tosso Leeb; Evan Mauceli; James N. MacLeod; M.C.T. Penedo; Joy M. Raison; Ted Sharpe; J. Vogel; Leif Andersson; Douglas F. Antczak; Tara Biagi; M. M. Binns; B.P. Chowdhary; S.J. Coleman; G. Della Valle; Sarah Fryc; Gérard Guérin

A Horse Is a Horse, of Course The history of horse domestication is closely tied to the history of the human society. Wade et al. (p. 865) report on the sequencing and provide a single nucleotide polymorphism map of the horse (Equus caballus) genome. Horses are a member of the order perissodactyla (odd-toed animals with hooves). The analysis reveals an evolutionarily new centromere on equine chromosome 11 that displays properties of an immature but fully functioning centromere and is devoid of centromeric satellite sequence. The findings clarify the nature of genetic diversity within and across horse breeds and suggest that the horse was domesticated from a relatively large number of females, but few males. The horse genome reveals an evolutionary new centromere and conserved chromosomal sequences relative to other mammals. We report a high-quality draft sequence of the genome of the horse (Equus caballus). The genome is relatively repetitive but has little segmental duplication. Chromosomes appear to have undergone few historical rearrangements: 53% of equine chromosomes show conserved synteny to a single human chromosome. Equine chromosome 11 is shown to have an evolutionary new centromere devoid of centromeric satellite DNA, suggesting that centromeric function may arise before satellite repeat accumulation. Linkage disequilibrium, showing the influences of early domestication of large herds of female horses, is intermediate in length between dog and human, and there is long-range haplotype sharing among breeds.

Proceedings of the National Academy of Sciences of the United States of America | 2007

Distinguishing protein-coding and noncoding genes in the human genome

Michele Clamp; Ben Fry; Mike Kamal; Xiaohui Xie; James Cuff; Michael F. Lin; Manolis Kellis; Kerstin Lindblad-Toh; Eric S. Lander

Although the Human Genome Project was completed 4 years ago, the catalog of human protein-coding genes remains a matter of controversy. Current catalogs list a total of ≈24,500 putative protein-coding genes. It is broadly suspected that a large fraction of these entries are functionally meaningless ORFs present by chance in RNA transcripts, because they show no evidence of evolutionary conservation with mouse or dog. However, there is currently no scientific justification for excluding ORFs simply because they fail to show evolutionary conservation: the alternative hypothesis is that most of these ORFs are actually valid human genes that reflect gene innovation in the primate lineage or gene loss in the other lineages. Here, we reject this hypothesis by carefully analyzing the nonconserved ORFs—specifically, their properties in other primates. We show that the vast majority of these ORFs are random occurrences. The analysis yields, as a by-product, a major revision of the current human catalogs, cutting the number of protein-coding genes to ≈20,500. Specifically, it suggests that nonconserved ORFs should be added to the human gene catalog only if there is clear evidence of an encoded protein. It also provides a principled methodology for evaluating future proposed additions to the human gene catalog. Finally, the results indicate that there has been relatively little true innovation in mammalian protein-coding genes.

Explore More