Network


Latest external collaboration on country level. Dive into details by clicking on the dots.

Hotspot


Dive into the research topics where Leo Goodstadt is active.

Publication


Featured researches published by Leo Goodstadt.


Nature | 2011

Mouse genomic variation and its effect on phenotypes and gene regulation.

Thomas M. Keane; Leo Goodstadt; Petr Danecek; Michael A. White; Kim Wong; Binnaz Yalcin; Andreas Heger; Avigail Agam; Guy Slater; Martin Goodson; N A Furlotte; Eleazar Eskin; Christoffer Nellåker; H Whitley; James Cleak; Deborah Janowitz; Polinka Hernandez-Pliego; Andrew Edwards; T G Belgard; Peter L. Oliver; Rebecca E McIntyre; Amarjit Bhomra; Jérôme Nicod; Xiangchao Gan; Wei Yuan; L van der Weyden; Charles A. Steward; Sendu Bala; Jim Stalker; Richard Mott

We report genome sequences of 17 inbred strains of laboratory mice and identify almost ten times more variants than previously known. We use these genomes to explore the phylogenetic history of the laboratory mouse and to examine the functional consequences of allele-specific variation on transcript abundance, revealing that at least 12% of transcripts show a significant tissue-specific expression bias. By identifying candidate functional variants at 718 quantitative trait loci we show that the molecular nature of functional variants and their position relative to genes vary according to the effect size of the locus. These sequences provide a starting point for a new era in the functional analysis of a key model organism.


PLOS Biology | 2009

Lineage-Specific Biology Revealed by a Finished Genome Assembly of the Mouse

Deanna M. Church; Leo Goodstadt; LaDeana W. Hillier; Michael C. Zody; Steve Goldstein; Xinwe She; Richa Agarwala; Joshua L. Cherry; Michael DiCuccio; Wratko Hlavina; Yuri Kapustin; Peter Meric; Donna Maglott; Zoë Birtle; Ana C. Marques; Tina Graves; Shiguo Zhou; Brian Teague; Konstantinos Potamousis; Chris Churas; Michael Place; Jill Herschleb; Ron Runnheim; Dan Forrest; James M. Amos-Landgraf; David C. Schwartz; Ze Cheng; Kerstin Lindblad-Toh; Evan E. Eichler; Chris P. Ponting

A finished clone-based assembly of the mouse genome reveals extensive recent sequence duplication during recent evolution and rodent-specific expansion of certain gene families. Newly assembled duplications contain protein-coding genes that are mostly involved in reproductive function.


Nature | 2011

Sequence-based characterization of structural variation in the mouse genome.

Binnaz Yalcin; Kim Wong; Avigail Agam; Martin Goodson; Thomas M. Keane; Xiangchao Gan; Christoffer Nellåker; Leo Goodstadt; Jérôme Nicod; Amarjit Bhomra; Polinka Hernandez-Pliego; Helen Whitley; James Cleak; Rebekah Dutton; Deborah Janowitz; Richard Mott; David J. Adams; Jonathan Flint

Structural variation is widespread in mammalian genomes and is an important cause of disease, but just how abundant and important structural variants (SVs) are in shaping phenotypic variation remains unclear. Without knowing how many SVs there are, and how they arise, it is difficult to discover what they do. Combining experimental with automated analyses, we identified 711,920 SVs at 281,243 sites in the genomes of thirteen classical and four wild-derived inbred mouse strains. The majority of SVs are less than 1 kilobase in size and 98% are deletions or insertions. The breakpoints of 160,000 SVs were mapped to base pair resolution, allowing us to infer that insertion of retrotransposons causes more than half of SVs. Yet, despite their prevalence, SVs are less likely than other sequence variants to cause gene expression or quantitative phenotypic variation. We identified 24 SVs that disrupt coding exons, acting as rare variants of large effect on gene function. One-third of the genes so affected have immunological functions.


Bioinformatics | 2001

CHROMA: consensus-based colouring of multiple alignments for publication.

Leo Goodstadt; Chris P. Ponting

CHROMA annotates multiple protein sequence alignments by consensus to produce formatted and coloured text suitable for incorporation into other documents for publication. The package is designed to be flexible and reliable, and has a simple-to-use graphical user interface running under Microsoft Windows. Both the executables and source code for CHROMA running under Windows and Linux (portable command-line only) are freely available at http://www.lg.ndirect.co.uk/chroma. Software enquiries should be directed to [email protected].


Genome Biology | 2010

Long noncoding RNA genes: conservation of sequence and brain expression among diverse amniotes

Rebecca A. Chodroff; Leo Goodstadt; Tamara Sirey; Peter L. Oliver; Kay E. Davies; Eric D. Green; Zoltán Molnár; Chris P. Ponting

BackgroundLong considered to be the building block of life, it is now apparent that protein is only one of many functional products generated by the eukaryotic genome. Indeed, more of the human genome is transcribed into noncoding sequence than into protein-coding sequence. Nevertheless, whilst we have developed a deep understanding of the relationships between evolutionary constraint and function for protein-coding sequence, little is known about these relationships for non-coding transcribed sequence. This dearth of information is partially attributable to a lack of established non-protein-coding RNA (ncRNA) orthologs among birds and mammals within sequence and expression databases.ResultsHere, we performed a multi-disciplinary study of four highly conserved and brain-expressed transcripts selected from a list of mouse long intergenic noncoding RNA (lncRNA) loci that generally show pronounced evolutionary constraint within their putative promoter regions and across exon-intron boundaries. We identify some of the first lncRNA orthologs present in birds (chicken), marsupial (opossum), and eutherian mammals (mouse), and investigate whether they exhibit conservation of brain expression. In contrast to conventional protein-coding genes, the sequences, transcriptional start sites, exon structures, and lengths for these non-coding genes are all highly variable.ConclusionsThe biological relevance of lncRNAs would be highly questionable if they were limited to closely related phyla. Instead, their preservation across diverse amniotes, their apparent conservation in exon structure, and similarities in their pattern of brain expression during embryonic and early postnatal stages together indicate that these are functional RNA molecules, of which some have roles in vertebrate brain development.


PLOS Computational Biology | 2006

Phylogenetic Reconstruction of Orthology, Paralogy, and Conserved Synteny for Dog and Human

Leo Goodstadt; Chris P. Ponting

Accurate predictions of orthology and paralogy relationships are necessary to infer human molecular function from experiments in model organisms. Previous genome-scale approaches to predicting these relationships have been limited by their use of protein similarity and their failure to take into account multiple splicing events and gene prediction errors. We have developed PhyOP, a new phylogenetic orthology prediction pipeline based on synonymous rate estimates, which accurately predicts orthology and paralogy relationships for transcripts, genes, exons, or genomic segments between closely related genomes. We were able to identify orthologue relationships to human genes for 93% of all dog genes from Ensembl. Among 1:1 orthologues, the alignments covered a median of 97.4% of protein sequences, and 92% of orthologues shared essentially identical gene structures. PhyOP accurately recapitulated genomic maps of conserved synteny. Benchmarking against predictions from Ensembl and Inparanoid showed that PhyOP is more accurate, especially in its predictions of paralogy. Nearly half (46%) of PhyOP paralogy predictions are unique. Using PhyOP to investigate orthologues and paralogues in the human and dog genomes, we found that the human assembly contains 3-fold more gene duplications than the dog. Species-specific duplicate genes, or “in-paralogues,” are generally shorter and have fewer exons than 1:1 orthologues, which is consistent with selective constraints and mutation biases based on the sizes of duplicated genes. In-paralogues have experienced elevated amino acid and synonymous nucleotide substitution rates. Duplicates possess similar biological functions for either the dog or human lineages. Having accounted for 2,954 likely pseudogenes and gene fragments, and after separating 346 erroneously merged genes, we estimated that the human genome encodes a minimum of 19,700 protein-coding genes, similar to the gene count of nematode worms. PhyOP is a fast and robust approach to orthology prediction that will be applicable to whole genomes from multiple closely related species. PhyOP will be particularly useful in predicting orthology for mammalian genomes that have been incompletely sequenced, and for large families of rapidly duplicating genes.


Nature Genetics | 2008

EYS, encoding an ortholog of Drosophila spacemaker, is mutated in autosomal recessive retinitis pigmentosa.

Mai M. Abd El-Aziz; Isabel Barragan; Ciara A. O'Driscoll; Leo Goodstadt; Elena Prigmore; Salud Borrego; Marcela Mena; Juan Ignacio Pieras; Mohamed F. El-Ashry; Leen Abu Safieh; Amna Z. Shah; Michael E. Cheetham; Nigel P. Carter; Christina Chakarova; Chris P. Ponting; S.S. Bhattacharya; Guillermo Antiñolo

Using a positional cloning approach supported by comparative genomics, we have identified a previously unreported gene, EYS, at the RP25 locus on chromosome 6q12 commonly mutated in autosomal recessive retinitis pigmentosa. Spanning over 2 Mb, this is the largest eye-specific gene identified so far. EYS is independently disrupted in four other mammalian lineages, including that of rodents, but is well conserved from Drosophila to man and is likely to have a role in the modeling of retinal architecture.


Genome Research | 2011

Collaborative Cross mice and their power to map host susceptibility to Aspergillus fumigatus infection

Caroline Durrant; Hanna Tayem; Binnaz Yalcin; James Cleak; Leo Goodstadt; Fernando Pardo-Manuel de Villena; Richard Mott; Fuad A. Iraqi

The Collaborative Cross (CC) is a genetic reference panel of recombinant inbred lines of mice, designed for the dissection of complex traits and gene networks. Each line is independently descended from eight genetically diverse founder strains such that the genomes of the CC lines, once fully inbred, are fine-grained homozygous mosaics of the founder haplotypes. We present an analysis of 120 CC lines, from a cohort of the CC bred at Tel Aviv University in collaboration with the University of Oxford, which at the time of this study were between the sixth and 12th generations of inbreeding and substantially homozygous at 170,000 SNPs. We show how CC genomes decompose into mosaics, and we identify loci that carry a deficiency or excess of a founder, many being deficient for the wild-derived strains WSB/EiJ and PWK/PhJ. We phenotyped 371 mice from 66 CC lines for a susceptibility to Aspergillus fumigatus infection. The survival time after infection varied significantly between CC lines. Quantitative trait locus (QTL) mapping identified genome-wide significant QTLs on chromosomes 2, 3, 8, 10 (two QTLs), 15, and 18. Simulations show that QTL mapping resolution (the median distance between the QTL peak and true location) varied between 0.47 and 1.18 Mb. Most of the QTLs involved contrasts between wild-derived founder strains and therefore would not segregate between classical inbred strains. Use of variation data from the genomes of the CC founder strains refined these QTLs further and suggested several candidate genes. These results support the use of the CC for dissecting complex traits.


Bioinformatics | 2010

Ruffus: a lightweight Python library for computational pipelines

Leo Goodstadt

SUMMARY Computational pipelines are common place in scientific research. However, most of the resources for constructing pipelines are heavyweight systems with graphical user interfaces. Ruffus is a library for the creation of computational pipelines. Its lightweight and unobtrusive design recommends it for use even for the most trivial of analyses. At the same time, it is powerful enough to have been used for complex workflows involving more than 50 interdependent stages. AVAILABILITY AND IMPLEMENTATION Ruffus is written in python. Source code, a short tutorial, examples and a comprehensive user manual are freely available at http://www.ruffus.org.uk. The example program is available at http://www.ruffus.org.uk/examples/bioinformatics


BMC Genomics | 2005

Duplication and positive selection among hominin-specific PRAME genes

Zoë Birtle; Leo Goodstadt; Chris P. Ponting

BackgroundThe physiological and phenotypic differences between human and chimpanzee are largely specified by our genomic differences. We have been particularly interested in recent duplications in the human genome as examples of relatively large-scale changes to our genome. We performed an in-depth evolutionary analysis of a region of chromosome 1, which is copy number polymorphic among humans, and that contains at least 32 PRAME (Preferentially expressed antigen of melanoma) genes and pseudogenes. PRAME-like genes are expressed in the testis and in a large number of tumours, and are thought to possess roles in spermatogenesis and oogenesis.ResultsUsing nucleotide substitution rate estimates for exons and introns, we show that two large segmental duplications, of six and seven human PRAME genes respectively, occurred in the last 3 million years. These duplicated genes are thus hominin-specific, having arisen in our genome since the divergence from chimpanzee. This cluster of PRAME genes appears to have arisen initially from a translocation approximately 95–85 million years ago. We identified multiple sites within human or mouse PRAME sequences which exhibit strong evidence of positive selection. These form a pronounced cluster on one face of the predicted PRAME protein structure.ConclusionWe predict that PRAME genes evolved adaptively due to strong competition between rapidly-dividing cells during spermatogenesis and oogenesis. We suggest that as PRAME gene copy number is polymorphic among individuals, positive selection of PRAME alleles may still prevail within the human population.

Collaboration


Dive into the Leo Goodstadt's collaboration.

Top Co-Authors

Avatar
Top Co-Authors

Avatar

Richard Mott

University College London

View shared research outputs
Top Co-Authors

Avatar
Top Co-Authors

Avatar
Top Co-Authors

Avatar

Jonathan Flint

University of California

View shared research outputs
Top Co-Authors

Avatar

Amarjit Bhomra

Wellcome Trust Centre for Human Genetics

View shared research outputs
Top Co-Authors

Avatar

Anthony G. Doran

European Bioinformatics Institute

View shared research outputs
Top Co-Authors

Avatar

Binnaz Yalcin

Wellcome Trust Centre for Human Genetics

View shared research outputs
Top Co-Authors

Avatar

James Cleak

Wellcome Trust Centre for Human Genetics

View shared research outputs
Top Co-Authors

Avatar

Jérôme Nicod

Wellcome Trust Centre for Human Genetics

View shared research outputs
Researchain Logo
Decentralizing Knowledge