Stephen M. J. Searle | Researchain

Archive Network Publication Hotspot Collaboration

Network

Latest external collaboration on country level. Dive into details by clicking on the dots.

Explore More

Hotspot

Dive into the research topics where Stephen M. J. Searle is active.

Explore More

Publication

Featured researches published by Stephen M. J. Searle.

Genome Research | 2012

GENCODE: The reference human genome annotation for The ENCODE Project

Jennifer Harrow; Adam Frankish; José Manuel Rodríguez González; Electra Tapanari; Mark Diekhans; Felix Kokocinski; Bronwen Aken; Daniel Barrell; Amonida Zadissa; Stephen M. J. Searle; I. Barnes; Alexandra Bignell; Veronika Boychenko; Toby Hunt; Mike Kay; Gaurab Mukherjee; Jeena Rajan; Gloria Despacio-Reyes; Gary Saunders; Charles A. Steward; Rachel A. Harte; Mike Lin; Cédric Howald; Andrea Tanzer; Thomas Derrien; Jacqueline Chrast; Nathalie Walters; Suganthi Balasubramanian; Baikang Pei; Michael L. Tress

The GENCODE Consortium aims to identify all gene features in the human genome using a combination of computational analysis, manual annotation, and experimental validation. Since the first public release of this annotation data set, few new protein-coding loci have been added, yet the number of alternative splicing transcripts annotated has steadily increased. The GENCODE 7 release contains 20,687 protein-coding and 9640 long noncoding RNA loci and has 33,977 coding transcripts not represented in UCSC genes and RefSeq. It also has the most comprehensive annotation of long noncoding RNA (lncRNA) loci publicly available with the predominant transcript form consisting of two exons. We have examined the completeness of the transcript annotation and found that 35% of transcriptional start sites are supported by CAGE clusters and 62% of protein-coding genes have annotated polyA sites. Over one-third of GENCODE protein-coding genes are supported by peptide hits derived from mass spectrometry spectra submitted to Peptide Atlas. New models derived from the Illumina Body Map 2.0 RNA-seq data identify 3689 new loci not currently in GENCODE, of which 3127 consist of two exon models indicating that they are possibly unannotated long noncoding loci. GENCODE 7 is publicly available from gencodegenes.org and via the Ensembl and UCSC Genome Browsers.

Bioinformatics | 2004

The Jalview Java alignment editor

Michele Clamp; James Cuff; Stephen M. J. Searle; Geoffrey J. Barton

Multiple sequence alignment remains a crucial method for understanding the function of groups of related nucleic acid and protein sequences. However, it is known that automatic multiple sequence alignments can often be improved by manual editing. Therefore, tools are needed to view and edit multiple sequence alignments. Due to growth in the sequence databases, multiple sequence alignments can often be large and difficult to view efficiently. The Jalview Java alignment editor is presented here, which enables fast viewing and editing of large multiple sequence alignments.

Nucleic Acids Research | 2002

The Ensembl genome database project

Tim Hubbard; Darren Barker; Ewan Birney; Graham Cameron; Yuan Chen; L. Clark; Tony Cox; James Cuff; V. Curwen; Thomas A. Down; Richard Durbin; E. Eyras; James Gilbert; Martin Hammond; L. Huminiecki; Arek Kasprzyk; Heikki Lehväslaiho; Philip Lijnzaad; Craig Melsopp; Emmanuel Mongin; R. Pettett; M. Pocock; Simon Potter; A. Rust; Esther Schmidt; Stephen M. J. Searle; Guy Slater; J. Smith; W. Spooner; A. Stabenau

The Ensembl (http://www.ensembl.org/) database project provides a bioinformatics framework to organise biology around the sequences of large genomes. It is a comprehensive source of stable automatic annotation of the human genome sequence, with confirmed gene predictions that have been integrated with external data sources, and is available as either an interactive web site or as flat files. It is also an open source software engineering project to develop a portable system able to handle very large genomes and associated requirements from sequence analysis to data storage and visualisation. The Ensembl site is one of the leading sources of human genome sequence annotation and provided much of the analysis for publication by the international human genome project of the draft genome. The Ensembl system is being installed around the world in both companies and academic sites on machines ranging from supercomputers to laptops.

Nature | 2012

The genomic basis of adaptive evolution in threespine sticklebacks

Felicity C. Jones; Manfred Grabherr; Yingguang Frank Chan; Pamela Russell; Evan Mauceli; Jeremy A. Johnson; Ross Swofford; Mono Pirun; Michael C. Zody; Simon D. M. White; Ewan Birney; Stephen M. J. Searle; Jeremy Schmutz; Jane Grimwood; Mark Dickson; Richard M. Myers; Craig T. Miller; Brian R. Summers; Anne K. Knecht; Shannon D. Brady; Haili Zhang; Alex A. Pollen; Timothy R. Howes; Chris T. Amemiya; Eric S. Lander; Federica Di Palma; Kerstin Lindblad-Toh; David M. Kingsley

Marine stickleback fish have colonized and adapted to thousands of streams and lakes formed since the last ice age, providing an exceptional opportunity to characterize genomic mechanisms underlying repeated ecological adaptation in nature. Here we develop a high-quality reference genome assembly for threespine sticklebacks. By sequencing the genomes of twenty additional individuals from a global set of marine and freshwater populations, we identify a genome-wide set of loci that are consistently associated with marine–freshwater divergence. Our results indicate that reuse of globally shared standing genetic variation, including chromosomal inversions, has an important role in repeated evolution of distinct marine and freshwater sticklebacks, and in the maintenance of divergent ecotypes during early stages of reproductive isolation. Both coding and regulatory changes occur in the set of loci underlying marine–freshwater evolution, but regulatory changes appear to predominate in this well known example of repeated adaptive evolution in nature.

Nature | 2012

Insights into hominid evolution from the gorilla genome sequence.

Aylwyn Scally; Julien Y. Dutheil; LaDeana W. Hillier; Gregory Jordan; Ian Goodhead; Javier Herrero; Asger Hobolth; Tuuli Lappalainen; Thomas Mailund; Tomas Marques-Bonet; Shane McCarthy; Stephen H. Montgomery; Petra C. Schwalie; Y. Amy Tang; Michelle C. Ward; Yali Xue; Bryndis Yngvadottir; Can Alkan; Lars Nørvang Andersen; Qasim Ayub; Edward V. Ball; Kathryn Beal; Brenda J. Bradley; Yuan Chen; Chris Clee; Stephen Fitzgerald; Tina Graves; Yong Gu; Paul Heath; Andreas Heger

Gorillas are humans’ closest living relatives after chimpanzees, and are of comparable importance for the study of human origins and evolution. Here we present the assembly and analysis of a genome sequence for the western lowland gorilla, and compare the whole genomes of all extant great ape genera. We propose a synthesis of genetic and fossil evidence consistent with placing the human–chimpanzee and human–chimpanzee–gorilla speciation events at approximately 6 and 10 million years ago. In 30% of the genome, gorilla is closer to human or chimpanzee than the latter are to each other; this is rarer around coding genes, indicating pervasive selection throughout great ape evolution, and has functional consequences in gene expression. A comparison of protein coding genes reveals approximately 500 genes showing accelerated evolution on each of the gorilla, human and chimpanzee lineages, and evidence for parallel acceleration, particularly of genes involved in hearing. We also compare the western and eastern gorilla species, estimating an average sequence divergence time 1.75 million years ago, but with evidence for more recent genetic exchange and a population bottleneck in the eastern species. The use of the genome sequence in these and future analyses will promote a deeper understanding of great ape biology and evolution.

Nature | 2013

The African coelacanth genome provides insights into tetrapod evolution.

Chris T. Amemiya; Jessica Alföldi; Alison P. Lee; Shaohua Fan; Hervé Philippe; Iain MacCallum; Ingo Braasch; Tereza Manousaki; Igor Schneider; Nicolas Rohner; Chris Organ; Domitille Chalopin; Jeramiah J. Smith; Mark Robinson; Rosemary A. Dorrington; Marco Gerdol; Bronwen Aken; Maria Assunta Biscotti; Marco Barucca; Denis Baurain; Aaron M. Berlin; Francesco Buonocore; Thorsten Burmester; Michael S. Campbell; Adriana Canapa; John P. Cannon; Alan Christoffels; Gianluca De Moro; Adrienne L. Edkins; Lin Fan

The discovery of a living coelacanth specimen in 1938 was remarkable, as this lineage of lobe-finned fish was thought to have become extinct 70 million years ago. The modern coelacanth looks remarkably similar to many of its ancient relatives, and its evolutionary proximity to our own fish ancestors provides a glimpse of the fish that first walked on land. Here we report the genome sequence of the African coelacanth, Latimeria chalumnae. Through a phylogenomic analysis, we conclude that the lungfish, and not the coelacanth, is the closest living relative of tetrapods. Coelacanth protein-coding genes are significantly more slowly evolving than those of tetrapods, unlike other genomic features. Analyses of changes in genes and regulatory elements during the vertebrate adaptation to land highlight genes involved in immunity, nitrogen excretion and the development of fins, tail, ear, eye, brain and olfaction. Functional assays of enhancers involved in the fin-to-limb transition and in the emergence of extra-embryonic tissues show the importance of the coelacanth genome as a blueprint for understanding tetrapod evolution.The discovery of a living coelacanth specimen in 1938 was remarkable, as this lineage of lobe-finned fish was thought to have become extinct 70 million years ago. The modern coelacanth looks remarkably similar to many of its ancient relatives, and its evolutionary proximity to our own fish ancestors provides a glimpse of the fish that first walked on land. Here we report the genome sequence of the African coelacanth, Latimeria chalumnae. Through a phylogenomic analysis, we conclude that the lungfish, and not the coelacanth, is the closest living relative of tetrapods. Coelacanth protein-coding genes are significantly more slowly evolving than those of tetrapods, unlike other genomic features. Analyses of changes in genes and regulatory elements during the vertebrate adaptation to land highlight genes involved in immunity, nitrogen excretion and the development of fins, tail, ear, eye, brain and olfaction. Functional assays of enhancers involved in the fin-to-limb transition and in the emergence of extra-embryonic tissues show the importance of the coelacanth genome as a blueprint for understanding tetrapod evolution.

Nature | 2011

Comparative and demographic analysis of orang-utan genomes

Devin P. Locke; LaDeana W. Hillier; Wesley C. Warren; Kim C. Worley; Lynne V. Nazareth; Donna M. Muzny; Shiaw-Pyng Yang; Zhengyuan Wang; Asif T. Chinwalla; Patrick Minx; Makedonka Mitreva; Lisa Cook; Kim D. Delehaunty; Catrina C. Fronick; Heather K. Schmidt; Lucinda A. Fulton; Robert S. Fulton; Joanne O. Nelson; Vincent Magrini; Craig S. Pohl; Tina Graves; Chris Markovic; Andy Cree; Huyen Dinh; Jennifer Hume; Christie Kovar; Gerald Fowler; Gerton Lunter; Stephen Meader; Andreas Heger

‘Orang-utan’ is derived from a Malay term meaning ‘man of the forest’ and aptly describes the southeast Asian great apes native to Sumatra and Borneo. The orang-utan species, Pongo abelii (Sumatran) and Pongo pygmaeus (Bornean), are the most phylogenetically distant great apes from humans, thereby providing an informative perspective on hominid evolution. Here we present a Sumatran orang-utan draft genome assembly and short read sequence data from five Sumatran and five Bornean orang-utan genomes. Our analyses reveal that, compared to other primates, the orang-utan genome has many unique features. Structural evolution of the orang-utan genome has proceeded much more slowly than other great apes, evidenced by fewer rearrangements, less segmental duplication, a lower rate of gene family turnover and surprisingly quiescent Alu repeats, which have played a major role in restructuring other primate genomes. We also describe a primate polymorphic neocentromere, found in both Pongo species, emphasizing the gradual evolution of orang-utan genome structure. Orang-utans have extremely low energy usage for a eutherian mammal, far lower than their hominid relatives. Adding their genome to the repertoire of sequenced primates illuminates new signals of positive selection in several pathways including glycolipid metabolism. From the population perspective, both Pongo species are deeply diverse; however, Sumatran individuals possess greater diversity than their Bornean counterparts, and more species-specific variation. Our estimate of Bornean/Sumatran speciation time, 400,000 years ago, is more recent than most previous studies and underscores the complexity of the orang-utan speciation process. Despite a smaller modern census population size, the Sumatran effective population size (Ne) expanded exponentially relative to the ancestral Ne after the split, while Bornean Ne declined over the same period. Overall, the resources and analyses presented here offer new opportunities in evolutionary genomics, insights into hominid biology, and an extensive database of variation for conservation efforts.

Nature Genetics | 2013

Sequencing of the sea lamprey (Petromyzon marinus) genome provides insights into vertebrate evolution.

Jeramiah J. Smith; Shigehiro Kuraku; Carson Holt; Tatjana Sauka-Spengler; Ning Jiang; Michael S. Campbell; Mark Yandell; Tereza Manousaki; Axel Meyer; Ona Bloom; Jennifer R. Morgan; Joseph D. Buxbaum; Ravi Sachidanandam; Carrie Sims; Alexander S. Garruss; Malcolm Cook; Robb Krumlauf; Leanne M. Wiedemann; Stacia A. Sower; Wayne A. Decatur; Jeffrey A. Hall; Chris T. Amemiya; Nil Ratan Saha; Katherine M. Buckley; Jonathan P. Rast; Sabyasachi Das; Masayuki Hirano; Nathanael McCurley; Peng Guo; Nicolas Rohner

Lampreys are representatives of an ancient vertebrate lineage that diverged from our own ∼500 million years ago. By virtue of this deeply shared ancestry, the sea lamprey (P. marinus) genome is uniquely poised to provide insight into the ancestry of vertebrate genomes and the underlying principles of vertebrate biology. Here, we present the first lamprey whole-genome sequence and assembly. We note challenges faced owing to its high content of repetitive elements and GC bases, as well as the absence of broad-scale sequence information from closely related species. Analyses of the assembly indicate that two whole-genome duplications likely occurred before the divergence of ancestral lamprey and gnathostome lineages. Moreover, the results help define key evolutionary events within vertebrate lineages, including the origin of myelin-associated proteins and the development of appendages. The lamprey genome provides an important resource for reconstructing vertebrate origins and the evolutionary events that have shaped the genomes of extant organisms.

PLOS Biology | 2010

Multi-platform next-generation sequencing of the domestic Turkey (Meleagris gallopavo): Genome assembly and analysis

Rami A. Dalloul; Julie A Long; Aleksey V. Zimin; Luqman Aslam; Kathryn Beal; Le Ann Blomberg; Pascal Bouffard; David W. Burt; Oswald Crasta; R.P.M.A. Crooijmans; Kristal L. Cooper; Roger A. Coulombe; Supriyo De; Mary E. Delany; Jerry B. Dodgson; Jennifer J Dong; Clive Evans; Karin M. Frederickson; Paul Flicek; Liliana Florea; Otto Folkerts; M.A.M. Groenen; Tim Harkins; Javier Herrero; Steve Hoffmann; Hendrik-Jan Megens; Andrew Jiang; Pieter J. de Jong; Peter K. Kaiser; Heebal Kim

The combined application of next-generation sequencing platforms has provided an economical approach to unlocking the potential of the turkey genome.

Nucleic Acids Research | 2010

Ensembl’s 10th year

Paul Flicek; Bronwen Aken; Benoit Ballester; Kathryn Beal; Eugene Bragin; Simon Brent; Yuan Chen; Peter Clapham; Guy Coates; Susan Fairley; Stephen Fitzgerald; Julio Fernandez-Banet; Leo Gordon; Stefan Gräf; Syed Haider; Martin Hammond; Kerstin Howe; Andrew M. Jenkinson; Nathan Johnson; Andreas Kähäri; Damian Keefe; Stephen Keenan; Rhoda Kinsella; Felix Kokocinski; Gautier Koscielny; Eugene Kulesha; Daniel Lawson; Ian Longden; Tim Massingham; William M. McLaren

Ensembl (http://www.ensembl.org) integrates genomic information for a comprehensive set of chordate genomes with a particular focus on resources for human, mouse, rat, zebrafish and other high-value sequenced genomes. We provide complete gene annotations for all supported species in addition to specific resources that target genome variation, function and evolution. Ensembl data is accessible in a variety of formats including via our genome browser, API and BioMart. This year marks the tenth anniversary of Ensembl and in that time the project has grown with advances in genome technology. As of release 56 (September 2009), Ensembl supports 51 species including marmoset, pig, zebra finch, lizard, gorilla and wallaby, which were added in the past year. Major additions and improvements to Ensembl since our previous report include the incorporation of the human GRCh37 assembly, enhanced visualisation and data-mining options for the Ensembl regulatory features and continued development of our software infrastructure.

Explore More