Is this you? Create Your Porfile

Ewan Birney

European Bioinformatics Institute

Archive Network Publication Hotspot Collaboration

Network

Latest external collaboration on country level. Dive into details by clicking on the dots.

Explore More

Hotspot

Dive into the research topics where Ewan Birney is active.

Explore More

Publication

Featured researches published by Ewan Birney.

Nature | 2007

Patterns of somatic mutation in human cancer genomes

Christopher Greenman; Philip Stephens; Raffaella Smith; Gillian L. Dalgliesh; Chris Hunter; Graham R. Bignell; Helen Davies; Jon Teague; Adam Butler; Claire Stevens; Sarah Edkins; Sarah O’Meara; Imre Vastrik; Esther Schmidt; Tim Avis; Syd Barthorpe; Gurpreet Bhamra; Gemma Buck; Bhudipa Choudhury; Jody Clements; Jennifer Cole; Ed Dicks; Simon A. Forbes; Kris Gray; Kelly Halliday; Rachel Harrison; Katy Hills; Jon Hinton; Andy Jenkinson; David Jones

Cancers arise owing to mutations in a subset of genes that confer growth advantage. The availability of the human genome sequence led us to propose that systematic resequencing of cancer genomes for mutations would lead to the discovery of many additional cancer genes. Here we report more than 1,000 somatic mutations found in 274 megabases (Mb) of DNA corresponding to the coding exons of 518 protein kinase genes in 210 diverse human cancers. There was substantial variation in the number and pattern of mutations in individual cancers reflecting different exposures, DNA repair defects and cellular origins. Most somatic mutations are likely to be ‘passengers’ that do not contribute to oncogenesis. However, there was evidence for ‘driver’ mutations contributing to the development of the cancers studied in approximately 120 genes. Systematic sequencing of cancer genomes therefore reveals the evolutionary diversity of cancers and implicates a larger repertoire of cancer genes than previously anticipated.

Science | 2010

A draft sequence of the Neandertal genome.

Richard E. Green; Johannes Krause; Adrian W. Briggs; Tomislav Maricic; Udo Stenzel; Martin Kircher; Nick Patterson; Heng Li; Weiwei Zhai; Markus Hsi-Yang Fritz; Nancy F. Hansen; Eric Durand; Anna-Sapfo Malaspinas; Jeffrey D. Jensen; Tomas Marques-Bonet; Can Alkan; Kay Prüfer; Matthias Meyer; Hernán A. Burbano; Jeffrey M. Good; Rigo Schultz; Ayinuer Aximu-Petri; Anne Butthof; Barbara Höber; Barbara Höffner; Madlen Siegemund; Antje Weihmann; Chad Nusbaum; Eric S. Lander; Carsten Russ

Kissing Cousins Neandertals, our closest relatives, ranged across Europe and Southwest Asia before their extinction approximately 30,000 years ago. Green et al. (p. 710) report a draft sequence of the Neandertal genome, created from three individuals, and compare it with genomes of five modern humans. The results suggest that ancient genomes of human relatives can be recovered with acceptably low contamination from modern human DNA. Because ancient DNA can be contaminated with microbial DNA, Burbano et al. (p. 723) developed a target sequence capture approach to obtain 14 kilobases of Neandertal DNA from a fairly poorly preserved sample with a high microbial load. A number of genomic regions and genes were revealed as candidates for positive selection early in modern human history. The genomic data suggest that Neandertals mixed with modern human ancestors some 120,000 years ago, leaving traces of Neandertal DNA in contemporary humans. Gene flow has occurred from Neandertals to humans of Eurasian descent, but not to Africans. Neandertals, the closest evolutionary relatives of present-day humans, lived in large parts of Europe and western Asia before disappearing 30,000 years ago. We present a draft sequence of the Neandertal genome composed of more than 4 billion nucleotides from three individuals. Comparisons of the Neandertal genome to the genomes of five present-day humans from different parts of the world identify a number of genomic regions that may have been affected by positive selection in ancestral modern humans, including genes involved in metabolism and in cognitive and skeletal development. We show that Neandertals shared more genetic variants with present-day humans in Eurasia than with present-day humans in sub-Saharan Africa, suggesting that gene flow from Neandertals into the ancestors of non-Africans occurred before the divergence of Eurasian groups from each other.

Nucleic Acids Research | 2002

The Ensembl genome database project

Tim Hubbard; Darren Barker; Ewan Birney; Graham Cameron; Yuan Chen; L. Clark; Tony Cox; James Cuff; V. Curwen; Thomas A. Down; Richard Durbin; E. Eyras; James Gilbert; Martin Hammond; L. Huminiecki; Arek Kasprzyk; Heikki Lehväslaiho; Philip Lijnzaad; Craig Melsopp; Emmanuel Mongin; R. Pettett; M. Pocock; Simon Potter; A. Rust; Esther Schmidt; Stephen M. J. Searle; Guy Slater; J. Smith; W. Spooner; A. Stabenau

The Ensembl (http://www.ensembl.org/) database project provides a bioinformatics framework to organise biology around the sequences of large genomes. It is a comprehensive source of stable automatic annotation of the human genome sequence, with confirmed gene predictions that have been integrated with external data sources, and is available as either an interactive web site or as flat files. It is also an open source software engineering project to develop a portable system able to handle very large genomes and associated requirements from sequence analysis to data storage and visualisation. The Ensembl site is one of the leading sources of human genome sequence annotation and provided much of the analysis for publication by the international human genome project of the draft genome. The Ensembl system is being installed around the world in both companies and academic sites on machines ranging from supercomputers to laptops.

BMC Bioinformatics | 2005

Automated generation of heuristics for biological sequence comparison

Guy Slater; Ewan Birney

BackgroundExhaustive methods of sequence alignment are accurate but slow, whereas heuristic approaches run quickly, but their complexity makes them more difficult to implement. We introduce bounded sparse dynamic programming (BSDP) to allow rapid approximation to exhaustive alignment. This is used within a framework whereby the alignment algorithms are described in terms of their underlying model, to allow automated development of efficient heuristic implementations which may be applied to a general set of sequence comparison problems.ResultsThe speed and accuracy of this approach compares favourably with existing methods. Examples of its use in the context of genome annotation are given.ConclusionsThis system allows rapid implementation of heuristics approximating to many complex alignment models, and has been incorporated into the freely available sequence alignment program, exonerate.

Bioinformatics | 2012

Oases: robust de novo RNA-seq assembly across the dynamic range of expression levels

Marcel H. Schulz; Daniel R. Zerbino; Martin Vingron; Ewan Birney

Motivation: High-throughput sequencing has made the analysis of new model organisms more affordable. Although assembling a new genome can still be costly and difficult, it is possible to use RNA-seq to sequence mRNA. In the absence of a known genome, it is necessary to assemble these sequences de novo, taking into account possible alternative isoforms and the dynamic range of expression values. Results: We present a software package named Oases designed to heuristically assemble RNA-seq reads in the absence of a reference genome, across a broad spectrum of expression values and in presence of alternative isoforms. It achieves this by using an array of hash lengths, a dynamic filtering of noise, a robust resolution of alternative splicing events and the efficient merging of multiple assemblies. It was tested on human and mouse RNA-seq data and is shown to improve significantly on the transABySS and Trinity de novo transcriptome assemblers. Availability and implementation: Oases is freely available under the GPL license at www.ebi.ac.uk/~zerbino/oases/ Contact: [email protected] Supplementary information: Supplementary data are available at Bioinformatics online.

Nucleic Acids Research | 2004

Reactome: a knowledgebase of biological pathways

G. Joshi-Tope; Marc Gillespie; Imre Vastrik; Peter D'Eustachio; Esther Schmidt; B. de Bono; Bijay Jassal; G.R. Gopinath; G.R. Wu; Lisa Matthews; Suzanna E. Lewis; Ewan Birney; Lincoln Stein

Reactome, located at http://www.reactome.org is a curated, peer-reviewed resource of human biological processes. Given the genetic makeup of an organism, the complete set of possible reactions constitutes its reactome. The basic unit of the Reactome database is a reaction; reactions are then grouped into causal chains to form pathways. The Reactome data model allows us to represent many diverse processes in the human system, including the pathways of intermediary metabolism, regulatory pathways, and signal transduction, and high-level processes, such as the cell cycle. Reactome provides a qualitative framework, on which quantitative data can be superimposed. Tools have been developed to facilitate custom data entry and annotation by expert biologists, and to allow visualization and exploration of the finished dataset as an interactive process map. Although our primary curational domain is pathways from Homo sapiens, we regularly create electronic projections of human pathways onto other organisms via putative orthologs, thus making Reactome relevant to model organism research communities. The database is publicly available under open source terms, which allows both its content and its software infrastructure to be freely used and redistributed.

Science | 2007

Genome sequence of Aedes aegypti, a major arbovirus vector

Vishvanath Nene; Jennifer R. Wortman; Daniel John Lawson; Brian J. Haas; Chinnappa D. Kodira; Zhijian Jake Tu; Brendan J. Loftus; Zhiyong Xi; Karyn Megy; Manfred Grabherr; Quinghu Ren; Evgeny M. Zdobnov; Neil F. Lobo; Kathryn S. Campbell; Susan E. Brown; Maria F. Bonaldo; Jingsong Zhu; Steven P. Sinkins; David G. Hogenkamp; Paolo Amedeo; Peter Arensburger; Peter W. Atkinson; Shelby Bidwell; Jim Biedler; Ewan Birney; Robert V. Bruggner; Javier Costas; Monique R. Coy; Jonathan Crabtree; Matt Crawford

We present a draft sequence of the genome of Aedes aegypti, the primary vector for yellow fever and dengue fever, which at ∼1376 million base pairs is about 5 times the size of the genome of the malaria vector Anopheles gambiae. Nearly 50% of the Ae. aegypti genome consists of transposable elements. These contribute to a factor of ∼4 to 6 increase in average gene length and in sizes of intergenic regions relative to An. gambiae and Drosophila melanogaster. Nonetheless, chromosomal synteny is generally maintained among all three insects, although conservation of orthologous gene order is higher (by a factor of ∼2) between the mosquito species than between either of them and the fruit fly. An increase in genes encoding odorant binding, cytochrome P450, and cuticle domains relative to An. gambiae suggests that members of these protein families underpin some of the biological differences between the two mosquito species.

Nucleic Acids Research | 2011

Reactome: a database of reactions, pathways and biological processes

David Croft; Gavin O’Kelly; Guanming Wu; Robin Haw; Marc Gillespie; Lisa Matthews; Michael Caudy; Phani Garapati; Gopal Gopinath; Bijay Jassal; Steven Jupe; Irina Kalatskaya; Shahana Mahajan; Bruce May; Nelson Ndegwa; Esther Schmidt; Veronica Shamovsky; Christina K. Yung; Ewan Birney; Henning Hermjakob; Peter D’Eustachio; Lincoln Stein

Reactome (http://www.reactome.org) is a collaboration among groups at the Ontario Institute for Cancer Research, Cold Spring Harbor Laboratory, New York University School of Medicine and The European Bioinformatics Institute, to develop an open source curated bioinformatics database of human pathways and reactions. Recently, we developed a new web site with improved tools for pathway browsing and data analysis. The Pathway Browser is an Systems Biology Graphical Notation (SBGN)-based visualization system that supports zooming, scrolling and event highlighting. It exploits PSIQUIC web services to overlay our curated pathways with molecular interaction data from the Reactome Functional Interaction Network and external interaction databases such as IntAct, BioGRID, ChEMBL, iRefIndex, MINT and STRING. Our Pathway and Expression Analysis tools enable ID mapping, pathway assignment and overrepresentation analysis of user-supplied data sets. To support pathway annotation and analysis in other species, we continue to make orthology-based inferences of pathways in non-human species, applying Ensembl Compara to identify orthologs of curated human proteins in each of 20 other species. The resulting inferred pathway sets can be browsed and analyzed with our Species Comparison tool. Collaborations are also underway to create manually curated data sets on the Reactome framework for chicken, Drosophila and rice.

Nature | 2010

A small-cell lung cancer genome with complex signatures of tobacco exposure

Erin Pleasance; Philip Stephens; Sarah O’Meara; David J. McBride; Alison Meynert; David Jones; Meng-Lay Lin; David Beare; King Wai Lau; Christopher Greenman; Ignacio Varela; Serena Nik-Zainal; Helen Davies; Gonzalo R. Ordóñez; Laura Mudie; Calli Latimer; Sarah Edkins; Lucy Stebbings; Lina Chen; Mingming Jia; Catherine Leroy; John Marshall; Andrew Menzies; Adam Butler; Jon Teague; Jonathon Mangion; Yongming A. Sun; Stephen F. McLaughlin; Heather E. Peckham; Eric F. Tsung

Cancer is driven by mutation. Worldwide, tobacco smoking is the principal lifestyle exposure that causes cancer, exerting carcinogenicity through >60 chemicals that bind and mutate DNA. Using massively parallel sequencing technology, we sequenced a small-cell lung cancer cell line, NCI-H209, to explore the mutational burden associated with tobacco smoking. A total of 22,910 somatic substitutions were identified, including 134 in coding exons. Multiple mutation signatures testify to the cocktail of carcinogens in tobacco smoke and their proclivities for particular bases and surrounding sequence context. Effects of transcription-coupled repair and a second, more general, expression-linked repair pathway were evident. We identified a tandem duplication that duplicates exons 3–8 of CHD7 in frame, and another two lines carrying PVT1–CHD7 fusion genes, indicating that CHD7 may be recurrently rearranged in this disease. These findings illustrate the potential for next-generation sequencing to provide unprecedented insights into mutational processes, cellular repair pathways and gene networks associated with cancer.SUMMARY Cancer is driven by mutation. Worldwide, tobacco smoking is the major lifestyle exposure that causes cancer, exerting carcinogenicity through >60 chemicals that bind and mutate DNA. Using massively parallel sequencing technology, we sequenced a small cell lung cancer cell line, NCI-H209, to explore the mutational burden associated with tobacco smoking. 22,910 somatic substitutions were identified, including 132 in coding exons. Multiple mutation signatures testify to the cocktail of carcinogens in tobacco smoke and their proclivities for particular bases and surrounding sequence context. Effects of transcription-coupled repair and a second, more general expression-linked repair pathway were evident. We identified a tandem duplication that duplicates exons 3-8 of CHD7 in-frame, and another two lines carrying PVT1-CHD7 fusion genes, suggesting that CHD7 may be recurrently rearranged in this disease. These findings illustrate the potential for next-generation sequencing to provide unprecedented insights into mutational processes, cellular repair pathways and gene networks associated with cancer.

Nature | 2012

The genomic basis of adaptive evolution in threespine sticklebacks

Felicity C. Jones; Manfred Grabherr; Yingguang Frank Chan; Pamela Russell; Evan Mauceli; Jeremy A. Johnson; Ross Swofford; Mono Pirun; Michael C. Zody; Simon D. M. White; Ewan Birney; Stephen M. J. Searle; Jeremy Schmutz; Jane Grimwood; Mark Dickson; Richard M. Myers; Craig T. Miller; Brian R. Summers; Anne K. Knecht; Shannon D. Brady; Haili Zhang; Alex A. Pollen; Timothy R. Howes; Chris T. Amemiya; Eric S. Lander; Federica Di Palma; Kerstin Lindblad-Toh; David M. Kingsley

Marine stickleback fish have colonized and adapted to thousands of streams and lakes formed since the last ice age, providing an exceptional opportunity to characterize genomic mechanisms underlying repeated ecological adaptation in nature. Here we develop a high-quality reference genome assembly for threespine sticklebacks. By sequencing the genomes of twenty additional individuals from a global set of marine and freshwater populations, we identify a genome-wide set of loci that are consistently associated with marine–freshwater divergence. Our results indicate that reuse of globally shared standing genetic variation, including chromosomal inversions, has an important role in repeated evolution of distinct marine and freshwater sticklebacks, and in the maintenance of divergent ecotypes during early stages of reproductive isolation. Both coding and regulatory changes occur in the set of loci underlying marine–freshwater evolution, but regulatory changes appear to predominate in this well known example of repeated adaptive evolution in nature.

Explore More