Robert M. Waterhouse
Swiss Institute of Bioinformatics
Network
Latest external collaboration on country level. Dive into details by clicking on the dots.
Publication
Featured researches published by Robert M. Waterhouse.
Bioinformatics | 2015
Felipe A. Simão; Robert M. Waterhouse; Panagiotis Ioannidis; Evgenia V. Kriventseva; Evgeny M. Zdobnov
MOTIVATION Genomics has revolutionized biological research, but quality assessment of the resulting assembled sequences is complicated and remains mostly limited to technical measures like N50. RESULTS We propose a measure for quantitative assessment of genome assembly and annotation completeness based on evolutionarily informed expectations of gene content. We implemented the assessment procedure in open-source software, with sets of Benchmarking Universal Single-Copy Orthologs, named BUSCO. AVAILABILITY AND IMPLEMENTATION Software implemented in Python and datasets available for download from http://busco.ezlab.org. CONTACT [email protected] SUPPLEMENTARY INFORMATION Supplementary data are available at Bioinformatics online.
Science | 2007
Robert M. Waterhouse; Evgenia V. Kriventseva; Stephan Meister; Zhiyong Xi; Kanwal S. Alvarez; Lyric C. Bartholomay; Carolina Barillas-Mury; Guowu Bian; Stéphanie Blandin; Bruce M. Christensen; Yuemei Dong; Haobo Jiang; Michael R. Kanost; Anastasios C. Koutsos; Elena A. Levashina; Jianyong Li; Petros Ligoxygakis; Robert M. MacCallum; George F. Mayhew; Antonio M. Mendes; Kristin Michel; Mike A. Osta; Susan M. Paskewitz; Sang Woon Shin; Dina Vlachou; Lihui Wang; Weiqi Wei; Liangbiao Zheng; Zhen Zou; David W. Severson
Mosquitoes are vectors of parasitic and viral diseases of immense importance for public health. The acquisition of the genome sequence of the yellow fever and Dengue vector, Aedes aegypti (Aa), has enabled a comparative phylogenomic analysis of the insect immune repertoire: in Aa, the malaria vector Anopheles gambiae (Ag), and the fruit fly Drosophila melanogaster (Dm). Analysis of immune signaling pathways and response modules reveals both conservative and rapidly evolving features associated with different functional gene categories and particular aspects of immune reactions. These dynamics reflect in part continuous readjustment between accommodation and rejection of pathogens and suggest how innate immunity may have evolved.
Proceedings of the National Academy of Sciences of the United States of America | 2010
Ewen F. Kirkness; Brian J. Haas; Weilin Sun; Henk R. Braig; M. Alejandra Perotti; John M. Clark; Si Hyeock Lee; Hugh M. Robertson; Ryan C. Kennedy; Eran Elhaik; Daniel Gerlach; Evgenia V. Kriventseva; Christine G. Elsik; Dan Graur; Catherine A. Hill; Jan A. Veenstra; Brian Walenz; Jose M. C. Tubio; José M. C. Ribeiro; Julio Rozas; J. Spencer Johnston; Justin T. Reese; Aleksandar Popadić; Marta Tojo; Didier Raoult; David L. Reed; Yoshinori Tomoyasu; Emily Kraus; Omprakash Mittapalli; Venu M. Margam
As an obligatory parasite of humans, the body louse (Pediculus humanus humanus) is an important vector for human diseases, including epidemic typhus, relapsing fever, and trench fever. Here, we present genome sequences of the body louse and its primary bacterial endosymbiont Candidatus Riesia pediculicola. The body louse has the smallest known insect genome, spanning 108 Mb. Despite its status as an obligate parasite, it retains a remarkably complete basal insect repertoire of 10,773 protein-coding genes and 57 microRNAs. Representing hemimetabolous insects, the genome of the body louse thus provides a reference for studies of holometabolous insects. Compared with other insect genomes, the body louse genome contains significantly fewer genes associated with environmental sensing and response, including odorant and gustatory receptors and detoxifying enzymes. The unique architecture of the 18 minicircular mitochondrial chromosomes of the body louse may be linked to the loss of the gene encoding the mitochondrial single-stranded DNA binding protein. The genome of the obligatory louse endosymbiont Candidatus Riesia pediculicola encodes less than 600 genes on a short, linear chromosome and a circular plasmid. The plasmid harbors a unique arrangement of genes required for the synthesis of pantothenate, an essential vitamin deficient in the louse diet. The human body louse, its primary endosymbiont, and the bacterial pathogens that it vectors all possess genomes reduced in size compared with their free-living close relatives. Thus, the body louse genome project offers unique information and tools to use in advancing understanding of coevolution among vectors, symbionts, and pathogens.
Science | 2010
Peter Arensburger; Karine Megy; Robert M. Waterhouse; Jenica Abrudan; Paolo Amedeo; Beatriz García Antelo; Lyric C. Bartholomay; Shelby Bidwell; Elisabet Caler; Francisco Camara; Corey L. Campbell; Kathryn S. Campbell; Claudio Casola; Marta T. Castro; Ishwar Chandramouliswaran; Sinéad B. Chapman; Scott Christley; Javier Costas; Eric Eisenstadt; Cédric Feschotte; Claire M. Fraser-Liggett; Roderic Guigó; Brian J. Haas; Martin Hammond; Bill S. Hansson; Janet Hemingway; Sharon R. Hill; Clint Howarth; Rickard Ignell; Ryan C. Kennedy
Closing the Vector Circle The genome sequence of Culex quinquefasciatus offers a representative of the third major genus of mosquito disease vectors for comparative analysis. In a major international effort, Arensburger et al. (p. 86) uncovered divergences in the C. quinquefasciatus genome compared with the representatives of the other two genera Aedes aegypti and Anopheles gambiae. The main difference noted is the expansion of numbers of genes, particularly for immunity, oxidoreductive functions, and digestive enzymes, which may reflect specific aspects of the Culex life cycle. Bartholomay et al. (p. 88) explored infection-response genes in Culex in more depth and uncovered 500 immune response-related genes, similar to the numbers seen in Aedes, but fewer than seen in Anopheles or the fruit fly Drosophila melanogaster. The higher numbers of genes were attributed partly to expansions in those encoding serpins, C-type lectins, and fibrinogen-related proteins, consistent with greater immune surveillance and associated signaling needed to monitor the dangers of breeding in polluted, urbanized environments. Transcriptome analysis confirmed that inoculation with unfamiliar bacteria prompted strong immune responses in Culex. The worm and virus pathogens that the mosquitoes transmit naturally provoked little immune activation, however, suggesting that tolerance has evolved to any damage caused by replication of the pathogens in the insects. The genome of a third mosquito species reveals distinctions related to vector capacities and habitat preferences. Culex quinquefasciatus (the southern house mosquito) is an important mosquito vector of viruses such as West Nile virus and St. Louis encephalitis virus, as well as of nematodes that cause lymphatic filariasis. C. quinquefasciatus is one species within the Culex pipiens species complex and can be found throughout tropical and temperate climates of the world. The ability of C. quinquefasciatus to take blood meals from birds, livestock, and humans contributes to its ability to vector pathogens between species. Here, we describe the genomic sequence of C. quinquefasciatus: Its repertoire of 18,883 protein-coding genes is 22% larger than that of Aedes aegypti and 52% larger than that of Anopheles gambiae with multiple gene-family expansions, including olfactory and gustatory receptors, salivary gland genes, and genes associated with xenobiotic detoxification.
Nucleic Acids Research | 2013
Robert M. Waterhouse; Fredrik Tegenfeldt; Jia Li; Evgeny M. Zdobnov; Evgenia V. Kriventseva
The concept of orthology provides a foundation for formulating hypotheses on gene and genome evolution, and thus forms the cornerstone of comparative genomics, phylogenomics and metagenomics. We present the update of OrthoDB—the hierarchical catalog of orthologs (http://www.orthodb.org). From its conception, OrthoDB promoted delineation of orthologs at varying resolution by explicitly referring to the hierarchy of species radiations, now also adopted by other resources. The current release provides comprehensive coverage of animals and fungi representing 252 eukaryotic species, and is now extended to prokaryotes with the inclusion of 1115 bacteria. Functional annotations of orthologous groups are provided through mapping to InterPro, GO, OMIM and model organism phenotypes, with cross-references to major resources including UniProt, NCBI and FlyBase. Uniquely, OrthoDB provides computed evolutionary traits of orthologs, such as gene duplicability and loss profiles, divergence rates, sibling groups, and now extended with exon–intron architectures, syntenic orthologs and parent–child trees. The interactive web interface allows navigation along the species phylogenies, complex queries with various identifiers, annotation keywords and phrases, as well as with gene copy-number profiles and sequence homology searches. With the explosive growth of available data, OrthoDB also provides mapping of newly sequenced genomes and transcriptomes to the current orthologous groups.
Science | 2015
Michael Fontaine; James B. Pease; Aaron Steele; Robert M. Waterhouse; Daniel E. Neafsey; Igor V. Sharakhov; Xiaofang Jiang; Andrew Brantley Hall; Flaminia Catteruccia; Evdoxia G. Kakani; Sara N. Mitchell; Yi-Chieh Wu; Hilary A. Smith; R. Rebecca Love; Mara K. N. Lawniczak; Michel A. Slotman; Scott J. Emrich; Matthew W. Hahn; Nora J. Besansky
Introduction The notion that species boundaries can be porous to introgression is increasingly accepted. Yet the broader role of introgression in evolution remains contentious and poorly documented, partly because of the challenges involved in accurately identifying introgression in the very groups where it is most likely to occur. Recently diverged species often have incomplete reproductive barriers and may hybridize where they overlap. However, because of retention and stochastic sorting of ancestral polymorphisms, inference of the correct species branching order is notoriously challenging for recent speciation events, especially those closely spaced in time. Without knowledge of species relationships, it is impossible to identify instances of introgression. Rationale Since the discovery that the single mosquito taxon described in 1902 as Anopheles gambiae was actually a complex of several closely related and morphologically indistinguishable sibling species, the correct species branching order has remained controversial and unresolved. This Afrotropical complex contains the world’s most important vectors of human malaria, owing to their close association with humans, as well as minor vectors and species that do not bite humans. On the basis of ecology and behavior, one might predict phylogenetic clustering of the three highly anthropophilic vector species. However, previous phylogenetic analyses of the complex based on a limited number of markers strongly disagree about relationships between the major vectors, potentially because of historical introgression between them. To investigate the history of the species complex, we used whole-genome reference assemblies, as well as dozens of resequenced individuals from the field. Results We observed a large amount of phylogenetic discordance between trees generated from the autosomes and X chromosome. The autosomes, which make up the majority of the genome, overwhelmingly supported the grouping of the three major vectors of malaria, An. gambiae, An. coluzzii, and An. arabiensis. In stark contrast, the X chromosome strongly supported the grouping of An. arabiensis with a species that plays no role in malaria transmission, An. quadriannulatus. Although the whole-genome consensus phylogeny unequivocally agrees with the autosomal topology, we found that the topology most often located on the X chromosome follows the historical species branching order, with pervasive introgression on the autosomes producing relationships that group the three highly anthropophilic species together. With knowledge of the correct species branching order, we are further able to uncover introgression between another species pair, as well as a complex history of balancing selection, introgression, and local adaptation of a large autosomal inversion that confers aridity tolerance. Conclusion We identify the correct species branching order of the An. gambiae species complex, resolving a contentious phylogeny. Notably, lineages leading to the principal vectors of human malaria were among the first in the complex to radiate and are not most closely related to each other. Pervasive autosomal introgression between these human malaria vectors, including nonsister vector species, suggests that traits enhancing vectorial capacity can be acquired not only through de novo mutation but also through a more rapid process of interspecific genetic exchange. Time-lapse photographs of an adult anopheline mosquito emerging from its pupal case. RELATED ITEMS IN ScienceD. E. Neafsey et al., Science 347, 1258522 (2015) Introgressive hybridization is now recognized as a widespread phenomenon, but its role in evolution remains contested. Here, we use newly available reference genome assemblies to investigate phylogenetic relationships and introgression in a medically important group of Afrotropical mosquito sibling species. We have identified the correct species branching order to resolve a contentious phylogeny and show that lineages leading to the principal vectors of human malaria were among the first to split. Pervasive autosomal introgression between these malaria vectors means that only a small fraction of the genome, mainly on the X chromosome, has not crossed species boundaries. Our results suggest that traits enhancing vectorial capacity may be gained through interspecific gene flow, including between nonsister species. Mosquito adaptability across genomes Virtually everyone has first-hand experience with mosquitoes. Few recognize the subtle biological distinctions among these bloodsucking flies that render some bites mere nuisances and others the initiation of a potentially life-threatening infection. By sequencing the genomes of several mosquitoes in depth, Neafsey et al. and Fontaine et al. reveal clues that explain the mystery of why only some species of one genus of mosquitoes are capable of transmitting human malaria (see the Perspective by Clark and Messer). Science, this issue 10.1126/science.1258524 and 10.1126/science.1258522; see also p. 27 Comparison of several genomes reveals the genetic history of mosquitoes’ ability to vector malaria among humans. [Also see Perspective by Clark and Messer]
BMC Genomics | 2014
Christine G. Elsik; Kim C. Worley; Anna K. Bennett; Martin Beye; Francisco Camara; Christopher P. Childers; Dirk C. de Graaf; Griet Debyser; Jixin Deng; Bart Devreese; Eran Elhaik; Jay D. Evans; Leonard J. Foster; Dan Graur; Roderic Guigó; Katharina Hoff; Michael Holder; Matthew E. Hudson; Greg J. Hunt; Huaiyang Jiang; Vandita Joshi; Radhika S. Khetani; Peter Kosarev; Christie Kovar; Jian Ma; Ryszard Maleszka; Robin F. A. Moritz; Monica Munoz-Torres; Terence Murphy; Donna M. Muzny
BackgroundThe first generation of genome sequence assemblies and annotations have had a significant impact upon our understanding of the biology of the sequenced species, the phylogenetic relationships among species, the study of populations within and across species, and have informed the biology of humans. As only a few Metazoan genomes are approaching finished quality (human, mouse, fly and worm), there is room for improvement of most genome assemblies. The honey bee (Apis mellifera) genome, published in 2006, was noted for its bimodal GC content distribution that affected the quality of the assembly in some regions and for fewer genes in the initial gene set (OGSv1.0) compared to what would be expected based on other sequenced insect genomes.ResultsHere, we report an improved honey bee genome assembly (Amel_4.5) with a new gene annotation set (OGSv3.2), and show that the honey bee genome contains a number of genes similar to that of other insect genomes, contrary to what was suggested in OGSv1.0. The new genome assembly is more contiguous and complete and the new gene set includes ~5000 more protein-coding genes, 50% more than previously reported. About 1/6 of the additional genes were due to improvements to the assembly, and the remaining were inferred based on new RNAseq and protein data.ConclusionsLessons learned from this genome upgrade have important implications for future genome sequencing projects. Furthermore, the improvements significantly enhance genomic resources for the honey bee, a key model for social behavior and essential to global ecology through pollination.
Science | 2009
Michael Povelones; Robert M. Waterhouse; Fotis C. Kafatos; George K. Christophides
Leucine-rich repeat–containing proteins are central to host defense in plants and animals. We show that in the mosquito Anopheles gambiae, two such proteins that antagonize malaria parasite infections, LRIM1 and APL1C, circulate in the hemolymph as a high-molecular-weight complex held together by disulfide bridges. The complex interacts with the complement C3-like protein, TEP1, promoting its cleavage or stabilization and its subsequent localization on the surface of midgut-invading Plasmodium berghei parasites, targeting them for destruction. LRIM1 and APL1C are members of a protein family with orthologs in other disease vector mosquitoes and appear to be important effectors in innate mosquito defenses against human pathogens.
Science | 2015
Karen M. Kapheim; Hailin Pan; Cai Li; Daniela Puiu; Tanja Magoc; Hugh M. Robertson; Matthew E. Hudson; Aarti Venkat; Brielle J. Fischman; Alvaro G. Hernandez; Mark Yandell; Daniel Ence; Carson Holt; George D. Yocum; William P. Kemp; Jordi Bosch; Robert M. Waterhouse; Evgeny M. Zdobnov; Eckart Stolle; F. Bernhard Kraus; Sophie Helbing; Robin F. A. Moritz; Karl M. Glastad; Brendan G. Hunt; Michael A. D. Goodisman; Frank Hauser; Cornelis J. P. Grimmelikhuijzen; Daniel G. Pinheiro; Francis Morais Franco Nunes; Michelle Soares
For bees, many roads lead to social harmony Eusociality, where workers sacrifice their reproductive rights to support the colony, has evolved repeatedly and represents the most evolved form of social evolution in insects. Kapheim et al. looked across the genomes of 10 bee species with varying degrees of sociality to determine the underlying genomic contributions. No one genomic path led to eusociality, but similarities across genomes were seen in features such as increases in gene regulation and methylation. It also seems that selection pressures relaxed after the emergence of complex sociality. Science, this issue p. 1139 Social evolution in bees has followed diverse genomic paths but shares genomic patterns. The evolution of eusociality is one of the major transitions in evolution, but the underlying genomic changes are unknown. We compared the genomes of 10 bee species that vary in social complexity, representing multiple independent transitions in social evolution, and report three major findings. First, many important genes show evidence of neutral evolution as a consequence of relaxed selection with increasing social complexity. Second, there is no single road map to eusociality; independent evolutionary transitions in sociality have independent genetic underpinnings. Third, though clearly independent in detail, these transitions do have similar general features, including an increase in constrained protein evolution accompanied by increases in the potential for gene regulation and decreases in diversity and abundance of transposable elements. Eusociality may arise through different mechanisms each time, but would likely always involve an increase in the complexity of gene networks.
Nucleic Acids Research | 2015
Evgenia V. Kriventseva; Fredrik Tegenfeldt; Tom J. Petty; Robert M. Waterhouse; Felipe A. Simão; Igor A. Pozdnyakov; Panagiotis Ioannidis; Evgeny M. Zdobnov
Orthology, refining the concept of homology, is the cornerstone of evolutionary comparative studies. With the ever-increasing availability of genomic data, inference of orthology has become instrumental for generating hypotheses about gene functions crucial to many studies. This update of the OrthoDB hierarchical catalog of orthologs (http://www.orthodb.org) covers 3027 complete genomes, including the most comprehensive set of 87 arthropods, 61 vertebrates, 227 fungi and 2627 bacteria (sampling the most complete and representative genomes from over 11,000 available). In addition to the most extensive integration of functional annotations from UniProt, InterPro, GO, OMIM, model organism phenotypes and COG functional categories, OrthoDB uniquely provides evolutionary annotations including rates of ortholog sequence divergence, copy-number profiles, sibling groups and gene architectures. We re-designed the entirety of the OrthoDB website from the underlying technology to the user interface, enabling the user to specify species of interest and to select the relevant orthology level by the NCBI taxonomy. The text searches allow use of complex logic with various identifiers of genes, proteins, domains, ontologies or annotation keywords and phrases. Gene copy-number profiles can also be queried. This release comes with the freely available underlying ortholog clustering pipeline (http://www.orthodb.org/software).