Matt E. Oates
University of Bristol
Network
Latest external collaboration on country level. Dive into details by clicking on the dots.
Publication
Featured researches published by Matt E. Oates.
Nucleic Acids Research | 2015
Alex L. Mitchell; Hsin-Yu Chang; Louise Daugherty; Matthew Fraser; Sarah Hunter; Rodrigo Lopez; Craig McAnulla; Conor McMenamin; Gift Nuka; Sebastien Pesseat; Amaia Sangrador-Vegas; Maxim Scheremetjew; Claudia Rato; Siew-Yit Yong; Alex Bateman; Marco Punta; Teresa K. Attwood; Christian J. A. Sigrist; Nicole Redaschi; Catherine Rivoire; Ioannis Xenarios; Daniel Kahn; Dominique Guyot; Peer Bork; Ivica Letunic; Julian Gough; Matt E. Oates; Daniel H. Haft; Hongzhan Huang; Darren A. Natale
The InterPro database (http://www.ebi.ac.uk/interpro/) is a freely available resource that can be used to classify sequences into protein families and to predict the presence of important domains and sites. Central to the InterPro database are predictive models, known as signatures, from a range of different protein family databases that have different biological focuses and use different methodological approaches to classify protein families and domains. InterPro integrates these signatures, capitalizing on the respective strengths of the individual databases, to produce a powerful protein classification resource. Here, we report on the status of InterPro as it enters its 15th year of operation, and give an overview of new developments with the database and its associated Web interfaces and software. In particular, the new domain architecture search tool is described and the process of mapping of Gene Ontology terms to InterPro is outlined. We also discuss the challenges faced by the resource given the explosive growth in sequence data in recent years. InterPro (version 48.0) contains 36 766 member database signatures integrated into 26 238 InterPro entries, an increase of over 3993 entries (5081 signatures), since 2012.
Nucleic Acids Research | 2012
Matt E. Oates; Pedro Romero; Takashi Ishida; Mohamed F. Ghalwash; Marcin J. Mizianty; Bin Xue; Zsuzsanna Dosztányi; Vladimir N. Uversky; Zoran Obradovic; Lukasz Kurgan; A. Keith Dunker; Julian Gough
We present the Database of Disordered Protein Prediction (D2P2), available at http://d2p2.pro (including website source code). A battery of disorder predictors and their variants, VL-XT, VSL2b, PrDOS, PV2, Espritz and IUPred, were run on all protein sequences from 1765 complete proteomes (to be updated as more genomes are completed). Integrated with these results are all of the predicted (mostly structured) SCOP domains using the SUPERFAMILY predictor. These disorder/structure annotations together enable comparison of the disorder predictors with each other and examination of the overlap between disordered predictions and SCOP domains on a large scale. D2P2 will increase our understanding of the interplay between disorder and structure, the genomic distribution of disorder, and its evolutionary history. The parsed data are made available in a unified format for download as flat files or SQL tables either by genome, by predictor, or for the complete set. An interactive website provides a graphical view of each protein annotated with the SCOP domains and disordered regions from all predictors overlaid (or shown as a consensus). There are statistics and tools for browsing and comparing genomes and their disorder within the context of their position on the tree of life.
Proceedings of the National Academy of Sciences of the United States of America | 2015
Sebastian Baumgarten; Oleg Simakov; Lisl Y. Esherick; Yi Jin Liew; Erik M. Lehnert; Craig T. Michell; Yong Li; Elizabeth A. Hambleton; Annika Guse; Matt E. Oates; Julian Gough; Virginia M. Weis; Manuel Aranda; John R. Pringle; Christian R. Voolstra
Significance Coral reefs form marine-biodiversity hotspots of enormous ecological, economic, and aesthetic importance that rely energetically on a functional symbiosis between the coral animal and a photosynthetic alga. The ongoing decline of corals worldwide due to anthropogenic influences, including global warming, ocean acidification, and pollution, heightens the need for an experimentally tractable model system to elucidate the molecular and cellular biology underlying the symbiosis and its susceptibility or resilience to stress. The small sea anemone Aiptasia is such a system, and our analysis of its genome provides a foundation for research in this area and has revealed numerous features of interest in relation to the evolution and function of the symbiotic relationship. The most diverse marine ecosystems, coral reefs, depend upon a functional symbiosis between a cnidarian animal host (the coral) and intracellular photosynthetic dinoflagellate algae. The molecular and cellular mechanisms underlying this endosymbiosis are not well understood, in part because of the difficulties of experimental work with corals. The small sea anemone Aiptasia provides a tractable laboratory model for investigating these mechanisms. Here we report on the assembly and analysis of the Aiptasia genome, which will provide a foundation for future studies and has revealed several features that may be key to understanding the evolution and function of the endosymbiosis. These features include genomic rearrangements and taxonomically restricted genes that may be functionally related to the symbiosis, aspects of host dependence on alga-derived nutrients, a novel and expanded cnidarian-specific family of putative pattern-recognition receptors that might be involved in the animal–algal interactions, and extensive lineage-specific horizontal gene transfer. Extensive integration of genes of prokaryotic origin, including genes for antimicrobial peptides, presumably reflects an intimate association of the animal–algal pair also with its prokaryotic microbiome.
Nature Genetics | 2016
Owen J. L. Rackham; Jaber Firas; Hai Fang; Matt E. Oates; Melissa L. Holmes; Anja S. Knaupp; Harukazu Suzuki; Christian M. Nefzger; Carsten O. Daub; Jay W. Shin; Enrico Petretto; Alistair R. R. Forrest; Yoshihide Hayashizaki; Jose M. Polo; Julian Gough
Transdifferentiation, the process of converting from one cell type to another without going through a pluripotent state, has great promise for regenerative medicine. The identification of key transcription factors for reprogramming is currently limited by the cost of exhaustive experimental testing of plausible sets of factors, an approach that is inefficient and unscalable. Here we present a predictive system (Mogrify) that combines gene expression data with regulatory network information to predict the reprogramming factors necessary to induce cell conversion. We have applied Mogrify to 173 human cell types and 134 tissues, defining an atlas of cellular reprogramming. Mogrify correctly predicts the transcription factors used in known transdifferentiations. Furthermore, we validated two new transdifferentiations predicted by Mogrify. We provide a practical and efficient mechanism for systematically implementing novel cell conversions, facilitating the generalization of reprogramming of human cells. Predictions are made available to help rapidly further the field of cell conversion.
Nucleic Acids Research | 2015
Matt E. Oates; Jonathan Stahlhacke; Dimitrios V. Vavoulis; Ben Smithers; Owen J. L. Rackham; Adam Sardar; Jan Zaucha; Natalie Thurlby; Hai Fang; Julian Gough
We present updates to the SUPERFAMILY 1.75 (http://supfam.org) online resource and protein sequence collection. The hidden Markov model library that provides sequence homology to SCOP structural domains remains unchanged at version 1.75. In the last 4 years SUPERFAMILY has more than doubled its holding of curated complete proteomes over all cellular life, from 1400 proteomes reported previously in 2010 up to 3258 at present. Outside of the main sequence collection, SUPERFAMILY continues to provide domain annotation for sequences provided by other resources such as: UniProt, Ensembl, PDB, much of JGI Phytozome and selected subcollections of NCBI RefSeq. Despite this growth in data volume, SUPERFAMILY now provides users with an expanded and daily updated phylogenetic tree of life (sTOL). This tree is built with genomic-scale domain annotation data as before, but constantly updated when new species are introduced to the sequence library. Our Gene Ontology and other functional and phenotypic annotations previously reported have stood up to critical assessment by the function prediction community. We have now introduced these data in an integrated manner online at the level of an individual sequence, and—in the case of whole genomes—with enrichment analysis against a taxonomically defined background.
Current Opinion in Structural Biology | 2014
A. J. Venkatakrishnan; Tilman Flock; Daniel Estévez Prado; Matt E. Oates; Julian Gough; M. Madan Babu
The seven-transmembrane (7TM) helix fold of G-protein coupled receptors (GPCRs) has been adapted for a wide variety of physiologically important signaling functions. Here, we discuss the diversity in the structured and disordered regions of GPCRs based on the recently published crystal structures and sequence analysis of all human GPCRs. A comparison of the structures of rhodopsin-like receptors (class A), secretin-like receptors (class B), metabotropic receptors (class C) and frizzled receptors (class F) shows that the relative arrangement of the transmembrane helices is conserved across all four GPCR classes although individual receptors can be activated by ligand binding at varying positions within and around the transmembrane helical bundle. A systematic analysis of GPCR sequences reveals the presence of disordered segments in the cytoplasmic side, abundant post-translational modification sites, evidence for alternative splicing and several putative linear peptide motifs that have the potential to mediate interactions with cytosolic proteins. While the structured regions permit the receptor to bind diverse ligands, the disordered regions appear to have an underappreciated role in modulating downstream signaling in response to the cellular state. An integrated paradigm combining the knowledge of structured and disordered regions is imperative for gaining a holistic understanding of the GPCR (un)structure-function relationship.
Nucleic Acids Research | 2015
Tony E. Lewis; Ian Sillitoe; Antonina Andreeva; Tom L. Blundell; Daniel W. A. Buchan; Cyrus Chothia; Domenico Cozzetto; Jose M. Dana; Ioannis Filippis; Julian Gough; David Jones; Lawrence A. Kelley; Gerard J. Kleywegt; Federico Minneci; Jaina Mistry; Alexey G. Murzin; Bernardo Ochoa-Montaño; Matt E. Oates; Marco Punta; Owen J. L. Rackham; Jonathan Stahlhacke; Michael J. E. Sternberg; Sameer Velankar; Christine A. Orengo
Genome3D (http://www.genome3d.eu) is a collaborative resource that provides predicted domain annotations and structural models for key sequences. Since introducing Genome3D in a previous NAR paper, we have substantially extended and improved the resource. We have annotated representatives from Pfam families to improve coverage of diverse sequences and added a fast sequence search to the website to allow users to find Genome3D-annotated sequences similar to their own. We have improved and extended the Genome3D data, enlarging the source data set from three model organisms to 10, and adding VIVACE, a resource new to Genome3D. We have analysed and updated Genome3Ds SCOP/CATH mapping. Finally, we have improved the superposition tools, which now give users a more powerful interface for investigating similarities and differences between structural models.
Plant Physiology | 2017
Bangjun Zhou; Ravi V. Mural; Xuanyang Chen; Matt E. Oates; Richard Connor; Gregory B. Martin; Julian Gough; Lirong Zeng
The tomato genome encodes 40 ubiquitin-conjugating enzymes (E2), among which members of group III are required for the suppression of host immunity by the AvrPtoB effector and for plant pattern-triggered immunity. Of the three classes of enzymes involved in ubiquitination, ubiquitin-conjugating enzymes (E2) have been often incorrectly considered to play merely an auxiliary role in the process, and few E2 enzymes have been investigated in plants. To reveal the role of E2 in plant innate immunity, we identified and cloned 40 tomato genes encoding ubiquitin E2 proteins. Thioester assays indicated that the majority of the genes encode enzymatically active E2. Phylogenetic analysis classified the 40 tomato E2 enzymes into 13 groups, of which members of group III were found to interact and act specifically with AvrPtoB, a Pseudomonas syringae pv tomato effector that uses its ubiquitin ligase (E3) activity to suppress host immunity. Knocking down the expression of group III E2 genes in Nicotiana benthamiana diminished the AvrPtoB-promoted degradation of the Fen kinase and the AvrPtoB suppression of host immunity-associated programmed cell death. Importantly, silencing group III E2 genes also resulted in reduced pattern-triggered immunity (PTI). By contrast, programmed cell death induced by several effector-triggered immunity elicitors was not affected on group III-silenced plants. Functional characterization suggested redundancy among group III members for their role in the suppression of plant immunity by AvrPtoB and in PTI and identified UBIQUITIN-CONJUGATING11 (UBC11), UBC28, UBC29, UBC39, and UBC40 as playing a more significant role in PTI than other group III members. Our work builds a foundation for the further characterization of E2s in plant immunity and reveals that AvrPtoB has evolved a strategy for suppressing host immunity that is difficult for the plant to thwart.
Molecular Cell | 2016
Natasha S. Latysheva; Matt E. Oates; Louis Maddox; Tilman Flock; Julian Gough; Marija Buljan; Robert J. Weatheritt; M. Madan Babu
Summary Gene fusions are common cancer-causing mutations, but the molecular principles by which fusion protein products affect interaction networks and cause disease are not well understood. Here, we perform an integrative analysis of the structural, interactomic, and regulatory properties of thousands of putative fusion proteins. We demonstrate that genes that form fusions (i.e., parent genes) tend to be highly connected hub genes, whose protein products are enriched in structured and disordered interaction-mediating features. Fusion often results in the loss of these parental features and the depletion of regulatory sites such as post-translational modifications. Fusion products disproportionately connect proteins that did not previously interact in the protein interaction network. In this manner, fusion products can escape cellular regulation and constitutively rewire protein interaction networks. We suggest that the deregulation of central, interaction-prone proteins may represent a widespread mechanism by which fusion proteins alter the topology of cellular signaling pathways and promote cancer.
Nucleic Acids Research | 2015
Ben Smithers; Matt E. Oates; Julian Gough
We have discovered that positions of splice junctions in genes are constrained by the tolerance for disorder-promoting amino acids in the translated protein region. It is known that efficient splicing requires nucleotide bias at the splice junction; the preferred usage produces a distribution of amino acids that is disorder-promoting. We observe that efficiency of splicing, as seen in the amino-acid distribution, is not compromised to accommodate globular structure. Thus we infer that it is the positions of splice junctions in the gene that must be under constraint by the local protein environment. Examining exonic splicing enhancers found near the splice junction in the gene, reveals that these (short DNA motifs) are more prevalent in exons that encode disordered protein regions than exons encoding structured regions. Thus we also conclude that local protein features constrain efficient splicing more in structure than in disorder.