Network


Latest external collaboration on country level. Dive into details by clicking on the dots.

Hotspot


Dive into the research topics where Gabrielle A. Reeves is active.

Publication


Featured researches published by Gabrielle A. Reeves.


Nucleic Acids Research | 2004

The CATH Domain Structure Database and related resources Gene3D and DHS provide comprehensive domain family information for genome analysis

Frances M. G. Pearl; Annabel E. Todd; Ian Sillitoe; Mark Dibley; Oliver Redfern; Tony E. Lewis; Christopher G. Bennett; Russell L. Marsden; Alastair Grant; David A. Lee; Adrian Akpor; Michael Maibaum; Andrew P. Harrison; Timothy Dallman; Gabrielle A. Reeves; Ilhem Diboun; Sarah Addou; Stefano Lise; Caroline E. Johnston; Antonio Sillero; Janet M. Thornton; Christine A. Orengo

The CATH database of protein domain structures (http://www.biochem.ucl.ac.uk/bsm/cath/) currently contains 43 229 domains classified into 1467 superfamilies and 5107 sequence families. Each structural family is expanded with sequence relatives from GenBank and completed genomes, using a variety of efficient sequence search protocols and reliable thresholds. This extended CATH protein family database contains 616 470 domain sequences classified into 23 876 sequence families. This results in the significant expansion of the CATH HMM model library to include models built from the CATH sequence relatives, giving a 10% increase in coverage for detecting remote homologues. An improved Dictionary of Homologous superfamilies (DHS) (http://www.biochem.ucl.ac.uk/bsm/dhs/) containing specific sequence, structural and functional information for each superfamily in CATH considerably assists manual validation of homologues. Information on sequence relatives in CATH superfamilies, GenBank and completed genomes is presented in the CATH associated DHS and Gene3D resources. Domain partnership information can be obtained from Gene3D (http://www.biochem.ucl.ac.uk/bsm/cath/Gene3D/). A new CATH server has been implemented (http://www.biochem.ucl.ac.uk/cgi-bin/cath/CathServer.pl) providing automatic classification of newly determined sequences and structures using a suite of rapid sequence and structure comparison methods. The statistical significance of matches is assessed and links are provided to the putative superfamily or fold group to which the query sequence or structure is assigned.


Proceedings of the National Academy of Sciences of the United States of America | 2007

The implications of alternative splicing in the ENCODE protein complement.

Michael L. Tress; Pier Luigi Martelli; Adam Frankish; Gabrielle A. Reeves; Jan Jaap Wesselink; Corin Yeats; Páll ĺsólfur Ólason; Mario Albrecht; Hedi Hegyi; Alejandro Giorgetti; Domenico Raimondo; Julien Lagarde; Roman A. Laskowski; Gonzalo López; Michael I. Sadowski; James D. Watson; Piero Fariselli; Ivan Rossi; Alinda Nagy; Wang Kai; Zenia M Størling; Massimiliano Orsini; Yassen Assenov; Hagen Blankenburg; Carola Huthmacher; Fidel Ramírez; Andreas Schlicker; P. D. Jones; Samuel Kerrien; Sandra Orchard

Alternative premessenger RNA splicing enables genes to generate more than one gene product. Splicing events that occur within protein coding regions have the potential to alter the biological function of the expressed protein and even to create new protein functions. Alternative splicing has been suggested as one explanation for the discrepancy between the number of human genes and functional complexity. Here, we carry out a detailed study of the alternatively spliced gene products annotated in the ENCODE pilot project. We find that alternative splicing in human genes is more frequent than has commonly been suggested, and we demonstrate that many of the potential alternative gene products will have markedly different structure and function from their constitutively spliced counterparts. For the vast majority of these alternative isoforms, little evidence exists to suggest they have a role as functional proteins, and it seems unlikely that the spectrum of conventional enzymatic or structural functions can be substantially extended through alternative splicing.


BMC Bioinformatics | 2008

Integrating biological data – the Distributed Annotation System

Andrew M. Jenkinson; Mario Albrecht; Ewan Birney; Hagen Blankenburg; Thomas A. Down; Robert D. Finn; Henning Hermjakob; Tim Hubbard; Rafael C. Jimenez; Philip Jones; Andreas Kähäri; Eugene Kulesha; José R. Macías; Gabrielle A. Reeves; Andreas Prlić

BackgroundThe Distributed Annotation System (DAS) is a widely adopted protocol for dynamically integrating a wide range of biological data from geographically diverse sources. DAS continues to expand its applicability and evolve in response to new challenges facing integrative bioinformatics.ResultsHere we describe the various infrastructure components of DAS and present a new extended version of the DAS specification. Version 1.53E incorporates several recent developments, including its extension to serve new data types and an ontology for protein features.ConclusionOur extensions to the DAS protocol have facilitated the integration of new data types, and our improvements to the existing DAS infrastructure have addressed recent challenges. The steadily increasing numbers of available data sources demonstrates further adoption of the DAS protocol.


Nucleic Acids Research | 2001

A rapid classification protocol for the CATH Domain Database to support structural genomics

Frances M. G. Pearl; Nigel J. Martin; James E. Bray; Daniel W. A. Buchan; Andrew P. Harrison; David A. Lee; Gabrielle A. Reeves; Adrian J. Shepherd; Ian Sillitoe; Annabel E. Todd; Janet M. Thornton; Christine A. Orengo

In order to support the structural genomic initiatives, both by rapidly classifying newly determined structures and by suggesting suitable targets for structure determination, we have recently developed several new protocols for classifying structures in the CATH domain database (http://www.biochem.ucl.ac.uk/bsm/cath). These aim to increase the speed of classification of new structures using fast algorithms for structure comparison (GRATH) and to improve the sensitivity in recognising distant structural relatives by incorporating sequence information from relatives in the genomes (DomainFinder). In order to ensure the integrity of the database given the expected increase in data, the CATH Protein Family Database (CATH-PFDB), which currently includes 25,320 structural domains and a further 160,000 sequence relatives has now been installed in a relational ORACLE database. This was essential for developing more rigorous validation procedures and for allowing efficient querying of the database, particularly for genome analysis. The associated Dictionary of Homologous Superfamilies [Bray,J.E., Todd,A.E., Pearl,F.M.G., Thornton,J.M. and Orengo,C.A. (2000) Protein Eng., 13, 153-165], which provides multiple structural alignments and functional information to assist in assigning new relatives, has also been expanded recently and now includes information for 903 homologous superfamilies. In order to improve coverage of known structures, preliminary classification levels are now provided for new structures at interim stages in the classification protocol. Since a large proportion of new structures can be rapidly classified using profile-based sequence analysis [e.g. PSI-BLAST: Altschul,S.F., Madden,T.L., Schaffer,A.A., Zhang,J., Zhang,Z., Miller,W. and Lipman,D.J. (1997) Nucleic Acids Res., 25, 3389-3402], this provides preliminary classification for easily recognisable homologues, which in the latest release of CATH (version 1.7) represented nearly three-quarters of the non-identical structures.


Journal of the Royal Society Interface | 2009

Genome and proteome annotation: organization, interpretation and integration

Gabrielle A. Reeves; David Talavera; Janet M. Thornton

Recent years have seen a huge increase in the generation of genomic and proteomic data. This has been due to improvements in current biological methodologies, the development of new experimental techniques and the use of computers as support tools. All these raw data are useless if they cannot be properly analysed, annotated, stored and displayed. Consequently, a vast number of resources have been created to present the data to the wider community. Annotation tools and databases provide the means to disseminate these data and to comprehend their biological importance. This review examines the various aspects of annotation: type, methodology and availability. Moreover, it puts a special interest on novel annotation fields, such as that of phenotypes, and highlights the recent efforts focused on the integrating annotations.


Philosophical Transactions of the Royal Society B | 2006

Exploiting protein structure data to explore the evolution of protein function and biological complexity

Russell L. Marsden; Juan A. G. Ranea; Antonio Sillero; Oliver Redfern; Corin Yeats; Michael Maibaum; David A. Lee; Sarah Addou; Gabrielle A. Reeves; Timothy Dallman; Christine A. Orengo

New directions in biology are being driven by the complete sequencing of genomes, which has given us the protein repertoires of diverse organisms from all kingdoms of life. In tandem with this accumulation of sequence data, worldwide structural genomics initiatives, advanced by the development of improved technologies in X-ray crystallography and NMR, are expanding our knowledge of structural families and increasing our fold libraries. Methods for detecting remote sequence similarities have also been made more sensitive and this means that we can map domains from these structural families onto genome sequences to understand how these families are distributed throughout the genomes and reveal how they might influence the functional repertoires and biological complexities of the organisms. We have used robust protocols to assign sequences from completed genomes to domain structures in the CATH database, allowing up to 60% of domain sequences in these genomes, depending on the organism, to be assigned to a domain family of known structure. Analysis of the distribution of these families throughout bacterial genomes identified more than 300 universal families, some of which had expanded significantly in proportion to genome size. These highly expanded families are primarily involved in metabolism and regulation and appear to make major contributions to the functional repertoire and complexity of bacterial organisms. When comparisons are made across all kingdoms of life, we find a smaller set of universal domain families (approx. 140), of which families involved in protein biosynthesis are the largest conserved component. Analysis of the behaviour of other families reveals that some (e.g. those involved in metabolism, regulation) have remained highly innovative during evolution, making it harder to trace their evolutionary ancestry. Structural analyses of metabolic families provide some insights into the mechanisms of functional innovation, which include changes in domain partnerships and significant structural embellishments leading to modulation of active sites and protein interactions.


Bioinformatics | 2008

The Protein Feature Ontology

Gabrielle A. Reeves; Karen Eilbeck; Michele Magrane; Claire O'Donovan; Luisa Montecchi-Palazzi; Midori A. Harris; Sandra Orchard; Rafael C. Jimenez; Andreas Prlić; Tim Hubbard; Henning Hermjakob; Janet M. Thornton

MOTIVATION The advent of sequencing and structural genomics projects has provided a dramatic boost in the number of uncharacterized protein structures and sequences. Consequently, many computational tools have been developed to help elucidate protein function. However, such services are spread throughout the world, often with standalone web pages. Integration of these methods is needed and so far this has not been possible as there was no common vocabulary available that could be used as a standard language. RESULTS The Protein Feature Ontology has been developed to provide a structured controlled vocabulary for features on a protein sequence or structure and comprises approximately 100 positional terms, now integrated into the Sequence Ontology (SO) and 40 non-positional terms which describe features relating to the whole-protein sequence. In addition, post-translational modifications are described by using a pre-existing ontology, the Protein Modification Ontology (MOD). This ontology is being used to integrate over 150 distinct annotations provided by the BioSapiens Network of Excellence, a consortium comprising 19 partner sites in Europe. AVAILABILITY The Protein Feature Ontology can be browsed by accessing the ontology lookup service at the European Bioinformatics Institute (http://www.ebi.ac.uk/ontology-lookup/browse.do?ontName=BS).


Archive | 2008

Infrastructure for distributed protein annotation

Gabrielle A. Reeves; Andreas Prlić; R. C. Jimenez; Eugene Kulesha; Henning Hermjakob

Understanding human variation and disease often requires knowledge of a broad array of biomolecular data items, down to the role of an individual amino acid in a protein, and how mutations or alternative splicing events can change function and phenotype. There are a number of key databases that collect biomolecular information; the EMBL DNA database (Cochrane et al. 2006) and Ensembl (Flicek et al. 2007) collect annotations on genomic sequence features, the UniProt knowledge base (Bairoch et al. 2005) provides detailed annotation on protein sequences, and the Worldwide PDB member databases (Berman et al. 2007) provide protein structural information. Whilst these databases house a great deal of information on sequences and structures, the advent of high throughput methods in genome sequencing and structural genomics initiatives has produced an explosion in the quantity of uncharacterised data. As a result, the development of tools which annotate these sequences and structures by prediction or transfer of information from homologous relatives has also increased in number and diversity. These methods are crucial in order to fill in the functional space between characterised and uncharacterised protein sequences and structures.


Archive | 2008

Alternative splicing in the ENCODE protein complement

Michael L. Tress; Rita Casadio; Alejandro Giorgetti; Peter F. Hallin; Agnieszka Sierakowska Juncker; Eleonora Kulberkyte; Pier Luigi Martelli; Domenico Raimondo; Gabrielle A. Reeves; Janet M. Thornton; Anna Tramontano; Kai Wang; J.J. Wesselink; Alfonso Valencia

In eukaryotic cells transcribed precursor messenger RNA (mRNA) contains introns and exons. Precursor mRNA is converted into mature mRNA by removal of introns and the splicing together of the remaining exons by the spliceosome. Alternative splicing is the process whereby the splicing process can generate a diverse range of mature RNA transcripts from different combinations of exons and allows the cell to generate a series of distinct protein isoforms. Alternative splicing events that occur within exons that are protein coding are likely to alter the structure and biological function of the expressed protein isoform and may even create new protein functions. This has lead to suggestions that alternative splicing has the potential to expand the cellular protein repertoire (Lopez 1998; Black 2000; Modrek and Lee 2002).


Journal of Molecular Biology | 2006

Structural Diversity of Domain Superfamilies in the CATH Database

Gabrielle A. Reeves; Timothy Dallman; Oliver Redfern; Adrian Akpor; Christine A. Orengo

Collaboration


Dive into the Gabrielle A. Reeves's collaboration.

Top Co-Authors

Avatar

Janet M. Thornton

European Bioinformatics Institute

View shared research outputs
Top Co-Authors

Avatar
Top Co-Authors

Avatar

David A. Lee

Queen Mary University of London

View shared research outputs
Top Co-Authors

Avatar
Top Co-Authors

Avatar

Ian Sillitoe

University College London

View shared research outputs
Top Co-Authors

Avatar

Oliver Redfern

University College London

View shared research outputs
Top Co-Authors

Avatar

Timothy Dallman

University College London

View shared research outputs
Top Co-Authors

Avatar

Andreas Prlić

University of California

View shared research outputs
Top Co-Authors

Avatar

Adrian Akpor

University College London

View shared research outputs
Top Co-Authors

Avatar
Researchain Logo
Decentralizing Knowledge