Network


Latest external collaboration on country level. Dive into details by clicking on the dots.

Hotspot


Dive into the research topics where Claire O’Donovan is active.

Publication


Featured researches published by Claire O’Donovan.


Nucleic Acids Research | 2004

UniProt: the Universal Protein knowledgebase

Rolf Apweiler; Amos Marc Bairoch; Cathy H. Wu; Winona C. Barker; Brigitte Boeckmann; Serenella Ferro; Elisabeth Gasteiger; Hongzhan Huang; Rodrigo Lopez; Michele Magrane; María Martín; Darren A. Natale; Claire O’Donovan; Nicole Redaschi; Lai-Su L. Yeh

To provide the scientific community with a single, centralized, authoritative resource for protein sequences and functional information, the Swiss-Prot, TrEMBL and PIR protein database activities have united to form the Universal Protein Knowledgebase (UniProt) consortium. Our mission is to provide a comprehensive, fully classified, richly and accurately annotated protein sequence knowledgebase, with extensive cross-references and query interfaces. The central database will have two sections, corresponding to the familiar Swiss-Prot (fully manually curated entries) and TrEMBL (enriched with automated classification, annotation and extensive cross-references). For convenient sequence searches, UniProt also provides several non-redundant sequence databases. The UniProt NREF (UniRef) databases provide representative subsets of the knowledgebase suitable for efficient searching. The comprehensive UniProt Archive (UniParc) is updated daily from many public source databases. The UniProt databases can be accessed online (http://www.uniprot.org) or downloaded in several formats (ftp://ftp.uniprot.org/pub). The scientific community is encouraged to submit data for inclusion in UniProt.


Nucleic Acids Research | 2012

SIFTS: Structure Integration with Function, Taxonomy and Sequences resource

Sameer Velankar; Jose M. Dana; Julius Jacobsen; Glen van Ginkel; Paul J. Gane; Jie Luo; Thomas J. Oldfield; Claire O’Donovan; Maria-Jesus Martin; Gerard J. Kleywegt

The Structure Integration with Function, Taxonomy and Sequences resource (SIFTS; http://pdbe.org/sifts) is a close collaboration between the Protein Data Bank in Europe (PDBe) and UniProt. The two teams have developed a semi-automated process for maintaining up-to-date cross-reference information to UniProt entries, for all protein chains in the PDB entries present in the UniProt database. This process is carried out for every weekly PDB release and the information is stored in the SIFTS database. The SIFTS process includes cross-references to other biological resources such as Pfam, SCOP, CATH, GO, InterPro and the NCBI taxonomy database. The information is exported in XML format, one file for each PDB entry, and is made available by FTP. Many bioinformatics resources use SIFTS data to obtain cross-references between the PDB and other biological databases so as to provide their users with up-to-date information.


Methods of Molecular Biology | 2017

UniProt Protein Knowledgebase

Sangya Pundir; María Martín; Claire O’Donovan

The Universal Protein Resource (UniProt) is a freely available comprehensive resource for protein sequence and annotation data. UniProt is a collaboration between the European Bioinformatics Institute (EMBL-EBI), the Swiss Institute of Bioinformatics (SIB), and the Protein Information Resource (PIR). Across the three institutes more than 100 people are involved through different tasks such as expert curation, software development, and support.This chapter introduces the functionality and data provided by UniProt. It describes example use cases for which you might come to UniProt and the methods to help you achieve your goals.


GigaScience | 2014

Understanding how and why the Gene Ontology and its annotations evolve: the GO within UniProt

Rachael P. Huntley; Tony Sawford; María Martín; Claire O’Donovan

The Gene Ontology Consortium (GOC) is a major bioinformatics project that provides structured controlled vocabularies to classify gene product function and location. GOC members create annotations to gene products using the Gene Ontology (GO) vocabularies, thus providing an extensive, publicly available resource. The GO and its annotations to gene products are now an integral part of functional analysis, and statistical tests using GO data are becoming routine for researchers to include when publishing functional information. While many helpful articles about the GOC are available, there are certain updates to the ontology and annotation sets that sometimes go unobserved. Here we describe some of the ways in which GO can change that should be carefully considered by all users of GO as they may have a significant impact on the resulting gene product annotations, and therefore the functional description of the gene product, or the interpretation of analyses performed on GO datasets. GO annotations for gene products change for many reasons, and while these changes generally improve the accuracy of the representation of the underlying biology, they do not necessarily imply that previous annotations were incorrect. We additionally describe the quality assurance mechanisms we employ to improve the accuracy of annotations, which necessarily changes the composition of the annotation sets we provide. We use the Universal Protein Resource (UniProt) for illustrative purposes of how the GO Consortium, as a whole, manages these changes.


Standards in Genomic Sciences | 2011

Solving the Problem: Genome Annotation Standards before the Data Deluge

William Klimke; Claire O’Donovan; Owen White; J. Rodney Brister; Karen Clark; Boris Fedorov; Ilene Mizrachi; Kim D. Pruitt; Tatiana Tatusova

The promise of genome sequencing was that the vast undiscovered country would be mapped out by comparison of the multitude of sequences available and would aid researchers in deciphering the role of each gene in every organism. Researchers recognize that there is a need for high quality data. However, different annotation procedures, numerous databases, and a diminishing percentage of experimentally determined gene functions have resulted in a spectrum of annotation quality. NCBI in collaboration with sequencing centers, archival databases, and researchers, has developed the first international annotation standards, a fundamental step in ensuring that high quality complete prokaryotic genomes are available as gold standard references. Highlights include the development of annotation assessment tools, community acceptance of protein naming standards, comparison of annotation resources to provide consistent annotation, and improved tracking of the evidence used to generate a particular annotation. The development of a set of minimal standards, including the requirement for annotated complete prokaryotic genomes to contain a full set of ribosomal RNAs, transfer RNAs, and proteins encoding core conserved functions, is an historic milestone. The use of these standards in existing genomes and future submissions will increase the quality of databases, enabling researchers to make accurate biological discoveries.


Journal of Biotechnology | 2000

The role SWISS-PROT and TrEMBL play in the genome research environment.

Vivien Junker; Sergio Contrino; Wolfgang Fleischmann; Henning Hermjakob; Fiona Lang; Michele Magrane; María Martín; Nicoletta Mitaritonna; Claire O’Donovan; Rolf Apweiler

SWISS-PROT, a curated protein sequence data bank, contains not only sequence data but also annotation relevant to a particular sequence. The annotation added to each entry is done by a team of biologists and comes, primarily, from articles in journals reporting the actual sequencing and sometimes characterisation. Review articles and collaboration with external experts also play a role along with the use of secondary databases like PROSITE and Pfam in addition to a variety of feature prediction methods. Annotation added by these methods is checked for relevance and likelihood to a particular sequence. The onset of genome sequencing has led to a dramatic increase in sequence data to be included in SWISS-PROT. This has led to the production of TrEMBL (Translation of the EMBL database). TrEMBL consists of entries in a SWISS-PROT format that are derived from the translation of all coding sequences in the EMBL nucleotide sequence database, that are not in SWISS-PROT. Unlike SWISS-PROT entries those in TrEMBL are awaiting manual annotation. However, rather than just representing basic sequence and source information, steps have been taken to add features and annotation automatically. In taking these steps it is hoped that TrEMBL entries are enhanced with some indication as to what a protein is, could or may be.


Database | 2013

Use of Gene Ontology Annotation to understand the peroxisome proteome in humans

Prudence Mutowo-Meullenet; Rachael P. Huntley; Emily Dimmer; Yasmin Alam-Faruque; Tony Sawford; María Martín; Claire O’Donovan; Rolf Apweiler

The Gene Ontology (GO) is the de facto standard for the functional description of gene products, providing a consistent, information-rich terminology applicable across species and information repositories. The UniProt Consortium uses both manual and automatic GO annotation approaches to curate UniProt Knowledgebase (UniProtKB) entries. The selection of a protein set prioritized for manual annotation has implications for the characteristics of the information provided to users working in a specific field or interested in particular pathways or processes. In this article, we describe an organelle-focused, manual curation initiative targeting proteins from the human peroxisome. We discuss the steps taken to define the peroxisome proteome and the challenges encountered in defining the boundaries of this protein set. We illustrate with the use of examples how GO annotations now capture cell and tissue type information and the advantages that such an annotation approach provides to users. Database URL: http://www.ebi.ac.uk/GOA/ and http://www.uniprot.org


Methods of Molecular Biology | 2011

A Guide to UniProt for Protein Scientists

Claire O’Donovan; Rolf Apweiler

One of the essential requirements of the proteomics community is a high quality annotated nonredundant protein sequence database with stable identifiers and an archival service to enable protein identification and characterization. The scope of this chapter is to illustrate how Universal Protein Resource (UniProt) (The UniProt Consortium, Nucleic Acids Res. 38:D142-D148, 2010) can be best utilized for proteomics purposes with a particular focus on exploiting the knowledge captured in the UniProt databases, the services provided and the availability of complete proteomes.


PLOS ONE | 2014

Representing kidney development using the gene ontology.

Yasmin Alam-Faruque; David P. Hill; Emily Dimmer; Midori A. Harris; Rebecca E. Foulger; Susan Tweedie; Helen Attrill; Douglas G. Howe; Stephen Randall Thomas; Duncan Davidson; Adrian S. Woolf; Judith A. Blake; Christopher J. Mungall; Claire O’Donovan; Rolf Apweiler; Rachael P. Huntley

Gene Ontology (GO) provides dynamic controlled vocabularies to aid in the description of the functional biological attributes and subcellular locations of gene products from all taxonomic groups (www.geneontology.org). Here we describe collaboration between the renal biomedical research community and the GO Consortium to improve the quality and quantity of GO terms describing renal development. In the associated annotation activity, the new and revised terms were associated with gene products involved in renal development and function. This project resulted in a total of 522 GO terms being added to the ontology and the creation of approximately 9,600 kidney-related GO term associations to 940 UniProt Knowledgebase (UniProtKB) entries, covering 66 taxonomic groups. We demonstrate the impact of these improvements on the interpretation of GO term analyses performed on genes differentially expressed in kidney glomeruli affected by diabetic nephropathy. In summary, we have produced a resource that can be utilized in the interpretation of data from small- and large-scale experiments investigating molecular mechanisms of kidney function and development and thereby help towards alleviating renal disease.


Database | 2012

Recent advances in biocuration: Meeting report from the Fifth International Biocuration Conference

Pascale Gaudet; Cecilia N. Arighi; Frederic B. Bastian; Alex Bateman; Judith A. Blake; Michael J. Cherry; Peter D’Eustachio; Robert D. Finn; Michelle G. Giglio; Lynette Hirschman; Renate Kania; William Klimke; María Martín; Ilene Karsch-Mizrachi; Monica Munoz-Torres; Darren A. Natale; Claire O’Donovan; Francis Ouellette; Kim D. Pruitt; Marc Robinson-Rechavi; Susanna-Assunta Sansone; Paul N. Schofield; Granger Sutton; Kimberly Van Auken; Sona Vasudevan; Cathy H. Wu; Jasmine Young; Raja Mazumder

The 5th International Biocuration Conference brought together over 300 scientists to exchange on their work, as well as discuss issues relevant to the International Society for Biocuration’s (ISB) mission. Recurring themes this year included the creation and promotion of gold standards, the need for more ontologies, and more formal interactions with journals. The conference is an essential part of the ISBs goal to support exchanges among members of the biocuration community. Next years conference will be held in Cambridge, UK, from 7 to 10 April 2013. In the meanwhile, the ISB website provides information about the societys activities (http://biocurator.org), as well as related events of interest.

Collaboration


Dive into the Claire O’Donovan's collaboration.

Top Co-Authors

Avatar

María Martín

European Bioinformatics Institute

View shared research outputs
Top Co-Authors

Avatar

Rolf Apweiler

European Bioinformatics Institute

View shared research outputs
Top Co-Authors

Avatar

Rachael P. Huntley

European Bioinformatics Institute

View shared research outputs
Top Co-Authors

Avatar

Henning Hermjakob

European Bioinformatics Institute

View shared research outputs
Top Co-Authors

Avatar

Tony Sawford

European Bioinformatics Institute

View shared research outputs
Top Co-Authors

Avatar

Cathy H. Wu

University of Delaware

View shared research outputs
Top Co-Authors

Avatar

Darren A. Natale

Georgetown University Medical Center

View shared research outputs
Top Co-Authors

Avatar

Kim D. Pruitt

National Institutes of Health

View shared research outputs
Top Co-Authors

Avatar

Tatiana Tatusova

National Institutes of Health

View shared research outputs
Top Co-Authors

Avatar

William Klimke

National Institutes of Health

View shared research outputs
Researchain Logo
Decentralizing Knowledge