Network


Latest external collaboration on country level. Dive into details by clicking on the dots.

Hotspot


Dive into the research topics where Olivier Bodenreider is active.

Publication


Featured researches published by Olivier Bodenreider.


Nucleic Acids Research | 2004

The Unified Medical Language System (UMLS): integrating biomedical terminology

Olivier Bodenreider

The Unified Medical Language System (http://umlsks.nlm.nih.gov) is a repository of biomedical vocabularies developed by the US National Library of Medicine. The UMLS integrates over 2 million names for some 900,000 concepts from more than 60 families of biomedical vocabularies, as well as 12 million relations among these concepts. Vocabularies integrated in the UMLS Metathesaurus include the NCBI taxonomy, Gene Ontology, the Medical Subject Headings (MeSH), OMIM and the Digital Anatomist Symbolic Knowledge Base. UMLS concepts are not only inter-related, but may also be linked to external resources such as GenBank. In addition to data, the UMLS includes tools for customizing the Metathesaurus (MetamorphoSys), for generating lexical variants of concept names (lvg) and for extracting UMLS concepts from text (MetaMap). The UMLS knowledge sources are updated quarterly. All vocabularies are available at no fee for research purposes within an institution, but UMLS users are required to sign a license agreement. The UMLS knowledge sources are distributed on CD-ROM and by FTP.


computational intelligence in bioinformatics and computational biology | 2004

Gene expression correlation and gene ontology-based similarity: an assessment of quantitative relationships

Haiying Wang; Francisco Azuaje; Olivier Bodenreider; Joaquín Dopazo

The gene ontology and annotations derived from the S.cerivisiae genome database were analyzed to calculate functional similarity of gene products. Three methods for measuring similarity (including a distance-based approach) were implemented. Significant, quantitative relationships between similarity and expression correlation of pairs of genes were detected. Using a known gene expression dataset in yeast, this study compared more than three million pairs of gene products on the basis of these functional properties. Highly correlated genes exhibit strong similarity based on information originating from the gene ontology taxonomies. Such a similarity is significantly stronger than that observed between weakly correlated genes. This study supports the feasibility of applying gene ontology-driven similarity methods to functional prediction tasks, such as the validation of gene expression analyses and the identification of false positives in protein interaction studies.


Journal of Biomedical Informatics | 2003

Exploring semantic groups through visual approaches

Olivier Bodenreider; Alexa T. McCray

Objectives. We investigate several visual approaches for exploring semantic groups, a grouping of semantic types from the Unified Medical Language System (UMLS) semantic network. We are particularly interested in the semantic coherence of the groups, and we use the semantic relationships as important indicators of that coherence. Methods. First, we create a radial representation of the number of relationships among the groups, generating a profile for each semantic group. Second, we show that, in our partition, the relationships are organized around a limited number of pivot groups and that partitions created at random do not exhibit this property. Finally, we use correspondence analysis to visualize groupings resulting from the association between semantic types and the relationships. Results. The three approaches provide different views on the semantic groups and help detect potential inconsistencies. They make outliers immediately apparent, and, thus, serve as a tool for auditing and validating both the semantic network and the semantic groups.


Journal of Web Semantics | 2006

The foundational model of anatomy in OWL: Experience and perspectives

Christine Golbreich; Songmao Zhang; Olivier Bodenreider

We present the method developed for migrating the Foundational Model of Anatomy (FMA) from its representation with frames in Protégé to its logical representation in OWL and our experience in reasoning with it. Despite the extensive use of metaclasses in Protégé, it proved possible to convert the FMA from Protégé into OWL DL, while capturing most of its original features. The conversion relies on a set of translation and enrichment rules implemented with flexible options. Unsurprisingly, reasoning with the FMA in OWL proved to be a real challenge, due to its sheer size and complexity, and raised significant inference problems in terms of time and memory requirements. However, various smaller versions have been successfully handled by Racer. Some inconsistencies were identified and several classes reclassified. The results obtained so far show the advantage of OWL DL over frames and, more generally, the usefulness of DLs reasoners for building and maintaining the large-scale biomedical ontologies of the future Semantic Web.


pacific symposium on biocomputing | 2004

NON-LEXICAL APPROACHES TO IDENTIFYING ASSOCIATIVE RELATIONS IN THE GENE ONTOLOGY

Olivier Bodenreider; Marc Aubry; Anita Burgun

The Gene Ontology (GO) is a controlled vocabulary widely used for the annotation of gene products. GO is organized in three hierarchies for molecular functions, cellular components, and biological processes but no relations are provided among terms across hierarchies. The objective of this study is to investigate three non-lexical approaches to identifying such associative relations in GO and compare them among themselves and to lexical approaches. The three approaches are: computing similarity in a vector space model, statistical analysis of co-occurrence of GO terms in annotation databases, and association rule mining. Five annotation databases (FlyBase, the Human subset of GOA, MGI, SGD, and WormBase) are used in this study. A total of 7,665 associations were identified by at least one of the three non-lexical approaches. Of these, 12% were identified by more than one approach. While there are almost 6,000 lexical relations among GO terms, only 203 associations were identified by both non-lexical and lexical approaches. The associations identified in this study could serve as the starting point for adding associative relations across hierarchies to GO, but would require manual curation. The application to quality assurance of annotation databases is also discussed.


Artificial Intelligence in Medicine | 2007

Investigating subsumption in SNOMED CT: An exploration into large description logic-based biomedical terminologies

Olivier Bodenreider; Barry Smith; Anand Kumar; Anita Burgun

OBJECTIVE Formalisms based on one or other flavor of description logic (DL) are sometimes put forward as helping to ensure that terminologies and controlled vocabularies comply with sound ontological principles. The objective of this paper is to study the degree to which one DL-based biomedical terminology (SNOMED CT) does indeed comply with such principles. MATERIALS AND METHODS We defined seven ontological principles (for example: each class must have at least one parent, each class must differ from its parent) and examined the properties of SNOMED CT classes with respect to these principles. RESULTS Our major results are 31% of these classes have a single child; 27% have multiple parents; 51% do not exhibit any differentiae between the description of the parent and that of the child. CONCLUSIONS The applications of this principles to quality assurance for ontologies are discussed and suggestions are made for dealing with the phenomenon of multiple inheritance. The advantages and limitations of our approach are also discussed.


Journal of Biomedical Semantics | 2011

The Translational Medicine Ontology and Knowledge Base: driving personalized medicine by bridging the gap between bench and bedside

Joanne S. Luciano; Bosse Andersson; Colin R. Batchelor; Olivier Bodenreider; Timothy W.I. Clark; Christine Denney; Christopher Domarew; Thomas Gambet; Lee Harland; Anja Jentzsch; Vipul Kashyap; Peter Kos; Julia Kozlovsky; Timothy Lebo; Scott M Marshall; James P. McCusker; Deborah L. McGuinness; Chimezie Ogbuji; Elgar Pichler; Robert L Powers; Eric Prud’hommeaux; Matthias Samwald; Lynn M. Schriml; Peter J. Tonellato; Patricia L. Whetzel; Jun Zhao; Susie Stephens; Michel Dumontier

BackgroundTranslational medicine requires the integration of knowledge using heterogeneous data from health care to the life sciences. Here, we describe a collaborative effort to produce a prototype Translational Medicine Knowledge Base (TMKB) capable of answering questions relating to clinical practice and pharmaceutical drug discovery.ResultsWe developed the Translational Medicine Ontology (TMO) as a unifying ontology to integrate chemical, genomic and proteomic data with disease, treatment, and electronic health records. We demonstrate the use of Semantic Web technologies in the integration of patient and biomedical data, and reveal how such a knowledge base can aid physicians in providing tailored patient care and facilitate the recruitment of patients into active clinical trials. Thus, patients, physicians and researchers may explore the knowledge base to better understand therapeutic options, efficacy, and mechanisms of action.ConclusionsThis work takes an important step in using Semantic Web technologies to facilitate integration of relevant, distributed, external sources and progress towards a computational platform to support personalized medicine.AvailabilityTMO can be downloaded from http://code.google.com/p/translationalmedicineontology and TMKB can be accessed at http://tm.semanticscience.org/sparql.


International Journal on Semantic Web and Information Systems | 2007

Experience in Aligning Anatomical Ontologies.

Songmao Zhang; Olivier Bodenreider

An ontology is a formal representation of a domain modeling the entities in the domain and their relations. When a domain is represented by multiple ontologies, there is need for creating mappings among these ontologies in order to facilitate the integration of data annotated with these ontologies and reasoning across ontologies. The objective of this paper is to recapitulate our experience in aligning large anatomical ontologies and to reflect on some of the issues and challenges encountered along the way. The four anatomical ontologies under investigation are the Foundational Model of Anatomy, GALEN, the Adult Mouse Anatomical Dictionary and the NCI Thesaurus. Their underlying representation formalisms are all different. Our approach to aligning concepts (directly) is automatic, rule-based, and operates at the schema level, generating mostly point-to-point mappings. It uses a combination of domain-specific lexical techniques and structural and semantic techniques (to validate the mappings suggested lexically). It also takes advantage of domain-specific knowledge (lexical knowledge from external resources such as the Unified Medical Language System, as well as knowledge augmentation and inference techniques). In addition to point-to-point mapping of concepts, we present the alignment of relationships and the mapping of concepts group-to-group. We have also successfully tested an indirect alignment through a domain-specific reference ontology. We present an evaluation of our techniques, both against a gold standard established manually and against a generic schema matching system. The advantages and limitations of our approach are analyzed and discussed throughout the paper.


Journal of Biomedical Informatics | 2008

An ontology-driven semantic mashup of gene and biological pathway information: Application to the domain of nicotine dependence

Satya S. Sahoo; Olivier Bodenreider; Joni L. Rutter; Karen J. Skinner; Amit P. Sheth

OBJECTIVES This paper illustrates how Semantic Web technologies (especially RDF, OWL, and SPARQL) can support information integration and make it easy to create semantic mashups (semantically integrated resources). In the context of understanding the genetic basis of nicotine dependence, we integrate gene and pathway information and show how three complex biological queries can be answered by the integrated knowledge base. METHODS We use an ontology-driven approach to integrate two gene resources (Entrez Gene and HomoloGene) and three pathway resources (KEGG, Reactome and BioCyc), for five organisms, including humans. We created the Entrez Knowledge Model (EKoM), an information model in OWL for the gene resources, and integrated it with the extant BioPAX ontology designed for pathway resources. The integrated schema is populated with data from the pathway resources, publicly available in BioPAX-compatible format, and gene resources for which a population procedure was created. The SPARQL query language is used to formulate queries over the integrated knowledge base to answer the three biological queries. RESULTS Simple SPARQL queries could easily identify hub genes, i.e., those genes whose gene products participate in many pathways or interact with many other gene products. The identification of the genes expressed in the brain turned out to be more difficult, due to the lack of a common identification scheme for proteins. CONCLUSION Semantic Web technologies provide a valid framework for information integration in the life sciences. Ontology-driven integration represents a flexible, sustainable and extensible solution to the integration of large volumes of information. Additional resources, which enable the creation of mappings between information sources, are required to compensate for heterogeneity across namespaces. RESOURCE PAGE: http://knoesis.wright.edu/research/lifesci/integration/structured_data/JBI-2008/


BMC Evolutionary Biology | 2006

Global similarity and local divergence in human and mouse gene co-expression networks

Panayiotis Tsaparas; Leonardo Mariño-Ramírez; Olivier Bodenreider; Eugene V. Koonin; I. King Jordan

BackgroundA genome-wide comparative analysis of human and mouse gene expression patterns was performed in order to evaluate the evolutionary divergence of mammalian gene expression. Tissue-specific expression profiles were analyzed for 9,105 human-mouse orthologous gene pairs across 28 tissues. Expression profiles were resolved into species-specific coexpression networks, and the topological properties of the networks were compared between species.ResultsAt the global level, the topological properties of the human and mouse gene coexpression networks are, essentially, identical. For instance, both networks have topologies with small-world and scale-free properties as well as closely similar average node degrees, clustering coefficients, and path lengths. However, the human and mouse coexpression networks are highly divergent at the local level: only a small fraction (<10%) of coexpressed gene pair relationships are conserved between the two species. A series of controls for experimental and biological variance show that most of this divergence does not result from experimental noise. We further show that, while the expression divergence between species is genuinely rapid, expression does not evolve free from selective (functional) constraint. Indeed, the coexpression networks analyzed here are demonstrably functionally coherent as indicated by the functional similarity of coexpressed gene pairs, and this pattern is most pronounced in the conserved human-mouse intersection network. Numerous dense network clusters show evidence of dedicated functions, such as spermatogenesis and immune response, that are clearly consistent with the coherence of the expression patterns of their constituent gene members.ConclusionThe dissonance between global versus local network divergence suggests that the interspecies similarity of the global network properties is of limited biological significance, at best, and that the biologically relevant aspects of the architectures of gene coexpression are specific and particular, rather than universal. Nevertheless, there is substantial evolutionary conservation of the local network structure which is compatible with the notion that gene coexpression networks are subject to purifying selection.

Collaboration


Dive into the Olivier Bodenreider's collaboration.

Top Co-Authors

Avatar

Anita Burgun

Paris Descartes University

View shared research outputs
Top Co-Authors

Avatar

Lee B. Peters

National Institutes of Health

View shared research outputs
Top Co-Authors

Avatar
Top Co-Authors

Avatar
Top Co-Authors

Avatar
Top Co-Authors

Avatar
Top Co-Authors

Avatar

Songmao Zhang

Chinese Academy of Sciences

View shared research outputs
Top Co-Authors

Avatar

Kelly Zeng

National Institutes of Health

View shared research outputs
Top Co-Authors

Avatar

Kin Wah Fung

National Institutes of Health

View shared research outputs
Top Co-Authors

Avatar

Stuart J. Nelson

National Institutes of Health

View shared research outputs
Researchain Logo
Decentralizing Knowledge