Tudor Groza
Garvan Institute of Medical Research
Network
Latest external collaboration on country level. Dive into details by clicking on the dots.
Publication
Featured researches published by Tudor Groza.
Nucleic Acids Research | 2017
Sebastian Köhler; Nicole Vasilevsky; Mark Engelstad; Erin Foster; Julie McMurry; Ségolène Aymé; Gareth Baynam; Susan M. Bello; Cornelius F. Boerkoel; Kym M. Boycott; Michael Brudno; Orion J. Buske; Patrick F. Chinnery; Valentina Cipriani; Laureen E. Connell; Hugh Dawkins; Laura E. DeMare; Andrew Devereau; Bert B.A. de Vries; Helen V. Firth; Kathleen Freson; Daniel Greene; Ada Hamosh; Ingo Helbig; Courtney Hum; Johanna A. Jähn; Roger James; Roland Krause; Stanley J. F. Laulederkind; Hanns Lochmüller
Deep phenotyping has been defined as the precise and comprehensive analysis of phenotypic abnormalities in which the individual components of the phenotype are observed and described. The three components of the Human Phenotype Ontology (HPO; www.human-phenotype-ontology.org) project are the phenotype vocabulary, disease-phenotype annotations and the algorithms that operate on these. These components are being used for computational deep phenotyping and precision medicine as well as integration of clinical data into translational research. The HPO is being increasingly adopted as a standard for phenotypic abnormalities by diverse groups such as international rare disease organizations, registries, clinical labs, biomedical resources, and clinical software tools and will thereby contribute toward nascent efforts at global data exchange for identifying disease etiologies. This update article reviews the progress of the HPO project since the debut Nucleic Acids Research database article in 2014, including specific areas of expansion such as common (complex) disease, new algorithms for phenotype driven genomic discovery and diagnostics, integration of cross-species mapping efforts with the Mammalian Phenotype Ontology, an improved quality control pipeline, and the addition of patient-friendly terminology.
American Journal of Human Genetics | 2015
Tudor Groza; Sebastian Köhler; Dawid Moldenhauer; Nicole Vasilevsky; Gareth Baynam; Tomasz Zemojtel; Lynn M. Schriml; Warren A. Kibbe; Paul N. Schofield; Tim Beck; Drashtti Vasant; Anthony J. Brookes; Andreas Zankl; Nicole L. Washington; Christopher J. Mungall; Suzanna E. Lewis; Melissa Haendel; Helen Parkinson; Peter N. Robinson
The Human Phenotype Ontology (HPO) is widely used in the rare disease community for differential diagnostics, phenotype-driven analysis of next-generation sequence-variation data, and translational research, but a comparable resource has not been available for common disease. Here, we have developed a concept-recognition procedure that analyzes the frequencies of HPO disease annotations as identified in over five million PubMed abstracts by employing an iterative procedure to optimize precision and recall of the identified terms. We derived disease models for 3,145 common human diseases comprising a total of 132,006 HPO annotations. The HPO now comprises over 250,000 phenotypic annotations for over 10,000 rare and common diseases and can be used for examining the phenotypic overlap among common diseases that share risk alleles, as well as between Mendelian diseases and common diseases linked by genomic location. The annotations, as well as the HPO itself, are freely available.
european semantic web conference | 2007
Tudor Groza; Siegfried Handschuh; Knud Möller; Stefan Decker
Machine-understandable data constitutes the foundation for the Semantic Web. This paper presents a viable way for authoring and annotating Semantic Documents on the desktop. In our approach, the PDF file format is the container for document semantics, being able to store both the content and the related metadata in a single file. To achieve this, we provide a framework (SALT - Semantically Annotated
Semantic Web archive | 2013
Jodi Schneider; Tudor Groza; Alexandre Passant
\mbox{\LaTeX}
Nucleic Acids Research | 2017
Christopher J. Mungall; Julie McMurry; Sebastian Köhler; James P. Balhoff; Charles D. Borromeo; Matthew H. Brush; Seth Carbon; Tom Conlin; Nathan Dunn; Mark Engelstad; Erin Foster; Jean-Philippe F. Gourdine; Julius Jacobsen; Daniel Keith; Bryan Laraway; Suzanna E. Lewis; Jeremy NguyenXuan; Kent Shefchek; Nicole Vasilevsky; Zhou Yuan; Nicole L. Washington; Harry Hochheiser; Tudor Groza; Damian Smedley; Peter N. Robinson; Melissa Haendel
), that extends the
Database | 2015
Tudor Groza; Sebastian Köhler; Sandra C. Doelken; Nigel Collier; Anika Oellrich; Damian Smedley; Francisco M. Couto; Gareth Baynam; Andreas Zankl; Peter N. Robinson
\mbox{\LaTeX}
Human Mutation | 2015
Christopher J. Mungall; Nicole L. Washington; Jeremy NguyenXuan; Christopher Condit; Damian Smedley; Sebastian Köhler; Tudor Groza; Kent Shefchek; Harry Hochheiser; Peter N. Robinson; Suzanna E. Lewis; Melissa Haendel
writing environment and supports the creation of metadata for scientific publications. SALT allows the author to create metadata concurrently, i.e. while in the process of writing a document. We discuss some of the requirements which have to be met when developing such a support for creating semantic documents. In addition, we describe a usage scenario to show the feasability and benefit of our approach.
BMC Bioinformatics | 2012
Tudor Groza; Jane Hunter; Andreas Zankl
Argumentation represents the study of views and opinions that humans express with the goal of reaching a conclusion through logical reasoning. Since the 1950s, several models have been proposed to capture the essence of informal argumentation in different settings. With the emergence of the Web, and then the Semantic Web, this modeling shifted towards ontologies, while from the development perspective, we witnessed an important increase in Web 2.0 human-centered collaborative deliberation tools. Through a review of more than 150 scholarly papers, this article provides a comprehensive and comparative overview of approaches to modeling argumentation for the Social Semantic Web. We start from theoretical foundational models and investigate how they have influenced Social Web tools. We also look into Semantic Web argumentation models. Finally we end with Social Web tools for argumentation, including online applications combining Web 2.0 and Semantic Web technologies, following the path to a global World Wide Argument Web.
Journal of Biomedical Informatics | 2013
Tudor Groza; Tania Tudorache; Michel Dumontier
The correlation of phenotypic outcomes with genetic variation and environmental factors is a core pursuit in biology and biomedicine. Numerous challenges impede our progress: patient phenotypes may not match known diseases, candidate variants may be in genes that have not been characterized, model organisms may not recapitulate human or veterinary diseases, filling evolutionary gaps is difficult, and many resources must be queried to find potentially significant genotype–phenotype associations. Non-human organisms have proven instrumental in revealing biological mechanisms. Advanced informatics tools can identify phenotypically relevant disease models in research and diagnostic contexts. Large-scale integration of model organism and clinical research data can provide a breadth of knowledge not available from individual sources and can provide contextualization of data back to these sources. The Monarch Initiative (monarchinitiative.org) is a collaborative, open science effort that aims to semantically integrate genotype–phenotype data from many species and sources in order to support precision medicine, disease modeling, and mechanistic exploration. Our integrated knowledge graph, analytic tools, and web services enable diverse users to explore relationships between phenotypes and genotypes across species.
Journal of Paediatrics and Child Health | 2014
Gareth Baynam; Mark Walters; Peter Claes; Stefanie Kung; Peter N. LeSouëf; Hugh Dawkins; M. Bellgard; Marta Girdea; Michael Brudno; Peter N. Robinson; Andreas Zankl; Tudor Groza; David Gillett; Jack Goldblatt
Concept recognition tools rely on the availability of textual corpora to assess their performance and enable the identification of areas for improvement. Typically, corpora are developed for specific purposes, such as gene name recognition. Gene and protein name identification are longstanding goals of biomedical text mining, and therefore a number of different corpora exist. However, phenotypes only recently became an entity of interest for specialized concept recognition systems, and hardly any annotated text is available for performance testing and training. Here, we present a unique corpus, capturing text spans from 228 abstracts manually annotated with Human Phenotype Ontology (HPO) concepts and harmonized by three curators, which can be used as a reference standard for free text annotation of human phenotypes. Furthermore, we developed a test suite for standardized concept recognition error analysis, incorporating 32 different types of test cases corresponding to 2164 HPO concepts. Finally, three established phenotype concept recognizers (NCBO Annotator, OBO Annotator and Bio-LarK CR) were comprehensively evaluated, and results are reported against both the text corpus and the test suites. The gold standard and test suites corpora are available from http://bio-lark.org/hpo_res.html. Database URL: http://bio-lark.org/hpo_res.html