Network


Latest external collaboration on country level. Dive into details by clicking on the dots.

Hotspot


Dive into the research topics where Nicole Vasilevsky is active.

Publication


Featured researches published by Nicole Vasilevsky.


Nucleic Acids Research | 2017

The Human Phenotype Ontology in 2017

Sebastian Köhler; Nicole Vasilevsky; Mark Engelstad; Erin Foster; Julie McMurry; Ségolène Aymé; Gareth Baynam; Susan M. Bello; Cornelius F. Boerkoel; Kym M. Boycott; Michael Brudno; Orion J. Buske; Patrick F. Chinnery; Valentina Cipriani; Laureen E. Connell; Hugh Dawkins; Laura E. DeMare; Andrew Devereau; Bert B.A. de Vries; Helen V. Firth; Kathleen Freson; Daniel Greene; Ada Hamosh; Ingo Helbig; Courtney Hum; Johanna A. Jähn; Roger James; Roland Krause; Stanley J. F. Laulederkind; Hanns Lochmüller

Deep phenotyping has been defined as the precise and comprehensive analysis of phenotypic abnormalities in which the individual components of the phenotype are observed and described. The three components of the Human Phenotype Ontology (HPO; www.human-phenotype-ontology.org) project are the phenotype vocabulary, disease-phenotype annotations and the algorithms that operate on these. These components are being used for computational deep phenotyping and precision medicine as well as integration of clinical data into translational research. The HPO is being increasingly adopted as a standard for phenotypic abnormalities by diverse groups such as international rare disease organizations, registries, clinical labs, biomedical resources, and clinical software tools and will thereby contribute toward nascent efforts at global data exchange for identifying disease etiologies. This update article reviews the progress of the HPO project since the debut Nucleic Acids Research database article in 2014, including specific areas of expansion such as common (complex) disease, new algorithms for phenotype driven genomic discovery and diagnostics, integration of cross-species mapping efforts with the Mammalian Phenotype Ontology, an improved quality control pipeline, and the addition of patient-friendly terminology.


PeerJ | 2013

On the reproducibility of science: unique identification of research resources in the biomedical literature

Nicole Vasilevsky; Matthew H. Brush; Holly Paddock; Laura Ponting; Shreejoy J. Tripathy; Gregory M. LaRocca; Melissa Haendel

Scientific reproducibility has been at the forefront of many news stories and there exist numerous initiatives to help address this problem. We posit that a contributor is simply a lack of specificity that is required to enable adequate research reproducibility. In particular, the inability to uniquely identify research resources, such as antibodies and model organisms, makes it difficult or impossible to reproduce experiments even where the science is otherwise sound. In order to better understand the magnitude of this problem, we designed an experiment to ascertain the “identifiability” of research resources in the biomedical literature. We evaluated recent journal articles in the fields of Neuroscience, Developmental Biology, Immunology, Cell and Molecular Biology and General Biology, selected randomly based on a diversity of impact factors for the journals, publishers, and experimental method reporting guidelines. We attempted to uniquely identify model organisms (mouse, rat, zebrafish, worm, fly and yeast), antibodies, knockdown reagents (morpholinos or RNAi), constructs, and cell lines. Specific criteria were developed to determine if a resource was uniquely identifiable, and included examining relevant repositories (such as model organism databases, and the Antibody Registry), as well as vendor sites. The results of this experiment show that 54% of resources are not uniquely identifiable in publications, regardless of domain, journal impact factor, or reporting requirements. For example, in many cases the organism strain in which the experiment was performed or antibody that was used could not be identified. Our results show that identifiability is a serious problem for reproducibility. Based on these results, we provide recommendations to authors, reviewers, journal editors, vendors, and publishers. Scientific efficiency and reproducibility depend upon a research-wide improvement of this substantial problem in science today.


American Journal of Human Genetics | 2015

The human phenotype ontology: semantic unification of common and rare disease

Tudor Groza; Sebastian Köhler; Dawid Moldenhauer; Nicole Vasilevsky; Gareth Baynam; Tomasz Zemojtel; Lynn M. Schriml; Warren A. Kibbe; Paul N. Schofield; Tim Beck; Drashtti Vasant; Anthony J. Brookes; Andreas Zankl; Nicole L. Washington; Christopher J. Mungall; Suzanna E. Lewis; Melissa Haendel; Helen Parkinson; Peter N. Robinson

The Human Phenotype Ontology (HPO) is widely used in the rare disease community for differential diagnostics, phenotype-driven analysis of next-generation sequence-variation data, and translational research, but a comparable resource has not been available for common disease. Here, we have developed a concept-recognition procedure that analyzes the frequencies of HPO disease annotations as identified in over five million PubMed abstracts by employing an iterative procedure to optimize precision and recall of the identified terms. We derived disease models for 3,145 common human diseases comprising a total of 132,006 HPO annotations. The HPO now comprises over 250,000 phenotypic annotations for over 10,000 rare and common diseases and can be used for examining the phenotypic overlap among common diseases that share risk alleles, as well as between Mendelian diseases and common diseases linked by genomic location. The annotations, as well as the HPO itself, are freely available.


Database | 2016

Overview of the interactive task in BioCreative V

Qinghua Wang; Shabbir Syed Abdul; Lara Monteiro Almeida; Sophia Ananiadou; Yalbi Itzel Balderas-Martínez; Riza Theresa Batista-Navarro; David Campos; Lucy Chilton; Hui-Jou Chou; Gabriela Contreras; Laurel Cooper; Hong-Jie Dai; Barbra Ferrell; Juliane Fluck; Socorro Gama-Castro; Nancy George; Georgios V. Gkoutos; Afroza Khanam Irin; Lars Juhl Jensen; Silvia Jimenez; Toni Rose Jue; Ingrid M. Keseler; Sumit Madan; Sérgio Matos; Peter McQuilton; Marija Milacic; Matthew Mort; Jeyakumar Natarajan; Evangelos Pafilis; Emiliano Pereira

Fully automated text mining (TM) systems promote efficient literature searching, retrieval, and review but are not sufficient to produce ready-to-consume curated documents. These systems are not meant to replace biocurators, but instead to assist them in one or more literature curation steps. To do so, the user interface is an important aspect that needs to be considered for tool adoption. The BioCreative Interactive task (IAT) is a track designed for exploring user-system interactions, promoting development of useful TM tools, and providing a communication channel between the biocuration and the TM communities. In BioCreative V, the IAT track followed a format similar to previous interactive tracks, where the utility and usability of TM tools, as well as the generation of use cases, have been the focal points. The proposed curation tasks are user-centric and formally evaluated by biocurators. In BioCreative V IAT, seven TM systems and 43 biocurators participated. Two levels of user participation were offered to broaden curator involvement and obtain more feedback on usability aspects. The full level participation involved training on the system, curation of a set of documents with and without TM assistance, tracking of time-on-task, and completion of a user survey. The partial level participation was designed to focus on usability aspects of the interface and not the performance per se. In this case, biocurators navigated the system by performing pre-designed tasks and then were asked whether they were able to achieve the task and the level of difficulty in completing the task. In this manuscript, we describe the development of the interactive task, from planning to execution and discuss major findings for the systems tested. Database URL: http://www.biocreative.org


PLOS Biology | 2012

Dealing with Data: A Case Study on Information and Data Management Literacy

Melissa Haendel; Nicole Vasilevsky; Jacqueline A. Wirz

The launch of the eagle-i Consortium, a collaborative network for sharing information about research resources, such as protocols and reagents, provides a vivid demonstration of the challenges that researchers, libraries and institutions face in making their data available to others.


Nucleic Acids Research | 2017

The Monarch Initiative: an integrative data and analytic platform connecting phenotypes to genotypes across species.

Christopher J. Mungall; Julie McMurry; Sebastian Köhler; James P. Balhoff; Charles D. Borromeo; Matthew H. Brush; Seth Carbon; Tom Conlin; Nathan Dunn; Mark Engelstad; Erin Foster; Jean-Philippe F. Gourdine; Julius Jacobsen; Daniel Keith; Bryan Laraway; Suzanna E. Lewis; Jeremy NguyenXuan; Kent Shefchek; Nicole Vasilevsky; Zhou Yuan; Nicole L. Washington; Harry Hochheiser; Tudor Groza; Damian Smedley; Peter N. Robinson; Melissa Haendel

The correlation of phenotypic outcomes with genetic variation and environmental factors is a core pursuit in biology and biomedicine. Numerous challenges impede our progress: patient phenotypes may not match known diseases, candidate variants may be in genes that have not been characterized, model organisms may not recapitulate human or veterinary diseases, filling evolutionary gaps is difficult, and many resources must be queried to find potentially significant genotype–phenotype associations. Non-human organisms have proven instrumental in revealing biological mechanisms. Advanced informatics tools can identify phenotypically relevant disease models in research and diagnostic contexts. Large-scale integration of model organism and clinical research data can provide a breadth of knowledge not available from individual sources and can provide contextualization of data back to these sources. The Monarch Initiative (monarchinitiative.org) is a collaborative, open science effort that aims to semantically integrate genotype–phenotype data from many species and sources in order to support precision medicine, disease modeling, and mechanistic exploration. Our integrated knowledge graph, analytic tools, and web services enable diverse users to explore relationships between phenotypes and genotypes across species.


Genetics in Medicine | 2016

Computational evaluation of exome sequence data using human and model organism phenotypes improves diagnostic efficiency.

William P. Bone; Nicole L. Washington; Orion J. Buske; David Adams; Joie Davis; David D. Draper; Elise Flynn; Marta Girdea; Rena Godfrey; Gretchen Golas; Catherine Groden; Julius Jacobsen; Sebastian Köhler; Elizabeth M.J. Lee; Amanda E. Links; Thomas C. Markello; Christopher J. Mungall; Michele E. Nehrebecky; Peter N. Robinson; Murat Sincan; Ariane Soldatos; Cynthia J. Tifft; Camilo Toro; Heather Trang; Elise Valkanas; Nicole Vasilevsky; Colleen Wahl; Lynne A. Wolfe; Cornelius F. Boerkoel; Michael Brudno

Purpose:Medical diagnosis and molecular or biochemical confirmation typically rely on the knowledge of the clinician. Although this is very difficult in extremely rare diseases, we hypothesized that the recording of patient phenotypes in Human Phenotype Ontology (HPO) terms and computationally ranking putative disease-associated sequence variants improves diagnosis, particularly for patients with atypical clinical profiles.Methods:Using simulated exomes and the National Institutes of Health Undiagnosed Diseases Program (UDP) patient cohort and associated exome sequence, we tested our hypothesis using Exomiser. Exomiser ranks candidate variants based on patient phenotype similarity to (i) known disease–gene phenotypes, (ii) model organism phenotypes of candidate orthologs, and (iii) phenotypes of protein–protein association neighbors.Results:Benchmarking showed Exomiser ranked the causal variant as the top hit in 97% of known disease–gene associations and ranked the correct seeded variant in up to 87% when detectable disease–gene associations were unavailable. Using UDP data, Exomiser ranked the causative variant(s) within the top 10 variants for 11 previously diagnosed variants and achieved a diagnosis for 4 of 23 cases undiagnosed by clinical evaluation.Conclusion:Structured phenotyping of patients and computational analysis are effective adjuncts for diagnosing patients with genetic disorders.Genet Med 18 6, 608–617.


Database | 2012

Research resources: curating the new eagle-i discovery system

Nicole Vasilevsky; Tenille Johnson; Karen Corday; Carlo Torniai; Matthew H. Brush; Erik Segerdell; Melanie Wilson; Chris Shaffer; David W. Robinson; Melissa Haendel

Development of biocuration processes and guidelines for new data types or projects is a challenging task. Each project finds its way toward defining annotation standards and ensuring data consistency with varying degrees of planning and different tools to support and/or report on consistency. Further, this process may be data type specific even within the context of a single project. This article describes our experiences with eagle-i, a 2-year pilot project to develop a federated network of data repositories in which unpublished, unshared or otherwise ‘invisible’ scientific resources could be inventoried and made accessible to the scientific community. During the course of eagle-i development, the main challenges we experienced related to the difficulty of collecting and curating data while the system and the data model were simultaneously built, and a deficiency and diversity of data management strategies in the laboratories from which the source data was obtained. We discuss our approach to biocuration and the importance of improving information management strategies to the research process, specifically with regard to the inventorying and usage of research resources. Finally, we highlight the commonalities and differences between eagle-i and similar efforts with the hope that our lessons learned will assist other biocuration endeavors. Database URL: www.eagle-i.net


Brain and behavior | 2016

The Resource Identification Initiative: a cultural shift in publishing

Anita Bandrowski; Matthew H. Brush; Jeffery S. Grethe; Melissa Haendel; David N. Kennedy; Sean Hill; Patrick R. Hof; Maryann E. Martone; Maaike Pols; Serena C. Tan; Nicole L. Washington; Elena Zudilova-Seinstra; Nicole Vasilevsky

A central tenet in support of research reproducibility is the ability to uniquely identify research resources, that is, reagents, tools, and materials that are used to perform experiments. However, current reporting practices for research resources are insufficient to identify the exact resources that are reported or to answer basic questions such as “How did other studies use resource X?” To address this issue, the Resource Identification Initiative was launched as a pilot project to improve the reporting standards for research resources in the methods sections of papers and thereby improve identifiability and scientific reproducibility. The pilot engaged over 25 biomedical journal editors from most major publishers, as well as scientists and funding officials. Authors were asked to include Research Resource Identifiers (RRIDs) in their manuscripts prior to publication for three resource types: antibodies, model organisms, and tools (i.e., software and databases). RRIDs are assigned by an authoritative database, for example, a model organism database for each type of resource. To make it easier for authors to obtain RRIDs, resources were aggregated from the appropriate databases and their RRIDs made available in a central web portal ( http://scicrunch.org/resources). RRIDs meet three key criteria: they are machine readable, free to generate and access, and are consistent across publishers and journals. The pilot was launched in February of 2014 and over 300 papers have appeared that report RRIDs. The number of journals participating has expanded from the original 25 to more than 40 with RRIDs appearing in 62 different journals to date. Here, we present an overview of the pilot project and its outcomes to date. We show that authors are able to identify resources and are supportive of the goals of the project. Identifiability of the resources post‐pilot showed a dramatic improvement for all three resource types, suggesting that the project has had a significant impact on identifiability of research resources.


Genetics | 2016

Navigating the Phenotype Frontier: The Monarch Initiative

Julie McMurry; Sebastian Köhler; Nicole L. Washington; James P. Balhoff; Charles D. Borromeo; Matthew H. Brush; Seth Carbon; Tom Conlin; Nathan Dunn; Mark Engelstad; Erin Foster; Jean Philippe Gourdine; Julius Jacobsen; Daniel Keith; Bryan Laraway; Jeremy Nguyen Xuan; Kent Shefchek; Nicole Vasilevsky; Zhou Yuan; Suzanna E. Lewis; Harry Hochheiser; Tudor Groza; Damian Smedley; Peter N. Robinson; Christopher J. Mungall; Melissa Haendel

The principles of genetics apply across the entire tree of life. At the cellular level we share biological mechanisms with species from which we diverged millions, even billions of years ago. We can exploit this common ancestry to learn about health and disease, by analyzing DNA and protein sequences, but also through the observable outcomes of genetic differences, i.e. phenotypes. To solve challenging disease problems we need to unify the heterogeneous data that relates genomics to disease traits. Without a big-picture view of phenotypic data, many questions in genetics are difficult or impossible to answer. The Monarch Initiative (https://monarchinitiative.org) provides tools for genotype-phenotype analysis, genomic diagnostics, and precision medicine across broad areas of disease.

Collaboration


Dive into the Nicole Vasilevsky's collaboration.

Top Co-Authors

Avatar
Top Co-Authors

Avatar
Top Co-Authors

Avatar

Christopher J. Mungall

Lawrence Berkeley National Laboratory

View shared research outputs
Top Co-Authors

Avatar

Nicole L. Washington

Lawrence Berkeley National Laboratory

View shared research outputs
Top Co-Authors

Avatar
Top Co-Authors

Avatar
Top Co-Authors

Avatar
Top Co-Authors

Avatar
Researchain Logo
Decentralizing Knowledge