Network


Latest external collaboration on country level. Dive into details by clicking on the dots.

Hotspot


Dive into the research topics where Jeremy J. Yang is active.

Publication


Featured researches published by Jeremy J. Yang.


Molecular Informatics | 2011

Associating Drugs, Targets and Clinical Outcomes into an Integrated Network Affords a New Platform for Computer-Aided Drug Repurposing

Tudor I. Oprea; Sonny Kim Nielsen; Oleg Ursu; Jeremy J. Yang; Olivier Taboureau; Stephen L. Mathias; Irene Kouskoumvekaki; Larry A. Sklar; Cristian G. Bologa

Finding new uses for old drugs is a strategy embraced by the pharmaceutical industry, with increasing participation from the academic sector. Drug repurposing efforts focus on identifying novel modes of action, but not in a systematic manner. With intensive data mining and curation, we aim to apply bio‐ and cheminformatics tools using the DRUGS database, containing 3837 unique small molecules annotated on 1750 proteins. These are likely to serve as drug targets and antitargets (i.e., associated with side effects, SE). The academic community, the pharmaceutical sector and clinicians alike could benefit from an integrated, semantic‐web compliant computer‐aided drug repurposing (CADR) effort, one that would enable deep data mining of associations between approved drugs (D), targets (T), clinical outcomes (CO) and SE. We report preliminary results from text mining and multivariate statistics, based on 7684 approved drug labels, ADL (Dailymed) via text mining. From the ADL corresponding to 988 unique drugs, the “adverse reactions” section was mapped onto 174 SE, then clustered via principal component analysis into a 5×5 self‐organizing map that was integrated into a Cytoscape network of SE‐D‐T‐CO. This type of data can be used to streamline drug repurposing and may result in novel insights that can lead to the identification of novel drug actions.


Nucleic Acids Research | 2017

DrugCentral: online drug compendium

Oleg Ursu; Jayme Holmes; Jeffrey Knockel; Cristian G. Bologa; Jeremy J. Yang; Stephen L. Mathias; Stuart J. Nelson; Tudor I. Oprea

DrugCentral (http://drugcentral.org) is an open-access online drug compendium. DrugCentral integrates structure, bioactivity, regulatory, pharmacologic actions and indications for active pharmaceutical ingredients approved by FDA and other regulatory agencies. Monitoring of regulatory agencies for new drugs approvals ensures the resource is up-to-date. DrugCentral integrates content for active ingredients with pharmaceutical formulations, indexing drugs and drug label annotations, complementing similar resources available online. Its complementarity with other online resources is facilitated by cross referencing to external resources. At the molecular level, DrugCentral bridges drug-target interactions with pharmacological action and indications. The integration with FDA drug labels enables text mining applications for drug adverse events and clinical trial information. Chemical structure overlap between DrugCentral and five online drug resources, and the overlap between DrugCentral FDA-approved drugs and their presence in four different chemical collections, are discussed. DrugCentral can be accessed via the web application or downloaded in relational database format.


Journal of Cheminformatics | 2016

Badapple: promiscuity patterns from noisy evidence

Jeremy J. Yang; Oleg Ursu; Christopher A. Lipinski; Larry A. Sklar; Tudor I. Oprea; Cristian G. Bologa

BackgroundBioassay data analysis continues to be an essential, routine, yet challenging task in modern drug discovery and chemical biology research. The challenge is to infer reliable knowledge from big and noisy data. Some aspects of this problem are general with solutions informed by existing and emerging data science best practices. Some aspects are domain specific, and rely on expertise in bioassay methodology and chemical biology. Testing compounds for biological activity requires complex and innovative methodology, producing results varying widely in accuracy, precision, and information content. Hit selection criteria involve optimizing such that the overall probability of success in a project is maximized, and resource-wasteful “false trails” are avoided. This “fail-early” approach is embraced both in pharmaceutical and academic drug discovery, since follow-up capacity is resource-limited. Thus, early identification of likely promiscuous compounds has practical value.ResultsHere we describe an algorithm for identifying likely promiscuous compounds via associated scaffolds which combines general and domain-specific features to assist and accelerate drug discovery informatics, called Badapple: bioassay-data associative promiscuity pattern learning engine. Results are described from an analysis using data from MLP assays via the BioAssay Research Database (BARD) http://bard.nih.gov. Specific examples are analyzed in the context of medicinal chemistry, to illustrate associations with mechanisms of promiscuity. Badapple has been developed at UNM, released and deployed for public use two ways: (1) BARD plugin, integrated into the public BARD REST API and BARD web client; and (2) public web app hosted at UNM.ConclusionsBadapple is a method for rapidly identifying likely promiscuous compounds via associated scaffolds. Badapple generates a score associated with a pragmatic, empirical definition of promiscuity, with the overall goal to identify “false trails” and streamline workflows. Unlike methods reliant on expert curation of chemical substructure patterns, Badapple is fully evidence-driven, automated, self-improving via integration of additional data, and focused on scaffolds. Badapple is robust with respect to noise and errors, and skeptical of scanty evidence.


Journal of Biomolecular Screening | 2014

An Overview of the Challenges in Designing, Integrating, and Delivering BARD: A Public Chemical-Biology Resource and Query Portal for Multiple Organizations, Locations, and Disciplines

Andrea de Souza; Joshua Bittker; David L. Lahr; Steve Brudz; Simon Chatwin; Tudor I. Oprea; Anna Waller; Jeremy J. Yang; Noel Southall; Rajarshi Guha; Stephan C. Schürer; Uma D. Vempati; Mark R. Southern; Eric S. Dawson; Paul A. Clemons; Thomas Dy Chung

Recent industry–academic partnerships involve collaboration among disciplines, locations, and organizations using publicly funded “open-access” and proprietary commercial data sources. These require the effective integration of chemical and biological information from diverse data sources, which presents key informatics, personnel, and organizational challenges. The BioAssay Research Database (BARD) was conceived to address these challenges and serve as a community-wide resource and intuitive web portal for public-sector chemical-biology data. Its initial focus is to enable scientists to more effectively use the National Institutes of Health Roadmap Molecular Libraries Program (MLP) data generated from the 3-year pilot and 6-year production phases of the Molecular Libraries Probe Production Centers Network (MLPCN), which is currently in its final year. BARD evolves the current data standards through structured assay and result annotations that leverage BioAssay Ontology and other industry-standard ontologies, and a core hierarchy of assay definition terms and data standards defined specifically for small-molecule assay data. We initially focused on migrating the highest-value MLP data into BARD and bringing it up to this new standard. We review the technical and organizational challenges overcome by the interdisciplinary BARD team, veterans of public- and private-sector data-integration projects, who are collaborating to describe (functional specifications), design (technical specifications), and implement this next-generation software solution.


Nucleic Acids Research | 2017

Pharos: Collating protein information to shed light on the druggable genome

Dac-Trung Nguyen; Stephen L. Mathias; Cristian G. Bologa; Søren Brunak; Nicolas F. Fernandez; Anna Gaulton; Anne Hersey; Jayme Holmes; Lars Juhl Jensen; Anneli Karlsson; Guixia Liu; Avi Ma'ayan; Geetha Mandava; Subramani Mani; Saurabh Mehta; John P. Overington; Juhee Patel; Andrew D. Rouillard; Stephan C. Schürer; Timothy Sheils; Anton Simeonov; Larry A. Sklar; Noel Southall; Oleg Ursu; Dušica Vidovic; Anna Waller; Jeremy J. Yang; Ajit Jadhav; Tudor I. Oprea; Rajarshi Guha

The ‘druggable genome’ encompasses several protein families, but only a subset of targets within them have attracted significant research attention and thus have information about them publicly available. The Illuminating the Druggable Genome (IDG) program was initiated in 2014, has the goal of developing experimental techniques and a Knowledge Management Center (KMC) that would collect and organize information about protein targets from four families, representing the most common druggable targets with an emphasis on understudied proteins. Here, we describe two resources developed by the KMC: the Target Central Resource Database (TCRD) which collates many heterogeneous gene/protein datasets and Pharos (https://pharos.nih.gov), a multimodal web interface that presents the data from TCRD. We briefly describe the types and sources of data considered by the KMC and then highlight features of the Pharos interface designed to enable intuitive access to the IDG knowledgebase. The aim of Pharos is to encourage ‘serendipitous browsing’, whereby related, relevant information is made easily discoverable. We conclude by describing two use cases that highlight the utility of Pharos and TCRD.


Database | 2013

The CARLSBAD Database: A Confederated Database of Chemical Bioactivities

Stephen L. Mathias; Jarrett Hines-Kay; Jeremy J. Yang; Gergely Zahoransky-Kohalmi; Cristian G. Bologa; Oleg Ursu; Tudor I. Oprea

Many bioactivity databases offer information regarding the biological activity of small molecules on protein targets. Information in these databases is often hard to resolve with certainty because of subsetting different data in a variety of formats; use of different bioactivity metrics; use of different identifiers for chemicals and proteins; and having to access different query interfaces, respectively. Given the multitude of data sources, interfaces and standards, it is challenging to gather relevant facts and make appropriate connections and decisions regarding chemical–protein associations. The CARLSBAD database has been developed as an integrated resource, focused on high-quality subsets from several bioactivity databases, which are aggregated and presented in a uniform manner, suitable for the study of the relationships between small molecules and targets. In contrast to data collection resources, CARLSBAD provides a single normalized activity value of a given type for each unique chemical–protein target pair. Two types of scaffold perception methods have been implemented and are available for datamining: HierS (hierarchical scaffolds) and MCES (maximum common edge subgraph). The 2012 release of CARLSBAD contains 439 985 unique chemical structures, mapped onto 1,420 889 unique bioactivities, and annotated with 277 140 HierS scaffolds and 54 135 MCES chemical patterns, respectively. Of the 890 323 unique structure–target pairs curated in CARLSBAD, 13.95% are aggregated from multiple structure–target values: 94 975 are aggregated from two bioactivities, 14 544 from three, 7 930 from four and 2214 have five bioactivities, respectively. CARLSBAD captures bioactivities and tags for 1435 unique chemical structures of active pharmaceutical ingredients (i.e. ‘drugs’). CARLSBAD processing resulted in a net 17.3% data reduction for chemicals, 34.3% reduction for bioactivities, 23% reduction for HierS and 25% reduction for MCES, respectively. The CARLSBAD database supports a knowledge mining system that provides non-specialists with novel integrative ways of exploring chemical biology space to facilitate knowledge mining in drug discovery and repurposing. Database URL: http://carlsbad.health.unm.edu/carlsbad/.


Nucleic Acids Research | 2015

BioAssay Research Database (BARD): chemical biology and probe-development enabled by structured metadata and result types.

E. A. Howe; A. de Souza; David L. Lahr; S. Chatwin; Philip Montgomery; Benjamin Alexander; Dac-Trung Nguyen; Yasel Cruz; D. A. Stonich; G. Walzer; J. T. Rose; S. C. Picard; Zihan Liu; J. N. Rose; X. Xiang; Jacob K. Asiedu; D. Durkin; J. Levine; Jeremy J. Yang; Stephan C. Schürer; John C. Braisted; Noel Southall; Mark R. Southern; Thomas Dy Chung; Steve Brudz; Cordelle Tanega; Stuart L. Schreiber; Joshua Bittker; Rajarshi Guha; Paul A. Clemons

BARD, the BioAssay Research Database (https://bard.nih.gov/) is a public database and suite of tools developed to provide access to bioassay data produced by the NIH Molecular Libraries Program (MLP). Data from 631 MLP projects were migrated to a new structured vocabulary designed to capture bioassay data in a formalized manner, with particular emphasis placed on the description of assay protocols. New data can be submitted to BARD with a user-friendly set of tools that assist in the creation of appropriately formatted datasets and assay definitions. Data published through the BARD application program interface (API) can be accessed by researchers using web-based query tools or a desktop client. Third-party developers wishing to create new tools can use the API to produce stand-alone tools or new plug-ins that can be integrated into BARD. The entire BARD suite of tools therefore supports three classes of researcher: those who wish to publish data, those who wish to mine data for testable hypotheses, and those in the developer community who wish to build tools that leverage this carefully curated chemical biology resource.


PLOS ONE | 2015

Novel Phenotypic Outcomes Identified for a Public Collection of Approved Drugs from a Publicly Accessible Panel of Assays

Jonathan A. Lee; Paul Shinn; Susan Jaken; Sarah Oliver; Francis S. Willard; Steven A. Heidler; Robert B. Peery; Jennifer Oler; Shaoyou Chu; Noel Southall; Thomas S. Dexheimer; Jeffrey K. Smallwood; Ruili Huang; Rajarshi Guha; Ajit Jadhav; Karen L. Cox; Christopher P. Austin; Anton Simeonov; G. Sitta Sittampalam; Saba Husain; Natalie Franklin; David J. Wild; Jeremy J. Yang; Jeffrey J. Sutherland; Craig J. Thomas

Phenotypic assays have a proven track record for generating leads that become first-in-class therapies. Whole cell assays that inform on a phenotype or mechanism also possess great potential in drug repositioning studies by illuminating new activities for the existing pharmacopeia. The National Center for Advancing Translational Sciences (NCATS) pharmaceutical collection (NPC) is the largest reported collection of approved small molecule therapeutics that is available for screening in a high-throughput setting. Via a wide-ranging collaborative effort, this library was analyzed in the Open Innovation Drug Discovery (OIDD) phenotypic assay modules publicly offered by Lilly. The results of these tests are publically available online at www.ncats.nih.gov/expertise/preclinical/pd2 and via the PubChem Database (https://pubchem.ncbi.nlm.nih.gov/) (AID 1117321). Phenotypic outcomes for numerous drugs were confirmed, including sulfonylureas as insulin secretagogues and the anti-angiogenesis actions of multikinase inhibitors sorafenib, axitinib and pazopanib. Several novel outcomes were also noted including the Wnt potentiating activities of rotenone and the antifolate class of drugs, and the anti-angiogenic activity of cetaben.


Nature Reviews Drug Discovery | 2018

Unexplored therapeutic opportunities in the human genome

Tudor I. Oprea; Cristian G. Bologa; Søren Brunak; Allen Campbell; Gregory Gan; Anna Gaulton; Shawn M. Gomez; Rajarshi Guha; Anne Hersey; Jayme Holmes; Ajit Jadhav; Lars Juhl Jensen; Gary L. Johnson; Anneli Karlson; Andrew R. Leach; Avi Ma'ayan; Anna Malovannaya; Subramani Mani; Stephen L. Mathias; Michael T. McManus; Terrence F. Meehan; Christian von Mering; Daniel Muthas; Dac Trung Nguyen; John P. Overington; George Papadatos; Jun Qin; Christian Reich; Bryan L. Roth; Stephan C. Schürer

A large proportion of biomedical research and the development of therapeutics is focused on a small fraction of the human genome. In a strategic effort to map the knowledge gaps around proteins encoded by the human genome and to promote the exploration of currently understudied, but potentially druggable, proteins, the US National Institutes of Health launched the Illuminating the Druggable Genome (IDG) initiative in 2014. In this article, we discuss how the systematic collection and processing of a wide array of genomic, proteomic, chemical and disease-related resource data by the IDG Knowledge Management Center have enabled the development of evidence-based criteria for tracking the target development level (TDL) of human proteins, which indicates a substantial knowledge deficit for approximately one out of three proteins in the human proteome. We then present spotlights on the TDL categories as well as key drug target classes, including G protein-coupled receptors, protein kinases and ion channels, which illustrate the nature of the unexplored opportunities for biomedical research and therapeutic development.


Journal of Biomedical Semantics | 2017

Drug target ontology to classify and integrate drug discovery data

Yu Lin; Saurabh Mehta; John Paul Turner; Dušica Vidovic; Michele Forlin; Amar Koleti; Dac Trung Nguyen; Lars Juhl Jensen; Rajarshi Guha; Stephen L. Mathias; Oleg Ursu; Vasileios Stathias; Jianbin Duan; Nooshin Nabizadeh; Caty Chung; Christopher Mader; Ubbo Visser; Jeremy J. Yang; Cristian G. Bologa; Tudor I. Oprea; Stephan C. Schürer

BackgroundOne of the most successful approaches to develop new small molecule therapeutics has been to start from a validated druggable protein target. However, only a small subset of potentially druggable targets has attracted significant research and development resources. The Illuminating the Druggable Genome (IDG) project develops resources to catalyze the development of likely targetable, yet currently understudied prospective drug targets. A central component of the IDG program is a comprehensive knowledge resource of the druggable genome.ResultsAs part of that effort, we have developed a framework to integrate, navigate, and analyze drug discovery data based on formalized and standardized classifications and annotations of druggable protein targets, the Drug Target Ontology (DTO). DTO was constructed by extensive curation and consolidation of various resources. DTO classifies the four major drug target protein families, GPCRs, kinases, ion channels and nuclear receptors, based on phylogenecity, function, target development level, disease association, tissue expression, chemical ligand and substrate characteristics, and target-family specific characteristics. The formal ontology was built using a new software tool to auto-generate most axioms from a database while supporting manual knowledge acquisition. A modular, hierarchical implementation facilitate ontology development and maintenance and makes use of various external ontologies, thus integrating the DTO into the ecosystem of biomedical ontologies. As a formal OWL-DL ontology, DTO contains asserted and inferred axioms. Modeling data from the Library of Integrated Network-based Cellular Signatures (LINCS) program illustrates the potential of DTO for contextual data integration and nuanced definition of important drug target characteristics. DTO has been implemented in the IDG user interface Portal, Pharos and the TIN-X explorer of protein target disease relationships.ConclusionsDTO was built based on the need for a formal semantic model for druggable targets including various related information such as protein, gene, protein domain, protein structure, binding site, small molecule drug, mechanism of action, protein tissue localization, disease association, and many other types of information. DTO will further facilitate the otherwise challenging integration and formal linking to biological assays, phenotypes, disease models, drug poly-pharmacology, binding kinetics and many other processes, functions and qualities that are at the core of drug discovery. The first version of DTO is publically available via the website http://drugtargetontology.org/, Github (http://github.com/DrugTargetOntology/DTO), and the NCBO Bioportal (http://bioportal.bioontology.org/ontologies/DTO). The long-term goal of DTO is to provide such an integrative framework and to populate the ontology with this information as a community resource.

Collaboration


Dive into the Jeremy J. Yang's collaboration.

Top Co-Authors

Avatar
Top Co-Authors

Avatar

Tudor I. Oprea

University of New Mexico

View shared research outputs
Top Co-Authors

Avatar

Oleg Ursu

University of New Mexico

View shared research outputs
Top Co-Authors

Avatar
Top Co-Authors

Avatar

Rajarshi Guha

Pennsylvania State University

View shared research outputs
Top Co-Authors

Avatar
Top Co-Authors

Avatar

Jayme Holmes

University of New Mexico

View shared research outputs
Top Co-Authors

Avatar

Larry A. Sklar

University of New Mexico

View shared research outputs
Top Co-Authors

Avatar

Noel Southall

National Institutes of Health

View shared research outputs
Top Co-Authors

Avatar
Researchain Logo
Decentralizing Knowledge