Network


Latest external collaboration on country level. Dive into details by clicking on the dots.

Hotspot


Dive into the research topics where Michael C. Rosenstein is active.

Publication


Featured researches published by Michael C. Rosenstein.


Nucleic Acids Research | 2009

Comparative Toxicogenomics Database: a knowledgebase and discovery tool for chemical–gene–disease networks

Allan Peter Davis; Cynthia G. Murphy; Cynthia Saraceni-Richards; Michael C. Rosenstein; Thomas C. Wiegers; Carolyn J. Mattingly

The Comparative Toxicogenomics Database (CTD) is a curated database that promotes understanding about the effects of environmental chemicals on human health. Biocurators at CTD manually curate chemical–gene interactions, chemical–disease relationships and gene–disease relationships from the literature. This strategy allows data to be integrated to construct chemical–gene–disease networks. CTD is unique in numerous respects: curation focuses on environmental chemicals; interactions are manually curated; interactions are constructed using controlled vocabularies and hierarchies; additional gene attributes (such as Gene Ontology, taxonomy and KEGG pathways) are integrated; data can be viewed from the perspective of a chemical, gene or disease; results and batch queries can be downloaded and saved; and most importantly, CTD acts as both a knowledgebase (by reporting data) and a discovery tool (by generating novel inferences). Over 116 000 interactions between 3900 chemicals and 13 300 genes have been curated from 270 species, and 5900 gene–disease and 2500 chemical–disease direct relationships have been captured. By integrating these data, 350 000 gene–disease relationships and 77 000 chemical–disease relationships can be inferred. This wealth of chemical–gene–disease information yields testable hypotheses for understanding the effects of environmental chemicals on human health. CTD is freely available at http://ctd.mdibl.org.


Database | 2012

MEDIC: a practical disease vocabulary used at the Comparative Toxicogenomics Database

Allan Peter Davis; Thomas C. Wiegers; Michael C. Rosenstein; Carolyn J. Mattingly

The Comparative Toxicogenomics Database (CTD) is a public resource that promotes understanding about the effects of environmental chemicals on human health. CTD biocurators manually curate a triad of chemical–gene, chemical–disease and gene–disease relationships from the scientific literature. The CTD curation paradigm uses controlled vocabularies for chemicals, genes and diseases. To curate disease information, CTD first had to identify a source of controlled terms. Two resources seemed to be good candidates: the Online Mendelian Inheritance in Man (OMIM) and the ‘Diseases’ branch of the National Library of Medicines Medical Subject Headers (MeSH). To maximize the advantages of both, CTD biocurators undertook a novel initiative to map the flat list of OMIM disease terms into the hierarchical nature of the MeSH vocabulary. The result is CTD’s ‘merged disease vocabulary’ (MEDIC), a unique resource that integrates OMIM terms, synonyms and identifiers with MeSH terms, synonyms, definitions, identifiers and hierarchical relationships. MEDIC is both a deep and broad vocabulary, composed of 9700 unique diseases described by more than 67 000 terms (including synonyms). It is freely available to download in various formats from CTD. While neither a true ontology nor a perfect solution, this vocabulary has nonetheless proved to be extremely successful and practical for our biocurators in generating over 2.5 million disease-associated toxicogenomic relationships in CTD. Other external databases have also begun to adopt MEDIC for their disease vocabulary. Here, we describe the construction, implementation, maintenance and use of MEDIC to raise awareness of this resource and to offer it as a putative scaffold in the formal construction of an official disease ontology. Database URL: http://ctd.mdibl.org/voc.go?type=disease


BMC Systems Biology | 2009

Genetic and environmental pathways to complex diseases

Julia M. Gohlke; Reuben Thomas; Yonqing Zhang; Michael C. Rosenstein; Allan Peter Davis; Cynthia G. Murphy; Kevin G. Becker; Carolyn J. Mattingly; Christopher J. Portier

BackgroundPathogenesis of complex diseases involves the integration of genetic and environmental factors over time, making it particularly difficult to tease apart relationships between phenotype, genotype, and environmental factors using traditional experimental approaches.ResultsUsing gene-centered databases, we have developed a network of complex diseases and environmental factors through the identification of key molecular pathways associated with both genetic and environmental contributions. Comparison with known chemical disease relationships and analysis of transcriptional regulation from gene expression datasets for several environmental factors and phenotypes clustered in a metabolic syndrome and neuropsychiatric subnetwork supports our network hypotheses. This analysis identifies natural and synthetic retinoids, antipsychotic medications, Omega 3 fatty acids, and pyrethroid pesticides as potential environmental modulators of metabolic syndrome phenotypes through PPAR and adipocytokine signaling and organophosphate pesticides as potential environmental modulators of neuropsychiatric phenotypes.ConclusionIdentification of key regulatory pathways that integrate genetic and environmental modulators define disease associated targets that will allow for efficient screening of large numbers of environmental factors, screening that could set priorities for further research and guide public health decisions.


BMC Medical Genomics | 2008

The Comparative Toxicogenomics Database facilitates identification and understanding of chemical-gene-disease associations: arsenic as a case study

Allan Peter Davis; Cynthia G. Murphy; Michael C. Rosenstein; Thomas C. Wiegers; Carolyn J. Mattingly

BackgroundThe etiology of many chronic diseases involves interactions between environmental factors and genes that modulate physiological processes. Understanding interactions between environmental chemicals and genes/proteins may provide insights into the mechanisms of chemical actions, disease susceptibility, toxicity, and therapeutic drug interactions. The Comparative Toxicogenomics Database (CTD; http://ctd.mdibl.org) provides these insights by curating and integrating data describing relationships between chemicals, genes/proteins, and human diseases. To illustrate the scope and application of CTD, we present an analysis of curated data for the chemical arsenic. Arsenic represents a major global environmental health threat and is associated with many diseases. The mechanisms by which arsenic modulates these diseases are not well understood.MethodsCurated interactions between arsenic compounds and genes were downloaded using export and batch query tools at CTD. The list of genes was analyzed for molecular interactions, Gene Ontology (GO) terms, KEGG pathway annotations, and inferred disease relationships.ResultsCTD contains curated data from the published literature describing 2,738 molecular interactions between 21 different arsenic compounds and 1,456 genes and proteins. Analysis of these genes and proteins provide insight into the biological functions and molecular networks that are affected by exposure to arsenic, including stress response, apoptosis, cell cycle, and specific protein signaling pathways. Integrating arsenic-gene data with gene-disease data yields a list of diseases that may be associated with arsenic exposure and genes that may explain this association.ConclusionCTD data integration and curation strategies yield insight into the actions of environmental chemicals and provide a basis for developing hypotheses about the molecular mechanisms underlying the etiology of environmental diseases. While many reports describe the molecular response to arsenic, CTD integrates these data with additional curated data sets that facilitate construction of chemical-gene-disease networks and provide the groundwork for investigating the molecular basis of arsenic-associated diseases or toxicity. The analysis reported here is extensible to any environmental chemical or therapeutic drug.


Pharmacogenomics Journal | 2004

Promoting comparative molecular studies in environmental health research: an overview of the comparative toxicogenomics database (CTD)

Carolyn J. Mattingly; Glenn T. Colby; Michael C. Rosenstein; John N. Forrest; James L. Boyer

Promoting comparative molecular studies in environmental health research: an overview of the comparative toxicogenomics database (CTD)


PLOS ONE | 2012

Ranking Transitive Chemical-Disease Inferences Using Local Network Topology in the Comparative Toxicogenomics Database

Benjamin L. King; Allan Peter Davis; Michael C. Rosenstein; Thomas C. Wiegers; Carolyn J. Mattingly

Exposure to chemicals in the environment is believed to play a critical role in the etiology of many human diseases. To enhance understanding about environmental effects on human health, the Comparative Toxicogenomics Database (CTD; http://ctdbase.org) provides unique curated data that enable development of novel hypotheses about the relationships between chemicals and diseases. CTD biocurators read the literature and curate direct relationships between chemicals-genes, genes-diseases, and chemicals-diseases. These direct relationships are then computationally integrated to create additional inferred relationships; for example, a direct chemical-gene statement can be combined with a direct gene-disease statement to generate a chemical-disease inference (inferred via the shared gene). In CTD, the number of inferences has increased exponentially as the number of direct chemical, gene and disease interactions has grown. To help users navigate and prioritize these inferences for hypothesis development, we implemented a statistic to score and rank them based on the topology of the local network consisting of the chemical, disease and each of the genes used to make an inference. In this network, chemicals, diseases and genes are nodes connected by edges representing the curated interactions. Like other biological networks, node connectivity is an important consideration when evaluating the CTD network, as the connectivity of nodes follows the power-law distribution. Topological methods reduce the influence of highly connected nodes that are present in biological networks. We evaluated published methods that used local network topology to determine the reliability of protein–protein interactions derived from high-throughput assays. We developed a new metric that combines and weights two of these methods and uniquely takes into account the number of common neighbors and the connectivity of each entity involved. We present several CTD inferences as case studies to demonstrate the value of this metric and the biological relevance of the inferences.


Bioinformation | 2011

DiseaseComps: a metric that discovers similar diseases based upon common toxicogenomic profiles at CTD

Allan Peter Davis; Michael C. Rosenstein; Thomas C. Wiegers; Carolyn J. Mattingly

The Comparative Toxicogenomics Database (CTD) is a free resource that describes chemical-gene-disease networks to help understand the effects of environmental exposures on human health. The database contains more than 13,500 chemical-disease and 14,200 gene-disease interactions. In CTD, chemicals and genes are associated with a disease via two types of relationships: as a biomarker or molecular mechanism for the disease (M-type) or as a real or putative therapy for the disease (T-type). We leveraged these curated datasets to compute similarity indices that can be used to produce lists of comparable diseases (“DiseaseComps”) based upon shared toxicogenomic profiles. This new metric now classifies diseases with common molecular characteristics, instead of the traditional approach of using histology or tissue of origin to define the disorder. In the dawning era of “personalized medicine”, this feature provides a new way to view and describe diseases and will help develop testable hypotheses about chemical-gene-disease networks. Availability The database is available for free at http://ctd.mdibl.org/


Bioinformation | 2009

GeneComps and ChemComps: a new CTD metric to identify genes and chemicals with shared toxicogenomic profiles.

Allan Peter Davis; Cynthia G. Murphy; Cynthia Saraceni-Richards; Michael C. Rosenstein; Thomas C. Wiegers; Thomas H. Hampton; Carolyn J. Mattingly

The Comparative Toxicogenomics Database is a public resource that promotes understanding about the effects of environmental chemicals on human health. Currently, CTD describes over 184,000 molecular interactions for more than 5,100 chemicals and 16,300 genes/proteins. We have leveraged this dataset of chemical-gene relationships to compute similarity indices following the statistical method of the Jaccard index. These scores are used to produce lists of comparable genes (“GeneComps”) or chemicals (“ChemComps”) based on shared toxicogenomic profiles. GeneComps and ChemComps are now provided for every curated gene and chemical in CTD. ChemComps are particularly significant because they provide a way to group chemicals based upon their biological effects, instead of their physical or structural properties. These metrics provide a novel way to view and classify genes and chemicals and will help advance testable hypotheses about environmental chemical-genedisease networks. Availability CTD is freely available at http://ctd.mdibl.org/


Database | 2012

Targeted journal curation as a method to improve data currency at the Comparative Toxicogenomics Database

Allan Peter Davis; Robin J. Johnson; Kelley Lennon-Hopkins; Daniela Sciaky; Michael C. Rosenstein; Thomas C. Wiegers; Carolyn J. Mattingly

The Comparative Toxicogenomics Database (CTD) is a public resource that promotes understanding about the effects of environmental chemicals on human health. CTD biocurators read the scientific literature and manually curate a triad of chemical–gene, chemical–disease and gene–disease interactions. Typically, articles for CTD are selected using a chemical-centric approach by querying PubMed to retrieve a corpus containing the chemical of interest. Although this technique ensures adequate coverage of knowledge about the chemical (i.e. data completeness), it does not necessarily reflect the most current state of all toxicological research in the community at large (i.e. data currency). Keeping databases current with the most recent scientific results, as well as providing a rich historical background from legacy articles, is a challenging process. To address this issue of data currency, CTD designed and tested a journal-centric approach of curation to complement our chemical-centric method. We first identified priority journals based on defined criteria. Next, over 7 weeks, three biocurators reviewed 2425 articles from three consecutive years (2009–2011) of three targeted journals. From this corpus, 1252 articles contained relevant data for CTD and 52 752 interactions were manually curated. Here, we describe our journal selection process, two methods of document delivery for the biocurators and the analysis of the resulting curation metrics, including data currency, and both intra-journal and inter-journal comparisons of research topics. Based on our results, we expect that curation by select journals can (i) be easily incorporated into the curation pipeline to complement our chemical-centric approach; (ii) build content more evenly for chemicals, genes and diseases in CTD (rather than biasing data by chemicals-of-interest); (iii) reflect developing areas in environmental health and (iv) improve overall data currency for chemicals, genes and diseases. Database URL: http://ctdbase.org/


npj Regenerative Medicine | 2018

RegenDbase: a comparative database of noncoding RNA regulation of tissue regeneration circuits across multiple taxa

Benjamin L. King; Michael C. Rosenstein; Ashley M. Smith; Christina A. Dykeman; Grace A. Smith; Viravuth P. Yin

Regeneration is an endogenous process of tissue repair that culminates in complete restoration of tissue and organ function. While regenerative capacity in mammals is limited to select tissues, lower vertebrates like zebrafish and salamanders are endowed with the capacity to regenerate entire limbs and most adult tissues, including heart muscle. Numerous profiling studies have been conducted using these research models in an effort to identify the genetic circuits that accompany tissue regeneration. Most of these studies, however, are confined to an individual injury model and/or research organism and focused primarily on protein encoding transcripts. Here we describe RegenDbase, a new database with the functionality to compare and contrast gene regulatory pathways within and across tissues and research models. RegenDbase combines pipelines that integrate analysis of noncoding RNAs in combination with protein encoding transcripts. We created RegenDbase with a newly generated comprehensive dataset for adult zebrafish heart regeneration combined with existing microarray and RNA-sequencing studies on multiple injured tissues. In this current release, we detail microRNA–mRNA regulatory circuits and the biological processes these interactions control during the early stages of heart regeneration. Moreover, we identify known and putative novel lncRNAs and identify their potential target genes based on proximity searches. We postulate that these candidate factors underscore robust regenerative capacity in lower vertebrates. RegenDbase provides a systems-level analysis of tissue regeneration genetic circuits across injury and animal models and addresses the growing need to understand how noncoding RNAs influence these changes in gene expression.Software: Online tool allows cross-species comparisons of regenerationA new online resource allows researchers to probe the genomic circuitry that underpins tissue regeneration across a range of injury and animal models. A team led by Benjamin King from the University of Maine in Orono and Viravuth Yin from the MDI Biological Laboratory in Bar Harbor, Maine, USA, created a tool called RegenDbase, the Comparative Models of Regeneration Database. It incorporates gene expression datasets for protein-coding genes and regulatory RNA molecules from zebrafish, axolotl salamanders and mice. As a proof of principle, the researchers used RegenDbase to find RNAs common to heart regeneration in both neonatal mice and zebrafish using an extensive new zebrafish dataset. Twenty-eight new zebrafish regulatory RNA molecules were identified using those data. Future incorporation of datasets from other organisms and human tissues will enable broader cross-species comparisons of regenerative biology.

Collaboration


Dive into the Michael C. Rosenstein's collaboration.

Top Co-Authors

Avatar

Carolyn J. Mattingly

North Carolina State University

View shared research outputs
Top Co-Authors

Avatar

Allan Peter Davis

North Carolina State University

View shared research outputs
Top Co-Authors

Avatar

Thomas C. Wiegers

North Carolina State University

View shared research outputs
Top Co-Authors

Avatar

Cynthia G. Murphy

Mount Desert Island Biological Laboratory

View shared research outputs
Top Co-Authors

Avatar

Glenn T. Colby

Mount Desert Island Biological Laboratory

View shared research outputs
Top Co-Authors

Avatar
Top Co-Authors

Avatar
Top Co-Authors

Avatar

Benjamin L. King

Mount Desert Island Biological Laboratory

View shared research outputs
Top Co-Authors

Avatar

Cynthia Saraceni-Richards

Mount Desert Island Biological Laboratory

View shared research outputs
Top Co-Authors

Avatar

Ashley M. Smith

Mount Desert Island Biological Laboratory

View shared research outputs
Researchain Logo
Decentralizing Knowledge