Margot Sunshine
SRA International
Network
Latest external collaboration on country level. Dive into details by clicking on the dots.
Publication
Featured researches published by Margot Sunshine.
BMC Genomics | 2009
Uma Shankavaram; Sudhir Varma; David Kane; Margot Sunshine; Krishna K Chary; William C. Reinhold; Yves Pommier; John N. Weinstein
BackgroundAdvances in the high-throughput omic technologies have made it possible to profile cells in a large number of ways at the DNA, RNA, protein, chromosomal, functional, and pharmacological levels. A persistent problem is that some classes of molecular data are labeled with gene identifiers, others with transcript or protein identifiers, and still others with chromosomal locations. What has lagged behind is the ability to integrate the resulting data to uncover complex relationships and patterns. Those issues are reflected in full form by molecular profile data on the panel of 60 diverse human cancer cell lines (the NCI-60) used since 1990 by the U.S. National Cancer Institute to screen compounds for anticancer activity. To our knowledge, CellMiner is the first online database resource for integration of the diverse molecular types of NCI-60 and related meta data.DescriptionCellMiner enables scientists to perform advanced querying of molecular information on NCI-60 (and additional types) through a single web interface. CellMiner is a freely available tool that organizes and stores raw and normalized data that represent multiple types of molecular characterizations at the DNA, RNA, protein, and pharmacological levels. Annotations for each project, along with associated metadata on the samples and datasets, are stored in a MySQL database and linked to the molecular profile data. Data can be queried and downloaded along with comprehensive information on experimental and analytic methods for each data set. A Data Intersection tool allows selection of a list of genes (proteins) in common between two or more data sets and outputs the data for those genes (proteins) in the respective sets. In addition to its role as an integrative resource for the NCI-60, the CellMiner package also serves as a shell for incorporation of molecular profile data on other cell or tissue sample types.ConclusionCellMiner is a relational database tool for storing, querying, integrating, and downloading molecular profile data on the NCI-60 and other cancer cell types. More broadly, it provides a template to use in providing such functionality for other molecular profile data generated by academic institutions, public projects, or the private sector. CellMiner is available online at http://discover.nci.nih.gov/cellminer/.
PLOS ONE | 2012
Kurt W. Kohn; Barry R. Zeeberg; William C. Reinhold; Margot Sunshine; Augustin Luna; Yves Pommier
Although there is extensive information on gene expression and molecular interactions in various cell types, integrating those data in a functionally coherent manner remains challenging. This study explores the premise that genes whose expression at the mRNA level is correlated over diverse cell lines are likely to function together in a network of molecular interactions. We previously derived expression-correlated gene clusters from the database of the NCI-60 human tumor cell lines and associated each cluster with function categories of the Gene Ontology (GO) database. From a cluster rich in genes associated with GO categories related to cell migration, we extracted 15 genes that were highly cross-correlated; prominent among them were RRAS, AXL, ADAM9, FN14, and integrin-beta1. We then used those 15 genes as bait to identify other correlated genes in the NCI-60 database. A survey of current literature disclosed, not only that many of the expression-correlated genes engaged in molecular interactions related to migration, invasion, and metastasis, but that highly cross-correlated subsets of those genes engaged in specific cell migration processes. We assembled this information in molecular interaction maps (MIMs) that depict networks governing 3 cell migration processes: degradation of extracellular matrix, production of transient focal complexes at the leading edge of the cell, and retraction of the rear part of the cell. Also depicted are interactions controlling the release and effects of calcium ions, which may regulate migration in a spaciotemporal manner in the cell. The MIMs and associated text comprise a detailed and integrated summary of what is currently known or surmised about the role of the expression cross-correlated genes in molecular networks governing those processes.
Clinical Cancer Research | 2015
William C. Reinhold; Margot Sunshine; Sudhir Varma; James H. Doroshow; Yves Pommier
The NCI-60 cancer cell line panel provides a premier model for data integration, and systems pharmacology being the largest publicly available database of anticancer drug activity, genomic, molecular, and phenotypic data. It comprises gene expression (25,722 transcripts), microRNAs (360 miRNAs), whole-genome DNA copy number (23,413 genes), whole-exome sequencing (variants for 16,568 genes), protein levels (94 genes), and cytotoxic activity (20,861 compounds). Included are 158 FDA-approved drugs and 79 that are in clinical trials. To improve data accessibility to bioinformaticists and non-bioinformaticists alike, we have developed the CellMiner web–based tools. Here, we describe the newest CellMiner version, including integration of novel databases and tools associated with whole-exome sequencing and protein expression, and review the tools. Included are (i) “Cell line signature” for DNA, RNA, protein, and drugs; (ii) “Cross correlations” for up to 150 input genes, microRNAs, and compounds in a single query; (iii) “Pattern comparison” to identify connections among drugs, gene expression, genomic variants, microRNA, and protein expressions; (iv) “Genetic variation versus drug visualization” to identify potential new drug:gene DNA variant relationships; and (v) “Genetic variant summation” designed to provide a synopsis of mutational burden on any pathway or gene group for up to 150 genes. Together, these tools allow users to flexibly query the NCI-60 data for potential relationships between genomic, molecular, and pharmacologic parameters in a manner specific to the users area of expertise. Examples for both gain- (RAS) and loss-of-function (PTEN) alterations are provided. Clin Cancer Res; 21(17); 3841–52. ©2015 AACR.
BMC Bioinformatics | 2011
Augustin Luna; Margot Sunshine; Lucas Chang; Ruth Nussinov; Mirit I Aladjem; Kurt W. Kohn
BackgroundThe Molecular Interaction Map (MIM) notation offers a standard set of symbols and rules on their usage for the depiction of cellular signaling network diagrams. Such diagrams are essential for disseminating biological information in a concise manner. A lack of software tools for the notation restricts wider usage of the notation. Development of software is facilitated by a more detailed specification regarding software requirements than has previously existed for the MIM notation.ResultsA formal implementation of the MIM notation was developed based on a core set of previously defined glyphs. This implementation provides a detailed specification of the properties of the elements of the MIM notation. Building upon this specification, a machine-readable format is provided as a standardized mechanism for the storage and exchange of MIM diagrams. This new format is accompanied by a Java-based application programming interface to help software developers to integrate MIM support into software projects. A validation mechanism is also provided to determine whether MIM datasets are in accordance with syntax rules provided by the new specification.ConclusionsThe work presented here provides key foundational components to promote software development for the MIM notation. These components will speed up the development of interoperable tools supporting the MIM notation and will aid in the translation of data stored in MIM diagrams to other standardized formats. Several projects utilizing this implementation of the notation are outlined herein. The MIM specification is available as an additional file to this publication. Source code, libraries, documentation, and examples are available at http://discover.nci.nih.gov/mim.
Cancer Research | 2017
William C. Reinhold; Sudhir Varma; Margot Sunshine; Vinodh N. Rajapakse; Augustin Luna; Kurt W. Kohn; Holly Stevenson; Yonghong Wang; Holger Heyn; Vanesa Nogales; Sebastian Moran; David J. Goldstein; James H. Doroshow; Paul S. Meltzer; Manel Esteller; Yves Pommier
A unique resource for systems pharmacology and genomic studies is the NCI-60 cancer cell line panel, which provides data for the largest publicly available library of compounds with cytotoxic activity (∼21,000 compounds), including 108 FDA-approved and 70 clinical trial drugs as well as genomic data, including whole-exome sequencing, gene and miRNA transcripts, DNA copy number, and protein levels. Here, we provide the first readily usable genome-wide DNA methylation database for the NCI-60, including 485,577 probes from the Infinium HumanMethylation450k BeadChip array, which yielded DNA methylation signatures for 17,559 genes integrated into our open access CellMiner version 2.0 (https://discover.nci.nih.gov/cellminer). Among new insights, transcript versus DNA methylation correlations revealed the epithelial/mesenchymal gene functional category as being influenced most heavily by methylation. DNA methylation and copy number integration with transcript levels yielded an assessment of their relative influence for 15,798 genes, including tumor suppressor, mitochondrial, and mismatch repair genes. Four forms of molecular data were combined, providing rationale for microsatellite instability for 8 of the 9 cell lines in which it occurred. Individual cell line analyses showed global methylome patterns with overall methylation levels ranging from 17% to 84%. A six-gene model, including PARP1, EP300, KDM5C, SMARCB1, and UHRF1 matched this pattern. In addition, promoter methylation of two translationally relevant genes, Schlafen 11 (SLFN11) and methylguanine methyltransferase (MGMT), served as indicators of therapeutic resistance or susceptibility, respectively. Overall, our database provides a resource of pharmacologic data that can reinforce known therapeutic strategies and identify novel drugs and drug targets across multiple cancer types. Cancer Res; 77(3); 601-12. ©2016 AACR.
Cancer Research | 2012
William C. Reinhold; Margot Sunshine; Sudir Varma; H Liu; John N. Weinstein; Joel Morris; James H. Doroshow; Yves Pommier
Proceedings: AACR 103rd Annual Meeting 2012‐‐ Mar 31‐Apr 4, 2012; Chicago, IL High-throughput data is increasingly being integrated into the fields of biology, molecular biology, and pharmacology. However, a difficult problem has been the rapid and fluid access to and integration of this data, which tends to reside in huge unwieldy databases. One set of cell lines with substantial potential for benefit from this type of access and integration is the NCI-60 cancerous cell lines. We present here a set of tools within our CellMiner web-application designed to address this need for the areas of transcript expression, microRNA expression, gene DNA copy number, and drug activity. CellMiner allows the user to rapidly access data for relative levels of transcript expression for 18,532 genes, 360 microRNAs, and 16,861 compounds including 91 Food and Drug Administration (FDA)-approved drugs. These levels in turn create patterns across the NCI-60 that can be compared to one another using our “pattern match” tool. Together, these tools allow one to query the data for potential relationships between these parameters, in a manner specific to a users area of expertise and interest, in a rapid and flexible manner without the need for expertise in computer science or bioinformatics. The output will be demonstrated with cancer interesting microRNAs 18a, 21, 29a, 30c-2, 30d, and 31, cancer interesting genes ABCB1, ATM, ATR, BRCA2, and TP53, and the clinically relevant drugs belinostat, crizotinib, pazopanib, selumetinib and tanespimycin. Citation Format: {Authors}. {Abstract title} [abstract]. In: Proceedings of the 103rd Annual Meeting of the American Association for Cancer Research; 2012 Mar 31-Apr 4; Chicago, IL. Philadelphia (PA): AACR; Cancer Res 2012;72(8 Suppl):Abstract nr 2991. doi:1538-7445.AM2012-2991
Genome Biology | 2003
Barry R. Zeeberg; Weimin Feng; Geoffrey Wang; May D. Wang; Anthony T Fojo; Margot Sunshine; Sudarshan Narasimhan; David Kane; William C. Reinhold; Samir Lababidi; Kimberly J. Bussey; Joseph Riss; J. Carl Barrett; John N. Weinstein
BMC Bioinformatics | 2005
Barry R. Zeeberg; Haiying Qin; Sudarshan Narasimhan; Margot Sunshine; Hong Cao; David Kane; Mark Reimers; Robert M. Stephens; David Bryant; Stanley K. Burt; Eldad Elnekave; Danielle M. Hari; Thomas A. Wynn; Charlotte Cunningham-Rundles; Donn M. Stewart; David E. Nelson; John N. Weinstein
Genome Biology | 2003
Kimberly J. Bussey; David Kane; Margot Sunshine; Sudar Narasimhan; Satoshi Nishizuka; William C. Reinhold; Barry R. Zeeberg; Ajay; John N. Weinstein
BMC Bioinformatics | 2006
Sylvia Major; Satoshi Nishizuka; Daisaku Morita; Rick Rowland; Margot Sunshine; Uma Shankavaram; Frank Washburn; Daniel Asin; Hosein Kouros-Mehr; David Kane; John N. Weinstein