Esther T. Chan
Stanford University
Network
Latest external collaboration on country level. Dive into details by clicking on the dots.
Publication
Featured researches published by Esther T. Chan.
Nucleic Acids Research | 2012
J. Michael Cherry; Eurie L. Hong; Craig Amundsen; Rama Balakrishnan; Gail Binkley; Esther T. Chan; Karen R. Christie; Maria C. Costanzo; Selina S. Dwight; Stacia R. Engel; Dianna G. Fisk; Jodi E. Hirschman; Benjamin C. Hitz; Kalpana Karra; Cynthia J. Krieger; Stuart R. Miyasato; Robert S. Nash; Julie Park; Marek S. Skrzypek; Matt Simison; Shuai Weng; Edith D. Wong
The Saccharomyces Genome Database (SGD, http://www.yeastgenome.org) is the community resource for the budding yeast Saccharomyces cerevisiae. The SGD project provides the highest-quality manually curated information from peer-reviewed literature. The experimental results reported in the literature are extracted and integrated within a well-developed database. These data are combined with quality high-throughput results and provided through Locus Summary pages, a powerful query engine and rich genome browser. The acquisition, integration and retrieval of these data allow SGD to facilitate experimental design and analysis by providing an encyclopedia of the yeast genome, its chromosomal features, their functions and interactions. Public access to these data is provided to researchers and educators via web pages designed for optimal ease of use.
Nucleic Acids Research | 2016
Cricket A. Sloan; Esther T. Chan; Jean M. Davidson; Venkat S. Malladi; J. Seth Strattan; Benjamin C. Hitz; Idan Gabdank; Aditi K. Narayanan; Marcus Ho; Brian T. Lee; Laurence D. Rowe; Timothy R. Dreszer; Greg Roe; Nikhil R. Podduturi; Forrest Tanaka; Eurie L. Hong; J. Michael Cherry
The Encyclopedia of DNA Elements (ENCODE) Project is in its third phase of creating a comprehensive catalog of functional elements in the human genome. This phase of the project includes an expansion of assays that measure diverse RNA populations, identify proteins that interact with RNA and DNA, probe regions of DNA hypersensitivity, and measure levels of DNA methylation in a wide range of cell and tissue types to identify putative regulatory elements. To date, results for almost 5000 experiments have been released for use by the scientific community. These data are available for searching, visualization and download at the new ENCODE Portal (www.encodeproject.org). The revamped ENCODE Portal provides new ways to browse and search the ENCODE data based on the metadata that describe the assays as well as summaries of the assays that focus on data provenance. In addition, it is a flexible platform that allows integration of genomic data from multiple projects. The portal experience was designed to improve access to ENCODE data by relying on metadata that allow reusability and reproducibility of the experiments.
Nucleic Acids Research | 2014
Maria C. Costanzo; Stacia R. Engel; Edith D. Wong; Paul Lloyd; Kalpana Karra; Esther T. Chan; Shuai Weng; Kelley M. Paskov; Greg Roe; Gail Binkley; Benjamin C. Hitz; J. Michael Cherry
The Saccharomyces Genome Database (SGD; http://www.yeastgenome.org) is the community resource for genomic, gene and protein information about the budding yeast Saccharomyces cerevisiae, containing a variety of functional information about each yeast gene and gene product. We have recently added regulatory information to SGD and present it on a new tabbed section of the Locus Summary entitled ‘Regulation’. We are compiling transcriptional regulator–target gene relationships, which are curated from the literature at SGD or imported, with permission, from the YEASTRACT database. For nearly every S. cerevisiae gene, the Regulation page displays a table of annotations showing the regulators of that gene, and a graphical visualization of its regulatory network. For genes whose products act as transcription factors, the Regulation page also shows a table of their target genes, accompanied by a Gene Ontology enrichment analysis of the biological processes in which those genes participate. We additionally synthesize information from the literature for each transcription factor in a free-text Regulation Summary, and provide other information relevant to its regulatory function, such as DNA binding site motifs and protein domains. All of the regulation data are available for querying, analysis and download via YeastMine, the InterMine-based data warehouse system in use at SGD.
Database | 2015
Venkat S. Malladi; Drew T. Erickson; Nikhil R. Podduturi; Laurence D. Rowe; Esther T. Chan; Jean M. Davidson; Benjamin C. Hitz; Marcus Ho; Brian T. Lee; Stuart R. Miyasato; Gregory R. Roe; Matt Simison; Cricket A. Sloan; J. Seth Strattan; Forrest Tanaka; W. James Kent; J. Michael Cherry; Eurie L. Hong
The Encyclopedia of DNA elements (ENCODE) project is an ongoing collaborative effort to create a catalog of genomic annotations. To date, the project has generated over 4000 experiments across more than 350 cell lines and tissues using a wide array of experimental techniques to study the chromatin structure, regulatory network and transcriptional landscape of the Homo sapiens and Mus musculus genomes. All ENCODE experimental data, metadata and associated computational analyses are submitted to the ENCODE Data Coordination Center (DCC) for validation, tracking, storage and distribution to community resources and the scientific community. As the volume of data increases, the organization of experimental details becomes increasingly complicated and demands careful curation to identify related experiments. Here, we describe the ENCODE DCC’s use of ontologies to standardize experimental metadata. We discuss how ontologies, when used to annotate metadata, provide improved searching capabilities and facilitate the ability to find connections within a set of experiments. Additionally, we provide examples of how ontologies are used to annotate ENCODE metadata and how the annotations can be identified via ontology-driven searches at the ENCODE portal. As genomic datasets grow larger and more interconnected, standardization of metadata becomes increasingly vital to allow for exploration and comparison of data between different scientific projects. Database URL: https://www.encodeproject.org/
Database | 2016
Eurie L. Hong; Cricket A. Sloan; Esther T. Chan; Jean M. Davidson; Venkat S. Malladi; J. Seth Strattan; Benjamin C. Hitz; Idan Gabdank; Aditi K. Narayanan; Marcus Ho; Brian T. Lee; Laurence D. Rowe; Timothy R. Dreszer; Greg Roe; Nikhil R. Podduturi; Forrest Tanaka; Jason A Hilton; J. Michael Cherry
The Encyclopedia of DNA Elements (ENCODE) Data Coordinating Center (DCC) is responsible for organizing, describing and providing access to the diverse data generated by the ENCODE project. The description of these data, known as metadata, includes the biological sample used as input, the protocols and assays performed on these samples, the data files generated from the results and the computational methods used to analyze the data. Here, we outline the principles and philosophy used to define the ENCODE metadata in order to create a metadata standard that can be applied to diverse assays and multiple genomic projects. In addition, we present how the data are validated and used by the ENCODE DCC in creating the ENCODE Portal (https://www.encodeproject.org/). Database URL: www.encodeproject.org
Nucleic Acids Research | 2018
Carrie A Davis; Benjamin C. Hitz; Cricket A. Sloan; Esther T. Chan; Jean M. Davidson; Idan Gabdank; Jason A Hilton; Kriti Jain; Ulugbek K Baymuradov; Aditi K. Narayanan; Kathrina C. Onate; Keenan Graham; Stuart R. Miyasato; Timothy R. Dreszer; J. Seth Strattan; Otto Jolanki; Forrest Tanaka; J. Michael Cherry
Abstract The Encyclopedia of DNA Elements (ENCODE) Data Coordinating Center has developed the ENCODE Portal database and website as the source for the data and metadata generated by the ENCODE Consortium. Two principles have motivated the design. First, experimental protocols, analytical procedures and the data themselves should be made publicly accessible through a coherent, web-based search and download interface. Second, the same interface should serve carefully curated metadata that record the provenance of the data and justify its interpretation in biological terms. Since its initial release in 2013 and in response to recommendations from consortium members and the wider community of scientists who use the Portal to access ENCODE data, the Portal has been regularly updated to better reflect these design principles. Here we report on these updates, including results from new experiments, uniformly-processed data from other projects, new visualization tools and more comprehensive metadata to describe experiments and analyses. Additionally, the Portal is now home to meta(data) from related projects including Genomics of Gene Regulation, Roadmap Epigenome Project, Model organism ENCODE (modENCODE) and modERN. The Portal now makes available over 13000 datasets and their accompanying metadata and can be accessed at: https://www.encodeproject.org/.
Database | 2012
Esther T. Chan; J. Michael Cherry
The Saccharomyces Genome Database (SGD) is compiling and annotating a comprehensive catalogue of functional sequence elements identified in the budding yeast genome. Recent advances in deep sequencing technologies have enabled for example, global analyses of transcription profiling and assembly of maps of transcription factor occupancy and higher order chromatin organization, at nucleotide level resolution. With this growing influx of published genome-scale data, come new challenges for their storage, display, analysis and integration. Here, we describe SGDs progress in the creation of a consolidated resource for genome sequence elements in the budding yeast, the considerations taken in its design and the lessons learned thus far. The data within this collection can be accessed at http://browse.yeastgenome.org and downloaded from http://downloads.yeastgenome.org. Database URL: http://www.yeastgenome.org
PLOS ONE | 2017
Benjamin C. Hitz; Laurence D. Rowe; Nikhil R. Podduturi; David I. Glick; Ulugbek K Baymuradov; Venkat S. Malladi; Esther T. Chan; Jean M. Davidson; Idan Gabdank; Aditi K. Narayana; Kathrina C. Onate; Jason A Hilton; Marcus Ho; Brian T. Lee; Stuart R. Miyasato; Timothy R. Dreszer; Cricket A. Sloan; J. Seth Strattan; Forrest Tanaka; Eurie L. Hong; J. Michael Cherry
The Encyclopedia of DNA elements (ENCODE) project is an ongoing collaborative effort to create a comprehensive catalog of functional elements initiated shortly after the completion of the Human Genome Project. The current database exceeds 6500 experiments across more than 450 cell lines and tissues using a wide array of experimental techniques to study the chromatin structure, regulatory and transcriptional landscape of the H. sapiens and M. musculus genomes. All ENCODE experimental data, metadata, and associated computational analyses are submitted to the ENCODE Data Coordination Center (DCC) for validation, tracking, storage, unified processing, and distribution to community resources and the scientific community. As the volume of data increases, the identification and organization of experimental details becomes increasingly intricate and demands careful curation. The ENCODE DCC has created a general purpose software system, known as SnoVault, that supports metadata and file submission, a database used for metadata storage, web pages for displaying the metadata and a robust API for querying the metadata. The software is fully open-source, code and installation instructions can be found at: http://github.com/ENCODE-DCC/snovault/ (for the generic database) and http://github.com/ENCODE-DCC/encoded/ to store genomic data in the manner of ENCODE. The core database engine, SnoVault (which is completely independent of ENCODE, genomic data, or bioinformatic data) has been released as a separate Python package.
Database | 2018
Idan Gabdank; Esther T. Chan; Jean M. Davidson; Jason A Hilton; Carrie A Davis; Ulugbek K Baymuradov; Aditi K. Narayanan; Kathrina C. Onate; Keenan Graham; Stuart R. Miyasato; Timothy R. Dreszer; J. Seth Strattan; Otto Jolanki; Forrest Tanaka; Benjamin C. Hitz; Cricket A. Sloan; J. Michael Cherry
Abstract Prevention of unintended duplication is one of the ongoing challenges many databases have to address. Working with high-throughput sequencing data, the complexity of that challenge increases with the complexity of the definition of a duplicate. In a computational data model, a data object represents a real entity like a reagent or a biosample. This representation is similar to how a card represents a book in a paper library catalog. Duplicated data objects not only waste storage, they can mislead users into assuming the model represents more than the single entity. Even if it is clear that two objects represent a single entity, data duplication opens the door to potential inconsistencies between the objects since the content of the duplicated objects can be updated independently, allowing divergence of the metadata associated with the objects. Analogously to a situation in which a catalog in a paper library would contain by mistake two cards for a single copy of a book. If these cards are listing simultaneously two different individuals as current book borrowers, it would be difficult to determine which borrower (out of the two listed) actually has the book. Unfortunately, in a large database with multiple submitters, unintended duplication is to be expected. In this article, we present three principal guidelines the Encyclopedia of DNA Elements (ENCODE) Portal follows in order to prevent unintended duplication of both actual files and data objects: definition of identifiable data objects (I), object uniqueness validation (II) and de-duplication mechanism (III). In addition to explaining our modus operandi, we elaborate on the methods used for identification of sequencing data files. Comparison of the approach taken by the ENCODE Portal vs other widely used biological data repositories is provided. Database URL: https://www.encodeproject.org/
F1000Research | 2017
Esther T. Chan; Jason A Hilton; Kathrina C. Onate; Idan Gabdank; Marcus Ho; Aditi K. Narayanan; J. Seth Strattan; Ulugbek K Baymuradov; Forrest Tanaka; Chris D. Thomas; Timothy R. Dreszer; Cricket A. Sloan; Benjamin C. Hitz; J. Michael Cherry