Network


Latest external collaboration on country level. Dive into details by clicking on the dots.

Hotspot


Dive into the research topics where Natali Kolker is active.

Publication


Featured researches published by Natali Kolker.


Nucleic Acids Research | 2012

MOPED: Model Organism Protein Expression Database

Eugene Kolker; Roger Higdon; Winston A. Haynes; Dean Welch; William Broomall; Doron Lancet; Larissa Stanberry; Natali Kolker

Large numbers of mass spectrometry proteomics studies are being conducted to understand all types of biological processes. The size and complexity of proteomics data hinders efforts to easily share, integrate, query and compare the studies. The Model Organism Protein Expression Database (MOPED, htttp://moped.proteinspire.org) is a new and expanding proteomics resource that enables rapid browsing of protein expression information from publicly available studies on humans and model organisms. MOPED is designed to simplify the comparison and sharing of proteomics data for the greater research community. MOPED uniquely provides protein level expression data, meta-analysis capabilities and quantitative data from standardized analysis. Data can be queried for specific proteins, browsed based on organism, tissue, localization and condition and sorted by false discovery rate and expression. MOPED empowers users to visualize their own expression data and compare it with existing studies. Further, MOPED links to various protein and pathway databases, including GeneCards, Entrez, UniProt, KEGG and Reactome. The current version of MOPED contains over 43 000 proteins with at least one spectral match and more than 11 million high certainty spectra.


Omics A Journal of Integrative Biology | 2014

Toward more transparent and reproducible omics studies through a common metadata checklist and data publications.

Eugene Kolker; Vural Ozdemir; Lennart Martens; William S. Hancock; Gordon A. Anderson; Nathaniel Anderson; Sukru Aynacioglu; Ancha Baranova; Shawn R. Campagna; Rui Chen; John Choiniere; Stephen P. Dearth; Wu-chun Feng; Lynnette R. Ferguson; Geoffrey C. Fox; Dmitrij Frishman; Robert L. Grossman; Allison P. Heath; Roger Higdon; Mara H. Hutz; Imre Janko; Lihua Jiang; Sanjay Joshi; Alexander E. Kel; Joseph W. Kemnitz; Isaac S. Kohane; Natali Kolker; Doron Lancet; Elaine Lee; Weizhong Li

Biological processes are fundamentally driven by complex interactions between biomolecules. Integrated high-throughput omics studies enable multifaceted views of cells, organisms, or their communities. With the advent of new post-genomics technologies, omics studies are becoming increasingly prevalent; yet the full impact of these studies can only be realized through data harmonization, sharing, meta-analysis, and integrated research. These essential steps require consistent generation, capture, and distribution of metadata. To ensure transparency, facilitate data harmonization, and maximize reproducibility and usability of life sciences studies, we propose a simple common omics metadata checklist. The proposed checklist is built on the rich ontologies and standards already in use by the life sciences community. The checklist will serve as a common denominator to guide experimental design, capture important parameters, and be used as a standard format for stand-alone data publications. The omics metadata checklist and data publications will create efficient linkages between omics data and knowledge-based life sciences innovation and, importantly, allow for appropriate attribution to data generators and infrastructure science builders in the post-genomics era. We ask that the life sciences community test the proposed omics metadata checklist and data publications and provide feedback for their use and improvement.


Big data | 2013

Unraveling the Complexities of Life Sciences Data

Roger Higdon; Winston A. Haynes; Larissa Stanberry; Elizabeth Stewart; Gregory Yandl; Chris Howard; William Broomall; Natali Kolker; Eugene Kolker

The life sciences have entered into the realm of big data and data-enabled science, where data can either empower or overwhelm. These data bring the challenges of the 5 Vs of big data: volume, veracity, velocity, variety, and value. Both independently and through our involvement with DELSA Global (Data-Enabled Life Sciences Alliance, DELSAglobal.org), the Kolker Lab ( kolkerlab.org ) is creating partnerships that identify data challenges and solve community needs. We specialize in solutions to complex biological data challenges, as exemplified by the community resource of MOPED (Model Organism Protein Expression Database, MOPED.proteinspire.org ) and the analysis pipeline of SPIRE (Systematic Protein Investigative Research Environment, PROTEINSPIRE.org ). Our collaborative work extends into the computationally intensive tasks of analysis and visualization of millions of protein sequences through innovative implementations of sequence alignment algorithms and creation of the Protein Sequence Universe tool (PSU). Pushing into the future together with our collaborators, our lab is pursuing integration of multi-omics data and exploration of biological pathways, as well as assigning function to proteins and porting solutions to the cloud. Big data have come to the life sciences; discovering the knowledge in the data will bring breakthroughs and benefits.


Omics A Journal of Integrative Biology | 2015

The Promise of Multi-Omics and Clinical Data Integration to Identify and Target Personalized Healthcare Approaches in Autism Spectrum Disorders

Roger Higdon; Rachel K. Earl; Larissa Stanberry; Caitlin M. Hudac; Elizabeth Montague; Elizabeth Stewart; Imre Janko; John Choiniere; William Broomall; Natali Kolker; Raphael Bernier; Eugene Kolker

Complex diseases are caused by a combination of genetic and environmental factors, creating a difficult challenge for diagnosis and defining subtypes. This review article describes how distinct disease subtypes can be identified through integration and analysis of clinical and multi-omics data. A broad shift toward molecular subtyping of disease using genetic and omics data has yielded successful results in cancer and other complex diseases. To determine molecular subtypes, patients are first classified by applying clustering methods to different types of omics data, then these results are integrated with clinical data to characterize distinct disease subtypes. An example of this molecular-data-first approach is in research on Autism Spectrum Disorder (ASD), a spectrum of social communication disorders marked by tremendous etiological and phenotypic heterogeneity. In the case of ASD, omics data such as exome sequences and gene and protein expression data are combined with clinical data such as psychometric testing and imaging to enable subtype identification. Novel ASD subtypes have been proposed, such as CHD8, using this molecular subtyping approach. Broader use of molecular subtyping in complex disease research is impeded by data heterogeneity, diversity of standards, and ineffective analysis tools. The future of molecular subtyping for ASD and other complex diseases calls for an integrated resource to identify disease mechanisms, classify new patients, and inform effective treatment options. This in turn will empower and accelerate precision medicine and personalized healthcare.


Omics A Journal of Integrative Biology | 2014

MOPED 2.5--an integrated multi-omics resource: multi-omics profiling expression database now includes transcriptomics data.

Elizabeth Montague; Larissa Stanberry; Roger Higdon; Imre Janko; Elaine Lee; Nathaniel Anderson; John Choiniere; Elizabeth Stewart; Gregory Yandl; William Broomall; Natali Kolker; Eugene Kolker

Multi-omics data-driven scientific discovery crucially rests on high-throughput technologies and data sharing. Currently, data are scattered across single omics repositories, stored in varying raw and processed formats, and are often accompanied by limited or no metadata. The Multi-Omics Profiling Expression Database (MOPED, http://moped.proteinspire.org ) version 2.5 is a freely accessible multi-omics expression database. Continual improvement and expansion of MOPED is driven by feedback from the Life Sciences Community. In order to meet the emergent need for an integrated multi-omics data resource, MOPED 2.5 now includes gene relative expression data in addition to protein absolute and relative expression data from over 250 large-scale experiments. To facilitate accurate integration of experiments and increase reproducibility, MOPED provides extensive metadata through the Data-Enabled Life Sciences Alliance (DELSA Global, http://delsaglobal.org ) metadata checklist. MOPED 2.5 has greatly increased the number of proteomics absolute and relative expression records to over 500,000, in addition to adding more than four million transcriptomics relative expression records. MOPED has an intuitive user interface with tabs for querying different types of omics expression data and new tools for data visualization. Summary information including expression data, pathway mappings, and direct connection between proteins and genes can be viewed on Protein and Gene Details pages. These connections in MOPED provide a context for multi-omics expression data exploration. Researchers are encouraged to submit omics data which will be consistently processed into expression summaries. MOPED as a multi-omics data resource is a pivotal public database, interdisciplinary knowledge resource, and platform for multi-omics understanding.


Journal of Proteomics | 2011

SPIRE: Systematic protein investigative research environment

Eugene Kolker; Roger Higdon; Dean Welch; Andrew Bauman; Elizabeth Stewart; Winston Haynes; William Broomall; Natali Kolker

The SPIRE (Systematic Protein Investigative Research Environment) provides web-based experiment-specific mass spectrometry (MS) proteomics analysis (https://www.proteinspire.org). Its emphasis is on usability and integration of the best analytic tools. SPIRE provides an easy to use web-interface and generates results in both interactive and simple data formats. In contrast to run-based approaches, SPIRE conducts the analysis based on the experimental design. It employs novel methods to generate false discovery rates and local false discovery rates (FDR, LFDR) and integrates the best and complementary open-source search and data analysis methods. The SPIRE approach of integrating X!Tandem, OMSSA and SpectraST can produce an increase in protein IDs (52-88%) over current combinations of scoring and single search engines while also providing accurate multi-faceted error estimation. One of SPIREs primary assets is combining the results with data on protein function, pathways and protein expression from model organisms. We demonstrate some of SPIREs capabilities by analyzing mitochondrial proteins from the wild type and 3 mutants of C. elegans. SPIRE also connects results to publically available proteomics data through its Model Organism Protein Expression Database (MOPED). SPIRE can also provide analysis and annotation for user supplied protein ID and expression data.


PLOS ONE | 2010

The United States of America and scientific research.

Gregory Hather; Winston Haynes; Roger Higdon; Natali Kolker; Elizabeth Stewart; Peter W. Arzberger; Patrick Chain; Dawn Field; B. Robert Franza; Biaoyang Lin; Folker Meyer; Vural Ozdemir; Charles V. Smith; Gerald van Belle; John Wooley; Eugene Kolker

To gauge the current commitment to scientific research in the United States of America (US), we compared federal research funding (FRF) with the US gross domestic product (GDP) and industry research spending during the past six decades. In order to address the recent globalization of scientific research, we also focused on four key indicators of research activities: research and development (R&D) funding, total science and engineering doctoral degrees, patents, and scientific publications. We compared these indicators across three major population and economic regions: the US, the European Union (EU) and the Peoples Republic of China (China) over the past decade. We discovered a number of interesting trends with direct relevance for science policy. The level of US FRF has varied between 0.2% and 0.6% of the GDP during the last six decades. Since the 1960s, the US FRF contribution has fallen from twice that of industrial research funding to roughly equal. Also, in the last two decades, the portion of the US government R&D spending devoted to research has increased. Although well below the US and the EU in overall funding, the current growth rate for R&D funding in China greatly exceeds that of both. Finally, the EU currently produces more science and engineering doctoral graduates and scientific publications than the US in absolute terms, but not per capita. This studys aim is to facilitate a serious discussion of key questions by the research community and federal policy makers. In particular, our results raise two questions with respect to: a) the increasing globalization of science: “What role is the US playing now, and what role will it play in the future of international science?”; and b) the ability to produce beneficial innovations for society: “How will the US continue to foster its strengths?”


Journal of Proteome Research | 2014

MOPED Enables Discoveries through Consistently Processed Proteomics Data

Roger Higdon; Elizabeth Stewart; Larissa Stanberry; Winston A. Haynes; John Choiniere; Elizabeth Montague; Nathaniel Anderson; Gregory Yandl; Imre Janko; William Broomall; Simon Fishilevich; Doron Lancet; Natali Kolker; Eugene Kolker

The Model Organism Protein Expression Database (MOPED, http://moped.proteinspire.org) is an expanding proteomics resource to enable biological and biomedical discoveries. MOPED aggregates simple, standardized and consistently processed summaries of protein expression and metadata from proteomics (mass spectrometry) experiments from human and model organisms (mouse, worm, and yeast). The latest version of MOPED adds new estimates of protein abundance and concentration as well as relative (differential) expression data. MOPED provides a new updated query interface that allows users to explore information by organism, tissue, localization, condition, experiment, or keyword. MOPED supports the Human Proteome Projects efforts to generate chromosome- and diseases-specific proteomes by providing links from proteins to chromosome and disease information as well as many complementary resources. MOPED supports a new omics metadata checklist to harmonize data integration, analysis, and use. MOPEDs development is driven by the user community, which spans 90 countries and guides future development that will transform MOPED into a multiomics resource. MOPED encourages users to submit data in a simple format. They can use the metadata checklist to generate a data publication for this submission. As a result, MOPED will provide even greater insights into complex biological processes and systems and enable deeper and more comprehensive biological and biomedical discoveries.


Nucleic Acids Research | 2015

Beyond protein expression, MOPED goes multi-omics

Elizabeth Montague; Imre Janko; Larissa Stanberry; Elaine Lee; John Choiniere; Nathaniel Anderson; Elizabeth Stewart; William Broomall; Roger Higdon; Natali Kolker; Eugene Kolker

MOPED (Multi-Omics Profiling Expression Database; http://moped.proteinspire.org) has transitioned from solely a protein expression database to a multi-omics resource for human and model organisms. Through a web-based interface, MOPED presents consistently processed data for gene, protein and pathway expression. To improve data quality, consistency and use, MOPED includes metadata detailing experimental design and analysis methods. The multi-omics data are integrated through direct links between genes and proteins and further connected to pathways and experiments. MOPED now contains over 5 million records, information for approximately 75 000 genes and 50 000 proteins from four organisms (human, mouse, worm, yeast). These records correspond to 670 unique combinations of experiment, condition, localization and tissue. MOPED includes the following new features: pathway expression, Pathway Details pages, experimental metadata checklists, experiment summary statistics and more advanced searching tools. Advanced searching enables querying for genes, proteins, experiments, pathways and keywords of interest. The system is enhanced with visualizations for comparing across different data types. In the future MOPED will expand the number of organisms, increase integration with pathways and provide connections to disease.


Journal of Proteomics | 2011

IPM: An integrated protein model for false discovery rate estimation and identification in high-throughput proteomics.

Roger Higdon; Lukas Reiter; Gregory Hather; Winston Haynes; Natali Kolker; Elizabeth Stewart; Andrew Bauman; Paola Picotti; Alexander Schmidt; Gerald van Belle; Ruedi Aebersold; Eugene Kolker

In high-throughput mass spectrometry proteomics, peptides and proteins are not simply identified as present or not present in a sample, rather the identifications are associated with differing levels of confidence. The false discovery rate (FDR) has emerged as an accepted means for measuring the confidence associated with identifications. We have developed the Systematic Protein Investigative Research Environment (SPIRE) for the purpose of integrating the best available proteomics methods. Two successful approaches to estimating the FDR for MS protein identifications are the MAYU and our current SPIRE methods. We present here a method to combine these two approaches to estimating the FDR for MS protein identifications into an integrated protein model (IPM). We illustrate the high quality performance of this IPM approach through testing on two large publicly available proteomics datasets. MAYU and SPIRE show remarkable consistency in identifying proteins in these datasets. Still, IPM results in a more robust FDR estimation approach and additional identifications, particularly among low abundance proteins. IPM is now implemented as a part of the SPIRE system.

Collaboration


Dive into the Natali Kolker's collaboration.

Top Co-Authors

Avatar

Eugene Kolker

University of Washington

View shared research outputs
Top Co-Authors

Avatar

Roger Higdon

Seattle Children's Research Institute

View shared research outputs
Top Co-Authors

Avatar

Elizabeth Stewart

Boston Children's Hospital

View shared research outputs
Top Co-Authors

Avatar
Top Co-Authors

Avatar

William Broomall

Seattle Children's Research Institute

View shared research outputs
Top Co-Authors

Avatar

Imre Janko

Boston Children's Hospital

View shared research outputs
Top Co-Authors

Avatar

John Choiniere

Seattle Children's Research Institute

View shared research outputs
Top Co-Authors

Avatar

Elizabeth Montague

Seattle Children's Research Institute

View shared research outputs
Top Co-Authors

Avatar
Top Co-Authors

Avatar

Nathaniel Anderson

Seattle Children's Research Institute

View shared research outputs
Researchain Logo
Decentralizing Knowledge