Kristen M. Naegle
Washington University in St. Louis
Network
Latest external collaboration on country level. Dive into details by clicking on the dots.
Publication
Featured researches published by Kristen M. Naegle.
Cell | 2013
Yongcheol Cho; Roman Sloutsky; Kristen M. Naegle; Valeria Cavalli
Reactivation of a silent transcriptional program is a critical step in successful axon regeneration following injury. Yet how such a program is unlocked after injury remains largely unexplored. We found that axon injury in peripheral sensory neurons elicits a back-propagating calcium wave that invades the soma and causes nuclear export of HDAC5 in a PKCμ-dependent manner. Injury-induced HDAC5 nuclear export enhances histone acetylation to activate a proregenerative gene-expression program. HDAC5 nuclear export is required for axon regeneration, as expression of a nuclear-trapped HDAC5 mutant prevents axon regeneration, whereas enhancing HDAC5 nuclear export promotes axon regeneration in vitro and in vivo. Components of this HDAC5 pathway failed to be activated in a model of central nervous system injury. These studies reveal a signaling mechanism from the axon injury site to the soma that controls neuronal growth competence and suggest a role for HDAC5 as a transcriptional switch controlling axon regeneration.
Biochemical Journal | 2013
Leo K. Iwai; Leo S. Payne; Maciej T. Luczynski; Francis Chang; Huifang Xu; Ryan Clinton; Angela Paul; Edward A. Esposito; Scott Gridley; Birgit Leitinger; Kristen M. Naegle; Paul H. Huang
Collagen is an important extracellular matrix component that directs many fundamental cellular processes including differentiation, proliferation and motility. The signalling networks driving these processes are propagated by collagen receptors such as the β1 integrins and the DDRs (discoidin domain receptors). To gain an insight into the molecular mechanisms of collagen receptor signalling, we have performed a quantitative analysis of the phosphorylation networks downstream of collagen activation of integrins and DDR2. Temporal analysis over seven time points identified 424 phosphorylated proteins. Distinct DDR2 tyrosine phosphorylation sites displayed unique temporal activation profiles in agreement with in vitro kinase data. Multiple clustering analysis of the phosphoproteomic data revealed several DDR2 candidate downstream signalling nodes, including SHP-2 (Src homology 2 domain-containing protein tyrosine phosphatase 2), NCK1 (non-catalytic region of tyrosine kinase adaptor protein 1), LYN, SHIP-2 [SH2 (Src homology 2)-domain-containing inositol phosphatase 2], PIK3C2A (phosphatidylinositol-4-phosphate 3-kinase, catalytic subunit type 2α) and PLCL2 (phospholipase C-like 2). Biochemical validation showed that SHP-2 tyrosine phosphorylation is dependent on DDR2 kinase activity. Targeted proteomic profiling of a panel of lung SCC (squamous cell carcinoma) DDR2 mutants demonstrated that SHP-2 is tyrosine-phosphorylated by the L63V and G505S mutants. In contrast, the I638F kinase domain mutant exhibited diminished DDR2 and SHP-2 tyrosine phosphorylation levels which have an inverse relationship with clonogenic potential. Taken together, the results of the present study indicate that SHP-2 is a key signalling node downstream of the DDR2 receptor which may have therapeutic implications in a subset of DDR2 mutations recently uncovered in genome-wide lung SCC sequencing screens.
Molecular & Cellular Proteomics | 2010
Kristen M. Naegle; Melissa Gymrek; Brian A. Joughin; Joel P. Wagner; Roy E. Welsch; Michael B. Yaffe; Douglas A. Lauffenburger; Forest M. White
The rate of discovery of post-translational modification (PTM) sites is increasing rapidly and is significantly outpacing our biological understanding of the function and regulation of those modifications. To help meet this challenge, we have created PTMScout, a web-based interface for viewing, manipulating, and analyzing high throughput experimental measurements of PTMs in an effort to facilitate biological understanding of protein modifications in signaling networks. PTMScout is constructed around a custom database of PTM experiments and contains information from external protein and post-translational resources, including gene ontology annotations, Pfam domains, and Scansite predictions of kinase and phosphopeptide binding domain interactions. PTMScout functionality comprises data set comparison tools, data set summary views, and tools for protein assignments of peptides identified by mass spectrometry. Analysis tools in PTMScout focus on informed subset selection via common criteria and on automated hypothesis generation through subset labeling derived from identification of statistically significant enrichment of other annotations in the experiment. Subset selection can be applied through the PTMScout flexible query interface available for quantitative data measurements and data annotations as well as an interface for importing data set groupings by external means, such as unsupervised learning. We exemplify the various functions of PTMScout in application to data sets that contain relative quantitative measurements as well as data sets lacking quantitative measurements, producing a set of interesting biological hypotheses. PTMScout is designed to be a widely accessible tool, enabling generation of multiple types of biological hypotheses from high throughput PTM experiments and advancing functional assignment of novel PTM sites. PTMScout is available at http://ptmscout.mit.edu.
Science Signaling | 2016
Tom Ronan; Zhijie Qi; Kristen M. Naegle
Properly applied, clustering methods reveal meaning in high-throughput biological data. Gloss Clustering is an unsupervised learning method, grouping data points based on similarity, with the goal of revealing the underlying structure of data. Advances in molecular biology have yielded large and complex data sets, making clustering essential to understand and visualize the data. Clustering can be a powerful technique, but it harbors potential pitfalls due to the high-dimensional nature of biological data, the failure to consider more than one clustering method for a given problem, and the difficulty in determining whether clustering has produced meaningful results. We present concrete examples of problems and solutions (clustering results) in the form of toy problems and real biological data for these pitfalls, illustrating how to avoid overinterpreting the data and missing valuable insights within high-throughput molecular measurements. The article contains six figures, four tables, and 77 references. Clustering is an unsupervised learning method, which groups data points based on similarity, and is used to reveal the underlying structure of data. This computational approach is essential to understanding and visualizing the complex data that are acquired in high-throughput multidimensional biological experiments. Clustering enables researchers to make biological inferences for further experiments. Although a powerful technique, inappropriate application can lead biological researchers to waste resources and time in experimental follow-up. We review common pitfalls identified from the published molecular biology literature and present methods to avoid them. Commonly encountered pitfalls relate to the high-dimensional nature of biological data from high-throughput experiments, the failure to consider more than one clustering method for a given problem, and the difficulty in determining whether clustering has produced meaningful results. We present concrete examples of problems and solutions (clustering results) in the form of toy problems and real biological data for these issues. We also discuss ensemble clustering as an easy-to-implement method that enables the exploration of multiple clustering solutions and improves robustness of clustering solutions. Increased awareness of common clustering pitfalls will help researchers avoid overinterpreting or misinterpreting the results and missing valuable insights when clustering biological data.
Nucleic Acids Research | 2015
Matthew Matlock; Alex S. Holehouse; Kristen M. Naegle
ProteomeScout (https://proteomescout.wustl.edu) is a resource for the study of proteins and their post-translational modifications (PTMs) consisting of a database of PTMs, a repository for experimental data, an analysis suite for PTM experiments, and a tool for visualizing the relationships between complex protein annotations. The PTM database is a compendium of public PTM data, coupled with user-uploaded experimental data. ProteomeScout provides analysis tools for experimental datasets, including summary views and subset selection, which can identify relationships within subsets of data by testing for statistically significant enrichment of protein annotations. Protein annotations are incorporated in the ProteomeScout database from external resources and include terms such as Gene Ontology annotations, domains, secondary structure and non-synonymous polymorphisms. These annotations are available in the database download, in the analysis tools and in the protein viewer. The protein viewer allows for the simultaneous visualization of annotations in an interactive web graphic, which can be exported in Scalable Vector Graphics (SVG) format. Finally, quantitative data measurements associated with public experiments are also easily viewable within protein records, allowing researchers to see how PTMs change across different contexts. ProteomeScout should prove useful for protein researchers and should benefit the proteomics community by providing a stable repository for PTM experiments.
Journal of Biological Chemistry | 2016
Tom Ronan; Jennifer L. Macdonald-Obermann; Lorel Huelsmann; Nicholas J. Bessman; Kristen M. Naegle; Linda J. Pike
The EGF receptor can bind seven different agonist ligands. Although each agonist appears to stimulate the same suite of downstream signaling proteins, different agonists are capable of inducing distinct responses in the same cell. To determine the basis for these differences, we used luciferase fragment complementation imaging to monitor the recruitment of Cbl, CrkL, Gab1, Grb2, PI3K, p52 Shc, p66 Shc, and Shp2 to the EGF receptor when stimulated by the seven EGF receptor ligands. Recruitment of all eight proteins was rapid, dose-dependent, and inhibited by erlotinib and lapatinib, although to differing extents. Comparison of the time course of recruitment of the eight proteins in response to a fixed concentration of each growth factor revealed differences among the growth factors that could contribute to their differing biological effects. Principal component analysis of the resulting data set confirmed that the recruitment of these proteins differed between agonists and also between different doses of the same agonist. Ensemble clustering of the overall response to the different growth factors suggests that these EGF receptor ligands fall into two major groups as follows: (i) EGF, amphiregulin, and EPR; and (ii) betacellulin, TGFα, and epigen. Heparin-binding EGF is distantly related to both clusters. Our data identify differences in network utilization by different EGF receptor agonists and highlight the need to characterize network interactions under conditions other than high dose EGF.
Molecular BioSystems | 2012
Kristen M. Naegle; Forest White; Douglas A. Lauffenburger; Michael B. Yaffe
Cell signaling networks propagate information from extracellular cues via dynamic modulation of protein-protein interactions in a context-dependent manner. Networks based on receptor tyrosine kinases (RTKs), for example, phosphorylate intracellular proteins in response to extracellular ligands, resulting in dynamic protein-protein interactions that drive phenotypic changes. Most commonly used methods for discovering these protein-protein interactions, however, are optimized for detecting stable, longer-lived complexes, rather than the type of transient interactions that are essential components of dynamic signaling networks such as those mediated by RTKs. Substrate phosphorylation downstream of RTK activation modifies substrate activity and induces phospho-specific binding interactions, resulting in the formation of large transient macromolecular signaling complexes. Since protein complex formation should follow the trajectory of events that drive it, we reasoned that mining phosphoproteomic datasets for highly similar dynamic behavior of measured phosphorylation sites on different proteins could be used to predict novel, transient protein-protein interactions that had not been previously identified. We applied this method to explore signaling events downstream of EGFR stimulation. Our computational analysis of robustly co-regulated phosphorylation sites, based on multiple clustering analysis of quantitative time-resolved mass-spectrometry phosphoproteomic data, not only identified known sitewise-specific recruitment of proteins to EGFR, but also predicted novel, a priori interactions. A particularly intriguing prediction of EGFR interaction with the cytoskeleton-associated protein PDLIM1 was verified within cells using co-immunoprecipitation and in situ proximity ligation assays. Our approach thus offers a new way to discover protein-protein interactions in a dynamic context- and phosphorylation site-specific manner.
Briefings in Bioinformatics | 2013
Roman Sloutsky; Nicolas Jimenez; S. Joshua Swamidass; Kristen M. Naegle
Clustering is a powerful and commonly used technique that organizes and elucidates the structure of biological data. Clustering data from gene expression, metabolomics and proteomics experiments has proven to be useful at deriving a variety of insights, such as the shared regulation or function of biochemical components within networks. However, experimental measurements of biological processes are subject to substantial noise-stemming from both technical and biological variability-and most clustering algorithms are sensitive to this noise. In this article, we explore several methods of accounting for noise when analyzing biological data sets through clustering. Using a toy data set and two different case studies-gene expression and protein phosphorylation-we demonstrate the sensitivity of clustering algorithms to noise. Several methods of accounting for this noise can be used to establish when clustering results can be trusted. These methods span a range of assumptions about the statistical properties of the noise and can therefore be applied to virtually any biological data source.
PLOS ONE | 2015
Alex S. Holehouse; Kristen M. Naegle
Background Protein post-translational modifications (PTMs) are an important aspect of protein regulation. The number of PTMs discovered within the human proteome, and other proteomes, has been rapidly expanding in recent years. As a consequence of the rate in which new PTMs are identified, analysis done in one year may result in different conclusions when repeated in subsequent years. Among the various functional questions pertaining to PTMs, one important relationship to address is the interplay between modifications and mutations. Specifically, because the linear sequence surrounding a modification site often determines molecular recognition, it is hypothesized that mutations near sites of PTMs may be more likely to result in a detrimental effect on protein function, resulting in the development of disease. Methods and Results We wrote an application programming interface (API) to make analysis of ProteomeScout, a comprehensive database of PTMs and protein information, easy and reproducible. We used this API to analyze the relationship between PTMs and human mutations associated with disease (based on the ‘Clinical Significance’ annotation from dbSNP). Proteins containing pathogenic mutations demonstrated a significant study bias which was controlled for by analyzing only well-studied proteins, based on their having at least one pathogenic mutation. We found that pathogenic mutations are significantly more likely to lie within eight amino acids of a phosphoserine, phosphotyrosine or ubiquitination site when compared to mutations in general, based on a Fisher’s Exact test. Despite the skew of pathogenic mutations occurring on positively charged arginines, we could not account for this relationship based only on residue type. Finally, we hypothesize a potential mechanism for a pathogenic mutation on RAF1, based on its proximity to a phosphorylation site, which represents a subtle regulation difference that may explain why its biochemical effect has failed to be uncovered previously. The combination of the API and a dynamically expanding PTM database will make the reanalysis of this question and other systems-level questions easier in the future.
Science Signaling | 2015
Kristen M. Naegle; Nancy R. Gough; Michael B. Yaffe
Biological reproducibility leads to discoveries about biological processes that constitute a fundamental truth about nature. This Focus tackles the issue of technical versus biological replicates, what constitutes appropriate biological replicates, and appropriate statistical analysis for data with small sample sizes.