Kerstin Scheubert
University of Jena
Network
Latest external collaboration on country level. Dive into details by clicking on the dots.
Publication
Featured researches published by Kerstin Scheubert.
Journal of Cheminformatics | 2013
Kerstin Scheubert; Franziska Hufsky; Sebastian Böcker
The identification of small molecules from mass spectrometry (MS) data remains a major challenge in the interpretation of MS data. This review covers the computational aspects of identifying small molecules, from the identification of a compound searching a reference spectral library, to the structural elucidation of unknowns. In detail, we describe the basic principles and pitfalls of searching mass spectral reference libraries. Determining the molecular formula of the compound can serve as a basis for subsequent structural elucidation; consequently, we cover different methods for molecular formula identification, focussing on isotope pattern analysis. We then discuss automated methods to deal with mass spectra of compounds that are not present in spectral libraries, and provide an insight into de novo analysis of fragmentation spectra using fragmentation trees. In addition, this review shortly covers the reconstruction of metabolic networks using MS data. Finally, we list available software for different steps of the analysis pipeline.
research in computational molecular biology | 2011
Kerstin Scheubert; Franziska Hufsky; Florian Rasche; Sebastian Böcker
Since metabolites cannot be predicted from the genome sequence, high-throughput de novo identification of small molecules is highly sought. Mass spectrometry (MS) in combination with a fragmentation technique is commonly used for this task. Unfortunately, automated analysis of such data is in its infancy. Recently, fragmentation trees have been proposed as an analysis tool for such data. Additional fragmentation steps (MS(n)) reveal more information about the molecule. We propose to use MS(n) data for the computation of fragmentation trees, and present the Colorful Subtree Closure problem to formalize this task: There, we search for a colorful subtree inside a vertex-colored graph, such that the weight of the transitive closure of the subtree is maximal. We give several negative results regarding the tractability and approximability of this and related problems. We then present an exact dynamic programming algorithm, which is parameterized by the number of colors in the graph and is swift in practice. Evaluation of our method on a dataset of 45 reference compounds showed that the quality of constructed fragmentation trees is improved by using MS(n) instead of MS² measurements.
Nature Communications | 2017
Kerstin Scheubert; Franziska Hufsky; Daniel Petras; Mingxun Wang; Louis-Félix Nothias; Kai Dührkop; Nuno Bandeira; Pieter C. Dorrestein; Sebastian Böcker
The annotation of small molecules in untargeted mass spectrometry relies on the matching of fragment spectra to reference library spectra. While various spectrum-spectrum match scores exist, the field lacks statistical methods for estimating the false discovery rates (FDR) of these annotations. We present empirical Bayes and target-decoy based methods to estimate the false discovery rate (FDR) for 70 public metabolomics data sets. We show that the spectral matching settings need to be adjusted for each project. By adjusting the scoring parameters and thresholds, the number of annotations rose, on average, by +139% (ranging from −92 up to +5705%) when compared with a default parameter set available at GNPS. The FDR estimation methods presented will enable a user to assess the scoring criteria for large scale analysis of mass spectrometry based metabolomics data that has been essential in the advancement of proteomics, transcriptomics, and genomics science.Matching fragment spectra to reference library spectra is an important procedure for annotating small molecules in untargeted mass spectrometry based metabolomics studies. Here, the authors develop strategies to estimate false discovery rates (FDR) by empirical Bayes and target-decoy based methods which enable a user to define the scoring criteria for spectral matching.
Metabolites | 2013
Kai Dührkop; Kerstin Scheubert; Sebastian Böcker
We present results of the SIRIUS2 submission to the 2012 CASMI contest. Only results for Category 1 (molecular formula identification) were submitted. The SIRIUS method and the parameters used are briefly described, followed by detailed analysis of the results and a discussion of cases where SIRIUS2 was unable to come up with the correct molecular formula. SIRIUS2 returns consistently high quality results, with the exception of fragmentation pattern analysis of time-of-flight data. We then discuss possibilities for further improving SIRIUS2 in the future.
Rapid Communications in Mass Spectrometry | 2011
Anja Baumgaertel; Kerstin Scheubert; Bernhard Pietsch; Kristian Kempe; Anna C. Crecelius; Sebastian Böcker; Ulrich S. Schubert
The manual interpretation of tandem mass spectra of synthetic polymers is very time-consuming. Therefore, a new software tool was developed to accelerate the interpretation of spectra obtained without requiring any further knowledge about the polymer class or the fragmentation behavior under high-energy collision-induced dissociation (CID) conditions. The software only requires an alphabetical list of elements and a peak list of the measured substance as an xml file for the evaluation of the chosen mass spectrum. Tandem mass spectra of different homopolymers, like poly(2-oxazoline)s, poly(ethylene glycol) and poly(styrene), were interpreted by the new software tool. This contribution describes a fast and automated software tool for the rapid analysis of homopolymers.
Journal of Ethnopharmacology | 2014
Mayuri Napagoda; Jana Gerstmeier; Sandra Wesely; Sven Popella; Sybille Lorenz; Kerstin Scheubert; Aleš Svatoš; Oliver Werz
ETHNOPHARMACOLOGICAL RELEVANCE The perennial herb Plectranthus zeylanicus Benth is extensively used in traditional medicine in Sri Lanka and South India for treating inflammatory conditions, but pharmacological features of Plectranthus zeylanicus are hardly explored in order to understand and rationalize its use in ethnomedicine. As 5-lipoxygenase (5-LO) is a key enzyme in inflammatory disorders such as asthma or atherosclerosis, we investigated 5-LO inhibition by Plectranthus zeylanicus extracts and analyzed relevant constituents. MATERIALS AND METHODS We applied cell-free and cell-based assays to investigate suppression of 5-LO activity. Cell viability, radical scavenger activities, and inhibition of reactive oxygen species formation (ROS) in neutrophils were analysed to exclude unspecific cytotoxic or antioxidant effects. Constituents of the extracts were characterized by bioassay-guided fractionation and by analysis using gas or liquid chromatography coupled to mass spectrometric (Orbitrap) analysis. RESULTS Extracts of Plectranthus zeylanicus prepared with n-hexane or dichloromethane potently suppressed 5-LO activity in stimulated human neutrophils (IC50=6.6 and 12µg/ml, respectively) and inhibited isolated human recombinant 5-LO (IC50=0.7 and 1.2µg/ml, respectively). In contrast, no significant radical scavenging activity or suppression of ROS formation was observed, and neutrophil viability was unaffected. Besides ubiquitously occurring ingredients, coleone P, cinncassiol A and C, and callistric acid were identified as constituents in the most active fraction. CONCLUSIONS Together, potent inhibition of 5-LO activity, without concomitant anti-oxidant activity and cytotoxic effects, rationalizes the ethnopharmacological use of Plectranthus zeylanicus as anti-inflammatory remedy. Modern chromatographic/mass spectrometric analysis reveals discrete chemical structures of relevant constituents.
bioRxiv | 2017
Kerstin Scheubert; Franziska Hufsky; Daniel Petras; Mingxun Wang; Louis-Félix Nothias; Kai Duehrkop; Nuno Bandeira; Pieter C. Dorrestein; Sebastian Boecker
The annotation of small molecules in untargeted mass spectrometry relies on the matching of fragment spectra to reference library spectra. While various spectrum-spectrum match scores exist, the field lacks statistical methods for estimating the false discovery rates (FDR) of these annotations. We present empirical Bayes and target-decoy based methods to estimate the false discovery rate. Relying on estimations of false discovery rates, we explore the effect of different spectrum-spectrum match criteria on the number and the nature of the molecules annotated. We show that the spectral matching settings needs to be adjusted for each project. By adjusting the scoring parameters and thresholds, the number of annotations rose, on average, by +139% (ranging from −92% up to +5705%) when compared to a default parameter set available at GNPS. The FDR estimation methods presented will enable a user to define the scoring criteria for large scale analysis of untargeted small molecule data that has been essential in the advancement of large scale proteomics, transcriptomics, and genomics science.
workshop on algorithms in bioinformatics | 2014
Kerstin Scheubert; Franziska Hufsky; Sebastian Böcker
Mass spectrometry (MS) in combination with a fragmentation technique is the method of choice for analyzing small molecules in high throughput experiments. The automated interpretation of such data is highly non-trivial. Recently, fragmentation trees have been introduced for de novo analysis of tandem fragmentation spectra (MS2), describing the fragmentation process of the molecule. Multiple-stage MS (MS n ) reveals additional information about the dependencies between fragments. Unfortunately, the computational analysis of MS n data using fragmentation trees turns out to be more challenging than for tandem mass spectra.
Polymers | 2016
Martin S. Engler; Kerstin Scheubert; Ulrich S. Schubert; Sebastian Böcker
For many years, copolymerization has been studied using mathematical and statistical models. Here, we present new Markov chain models for copolymerization kinetics: the Bernoulli and Geometric models. They model copolymer synthesis as a random process and are based on a basic reaction scheme. In contrast to previous Markov chain approaches to copolymerization, both models take variable chain lengths and time-dependent monomer probabilities into account and allow for computing sequence likelihoods and copolymer fingerprints. Fingerprints can be computed from copolymer mass spectra, potentially allowing us to estimate the model parameters from measured fingerprints. We compare both models against Monte Carlo simulations. We find that computing the models is fast and memory efficient.
Polymers | 2017
Martin S. Engler; Kerstin Scheubert; Ulrich S. Schubert; Sebastian Böcker
The geometric copolymerization model is a recently introduced statistical Markov chain model. Here, we investigate its practicality. First, several approaches to identify the optimal model parameters from observed copolymer fingerprints are evaluated using Monte Carlo simulated data. Directly optimizing the parameters is robust against noise but has impractically long running times. A compromise between robustness and running time is found by exploiting the relationship between monomer concentrations calculated by ordinary differential equations and the geometric model. Second, we investigate the applicability of the model to copolymerizations beyond living polymerization and show that the model is useful for copolymerizations involving termination and depropagation reactions.