Network


Latest external collaboration on country level. Dive into details by clicking on the dots.

Hotspot


Dive into the research topics where Christopher S. Oehmen is active.

Publication


Featured researches published by Christopher S. Oehmen.


Infection and Immunity | 2011

Computational Prediction of Type III and IV Secreted Effectors in Gram-Negative Bacteria

Jason E. McDermott; Abigail L. Corrigan; Elena S. Peterson; Christopher S. Oehmen; George S. Niemann; Eric D. Cambronne; Danna Sharp; Joshua N. Adkins; Ram Samudrala; Fred Heffron

ABSTRACT In this review, we provide an overview of the methods employed in four recent studies that described novel methods for computational prediction of secreted effectors from type III and IV secretion systems in Gram-negative bacteria. We present the results of these studies in terms of performance at accurately predicting secreted effectors and similarities found between secretion signals that may reflect biologically relevant features for recognition. We discuss the Web-based tools for secreted effector prediction described in these studies and announce the availability of our tool, the SIEVE server (http://www.sysbep.org/sieve). Finally, we assess the accuracies of the three type III effector prediction methods on a small set of proteins not known prior to the development of these tools that we recently discovered and validated using both experimental and computational approaches. Our comparison shows that all methods use similar approaches and, in general, arrive at similar conclusions. We discuss the possibility of an order-dependent motif in the secretion signal, which was a point of disagreement in the studies. Our results show that there may be classes of effectors in which the signal has a loosely defined motif and others in which secretion is dependent only on compositional biases. Computational prediction of secreted effectors from protein sequences represents an important step toward better understanding the interaction between pathogens and hosts.


Bioinformatics | 2008

A support vector machine model for the prediction of proteotypic peptides for accurate mass and time proteomics

Bobbie-Jo M. Webb-Robertson; William R. Cannon; Christopher S. Oehmen; Anuj R. Shah; Vidhya Gurumoorthi; Mary S. Lipton; Katrina M. Waters

MOTIVATION The standard approach to identifying peptides based on accurate mass and elution time (AMT) compares profiles obtained from a high resolution mass spectrometer to a database of peptides previously identified from tandem mass spectrometry (MS/MS) studies. It would be advantageous, with respect to both accuracy and cost, to only search for those peptides that are detectable by MS (proteotypic). RESULTS We present a support vector machine (SVM) model that uses a simple descriptor space based on 35 properties of amino acid content, charge, hydrophilicity and polarity for the quantitative prediction of proteotypic peptides. Using three independently derived AMT databases (Shewanella oneidensis, Salmonella typhimurium, Yersinia pestis) for training and validation within and across species, the SVM resulted in an average accuracy measure of 0.8 with a SD of <0.025. Furthermore, we demonstrate that these results are achievable with a small set of 12 variables and can achieve high proteome coverage. AVAILABILITY http://omics.pnl.gov/software/STEPP.php. SUPPLEMENTARY INFORMATION Supplementary data are available at Bioinformatics online.


Journal of Virology | 2011

Inhibition of Dengue Virus Infections in Cell Cultures and in AG129 Mice by a Small Interfering RNA Targeting a Highly Conserved Sequence

David A. Stein; Stuart T. Perry; Michael D. Buck; Christopher S. Oehmen; Matthew A. Fischer; Elizabeth A. Poore; Jessica L. Smith; Alissa M. Lancaster; Alec J. Hirsch; Mark K. Slifka; Jay A. Nelson; Sujan Shresta; Klaus Früh

ABSTRACT The dengue viruses (DENVs) exist as numerous genetic strains that are grouped into four antigenically distinct serotypes. DENV strains from each serotype can cause severe disease and threaten public health in tropical and subtropical regions worldwide. No licensed antiviral agent to treat DENV infections is currently available, and there is an acute need for the development of novel therapeutics. We found that a synthetic small interfering RNA (siRNA) (DC-3) targeting the highly conserved 5′ cyclization sequence (5′CS) region of the DENV genome reduced, by more than 100-fold, the titers of representative strains from each DENV serotype in vitro. To determine if DC-3 siRNA could inhibit DENV in vivo, an “in vivo-ready” version of DC-3 was synthesized and tested against DENV-2 by using a mouse model of antibody-dependent enhancement of infection (ADE)-induced disease. Compared with the rapid weight loss and 5-day average survival time of the control groups, mice receiving the DC-3 siRNA had an average survival time of 15 days and showed little weight loss for approximately 12 days. DC-3-treated mice also contained significantly less virus than control groups in several tissues at various time points postinfection. These results suggest that exogenously introduced siRNA combined with the endogenous RNA interference processing machinery has the capacity to prevent severe dengue disease. Overall, the data indicate that DC-3 siRNA represents a useful research reagent and has potential as a novel approach to therapeutic intervention against the genetically diverse dengue viruses.


Computational Biology and Chemistry | 2005

Brief communication: SVM-BALSA: Remote homology detection based on Bayesian sequence alignment

Bobbie-Jo M. Webb-Robertson; Christopher S. Oehmen; Melissa M. Matzke

Biopolymer sequence comparison to identify evolutionarily related proteins, or homologs, is one of the most common tasks in bioinformatics. Support vector machines (SVMs) represent a new approach to the problem in which statistical learning theory is employed to classify proteins into families, thus identifying homologous relationships. Current SVM approaches have been shown to outperform iterative profile methods, such as PSI-BLAST, for protein homology classification. In this study, we demonstrate that the utilization of a Bayesian alignment score, which accounts for the uncertainty of all possible alignments, in the SVM construction improves sensitivity compared to the traditional dynamic programming implementation over a benchmark dataset consisting of 54 unique protein families. The SVM-BALSA algorithms returns a higher area under the receiver operating characteristic (ROC) curves for 37 of the 54 families and achieves an improved overall performance curve at a significance level of 0.07.


Bioinformatics | 2013

ScalaBLAST 2.0

Christopher S. Oehmen; Douglas J. Baxter

Motivation: BLAST remains one of the most widely used tools in computational biology. The rate at which new sequence data is available continues to grow exponentially, driving the emergence of new fields of biological research. At the same time, multicore systems and conventional clusters are more accessible. ScalaBLAST has been designed to run on conventional multiprocessor systems with an eye to extreme parallelism, enabling parallel BLAST calculations using >16 000 processing cores with a portable, robust, fault-resilient design that introduces little to no overhead with respect to serial BLAST. Availability: ScalaBLAST 2.0 source code can be freely downloaded from http://omics.pnl.gov/software/ScalaBLAST.php. Contact: [email protected]


BMC Bioinformatics | 2010

Physicochemical property distributions for accurate and rapid pairwise protein homology detection

Bobbie-Jo M. Webb-Robertson; Kyle G Ratuiste; Christopher S. Oehmen

BackgroundThe challenge of remote homology detection is that many evolutionarily related sequences have very little similarity at the amino acid level. Kernel-based discriminative methods, such as support vector machines (SVMs), that use vector representations of sequences derived from sequence properties have been shown to have superior accuracy when compared to traditional approaches for the task of remote homology detection.ResultsWe introduce a new method for feature vector representation based on the physicochemical properties of the primary protein sequence. A distribution of physicochemical property scores are assembled from 4-mers of the sequence and normalized based on the null distribution of the property over all possible 4-mers. With this approach there is little computational cost associated with the transformation of the protein into feature space, and overall performance in terms of remote homology detection is comparable with current state-of-the-art methods. We demonstrate that the features can be used for the task of pairwise remote homology detection with improved accuracy versus sequence-based methods such as BLAST and other feature-based methods of similar computational cost.ConclusionsA protein feature method based on physicochemical properties is a viable approach for extracting features in a computationally inexpensive manner while retaining the sensitivity of SVM protein homology detection. Furthermore, identifying features that can be used for generic pairwise homology detection in lieu of family-based homology detection is important for applications such as large database searches and comparative genomics.


Computational Biology and Chemistry | 2008

Brief Communication: A feature vector integration approach for a generalized support vector machine pairwise homology algorithm

Bobbie-Jo M. Webb-Robertson; Christopher S. Oehmen; Anuj R. Shah

Due to the exponential growth of sequenced genomes, the need to quickly provide accurate annotation for existing and new sequences is paramount to facilitate biological research. Current sequence comparison approaches fail to detect homologous relationships when sequence similarity is low. Support vector machine (SVM) algorithms approach this problem by transforming all proteins into a feature space of equal dimension based on protein properties, such as sequence similarity scores against a basis set of proteins or motifs. This multivariate representation of the protein space is then used to build a classifier specific to a pre-defined protein family. However, this approach is not well suited to large-scale annotation. We have developed a SVM approach that formulates remote homology as a single classifier that answers the pairwise comparison problem by integrating the two feature vectors for a pair of sequences into a single vector representation that can be used to build a classifier that separates sequence pairs into homologs and non-homologs. This pairwise SVM approach significantly improves the task of remote homology detection on the benchmark dataset, quantified as the area under the receiver operating characteristic curve; 0.97 versus 0.73 and 0.70 for PSI-BLAST and Basic Local Alignment Search Tool (BLAST), respectively.


Computational Biology and Chemistry | 2007

Brief communication: Integrating subcellular location for improving machine learning models of remote homology detection in eukaryotic organisms

Anuj R. Shah; Christopher S. Oehmen; Jill Harper; Bobbie-Jo M. Webb-Robertson

A significant challenge in homology detection is to identify sequences that share a common evolutionary ancestor, despite significant primary sequence divergence. Remote homologs will often have less than 30% sequence identity, yet still retain common structural and functional properties. We demonstrate a novel method for identifying remote homologs using a support vector machine (SVM) classifier trained by fusing sequence similarity scores and subcellular location prediction. SVMs have been shown to perform well in a variety of applications where binary classification of data is the goal. At the same time, data fusion methods have been shown to be highly effective in enhancing discriminative power of data. Combining these two approaches in the application SVM-SimLoc resulted in identification of significantly more remote homologs (p-value<0.006) than using either sequence similarity or subcellular location independently.


intelligence and security informatics | 2013

A generalized bio-inspired method for discovering sequence-based signatures

Elena S. Peterson; Darren S. Curtis; Aaron R. Phillips; Jeremy R. Teuton; Christopher S. Oehmen

Many phenomena that we wish to discover are comprised of sequences of events or event primitives. Often signatures are constructed to identify such phenomena using either distributions or frequencies of attributes, or specific subsequences that are known to correlate to the phenomena. Distribution-based identification does not capture the essence of the sequence of behaviors and therefore may suffer from lack of specificity. At the other extreme, using specific subsequences to identify target phenomena is often too specific and suffers from lower sensitivity when natural variations arise in the phenomena, measuring process, or data analysis. We introduce here a method for discovering signatures for phenomena that are well characterized by sequences of event primitives. In this paper, we describe the steps taken and lessons learned in generalizing a sequence analysis method, BLAST, for use on non-biological datasets including expressing and operating on alphabets of varying length, constructing a reward/penalty model for arbitrary datasets, and discovering low complexity segments in sequence data by extending BLASTs native low-complexity estimating algorithms. We also present high-level overviews of several case studies that demonstrate the utility of this method to discovering signatures in a wide array of applications including network traffic, software analysis, server characterization, and others. Finally, we demonstrate how signatures discovered using this method can be expressed using a variety of model formalisms, each having its own relative benefit.


2013 6th International Symposium on Resilient Control Systems (ISRCS) | 2013

LINEBACkER: Bio-inspired data reduction toward real time network traffic analysis

Jeremy R. Teuton; Elena S. Peterson; Douglas J. Nordwall; Bora A. Akyol; Christopher S. Oehmen

One essential component of resilient cyber applications is the ability to detect adversaries and protect systems with the same flexibility adversaries will use to achieve their goals. Current detection techniques do not enable this degree of flexibility because most existing applications are built using exact or regular-expression matching to libraries of rule sets. Further, network traffic defies traditional cyber security approaches that focus on limiting access based on the use of passwords and examination of lists of installed or downloaded programs. These approaches do not readily apply to network traffic occurring beyond the access control point, and when the data in question are combined control and payload data of ever increasing speed and volume. Manual analysis of network traffic is not normally possible because of the magnitude of the data that is being exchanged and the length of time that this analysis takes. At the same time, using an exact matching scheme to identify malicious traffic in real time often fails because the lists against which such searches must operate grow too large. In this work, we propose an adaptation of biosequence alignment as an alternative method for cyber network detection based on similarity-measuring algorithms for gene sequence analysis. These methods are ideal because they were designed to identify similar but non-identical sequences. We demonstrate that our method is generally applicable to the problem of network traffic analysis by illustrating its use in two different areas based on different attributes of network traffic. Our approach provides a logical framework for organizing large collections of network data, prioritizing traffic of interest to human analysts, and makes it possible to discover traffic signatures without the bias introduced by expert-directed signature generation. Pattern recognition on reduced representations of network traffic offers a fast, efficient, and more robust way to detect anomalies.

Collaboration


Dive into the Christopher S. Oehmen's collaboration.

Top Co-Authors

Avatar

Bobbie-Jo M. Webb-Robertson

Pacific Northwest National Laboratory

View shared research outputs
Top Co-Authors

Avatar

Elena S. Peterson

Pacific Northwest National Laboratory

View shared research outputs
Top Co-Authors

Avatar

William R. Cannon

Pacific Northwest National Laboratory

View shared research outputs
Top Co-Authors

Avatar

Anuj R. Shah

Pacific Northwest National Laboratory

View shared research outputs
Top Co-Authors

Avatar

Aaron R. Phillips

National Institutes of Health

View shared research outputs
Top Co-Authors

Avatar

Darren S. Curtis

Pacific Northwest National Laboratory

View shared research outputs
Top Co-Authors

Avatar

Douglas J. Baxter

Pacific Northwest National Laboratory

View shared research outputs
Top Co-Authors

Avatar

Ehab Al-Shaer

University of North Carolina at Charlotte

View shared research outputs
Top Co-Authors

Avatar

Ian Gorton

Pacific Northwest National Laboratory

View shared research outputs
Top Co-Authors

Avatar

Joshua N. Adkins

Pacific Northwest National Laboratory

View shared research outputs
Researchain Logo
Decentralizing Knowledge