Network


Latest external collaboration on country level. Dive into details by clicking on the dots.

Hotspot


Dive into the research topics where Guruprasad Kora is active.

Publication


Featured researches published by Guruprasad Kora.


Analytical Chemistry | 2014

MIDAS: A Database-Searching Algorithm for Metabolite Identification in Metabolomics

Yingfeng Wang; Guruprasad Kora; Benjamin P. Bowen; Chongle Pan

A database searching approach can be used for metabolite identification in metabolomics by matching measured tandem mass spectra (MS/MS) against the predicted fragments of metabolites in a database. Here, we present the open-source MIDAS algorithm (Metabolite Identification via Database Searching). To evaluate a metabolite-spectrum match (MSM), MIDAS first enumerates possible fragments from a metabolite by systematic bond dissociation, then calculates the plausibility of the fragments based on their fragmentation pathways, and finally scores the MSM to assess how well the experimental MS/MS spectrum from collision-induced dissociation (CID) is explained by the metabolites predicted CID MS/MS spectrum. MIDAS was designed to search high-resolution tandem mass spectra acquired on time-of-flight or Orbitrap mass spectrometer against a metabolite database in an automated and high-throughput manner. The accuracy of metabolite identification by MIDAS was benchmarked using four sets of standard tandem mass spectra from MassBank. On average, for 77% of original spectra and 84% of composite spectra, MIDAS correctly ranked the true compounds as the first MSMs out of all MetaCyc metabolites as decoys. MIDAS correctly identified 46% more original spectra and 59% more composite spectra at the first MSMs than an existing database-searching algorithm, MetFrag. MIDAS was showcased by searching a published real-world measurement of a metabolome from Synechococcus sp. PCC 7002 against the MetaCyc metabolite database. MIDAS identified many metabolites missed in the previous study. MIDAS identifications should be considered only as candidate metabolites, which need to be confirmed using standard compounds. To facilitate manual validation, MIDAS provides annotated spectra for MSMs and labels observed mass spectral peaks with predicted fragments. The database searching and manual validation can be performed online at http://midas.omicsbio.org.


Standards in Genomic Sciences | 2014

Quality scores for 32,000 genomes

Miriam Land; Doug Hyatt; Se-Ran Jun; Guruprasad Kora; Loren Hauser; Oksana Lukjancenko; David W. Ussery

BackgroundMore than 80% of the microbial genomes in GenBank are of ‘draft’ quality (12,553 draft vs. 2,679 finished, as of October, 2013). We have examined all the microbial DNA sequences available for complete, draft, and Sequence Read Archive genomes in GenBank as well as three other major public databases, and assigned quality scores for more than 30,000 prokaryotic genome sequences.ResultsScores were assigned using four categories: the completeness of the assembly, the presence of full-length rRNA genes, tRNA composition and the presence of a set of 102 conserved genes in prokaryotes. Most (~88%) of the genomes had quality scores of 0.8 or better and can be safely used for standard comparative genomics analysis. We compared genomes across factors that may influence the score. We found that although sequencing depth coverage of over 100x did not ensure a better score, sequencing read length was a better indicator of sequencing quality. With few exceptions, most of the 30,000 genomes have nearly all the 102 essential genes.ConclusionsThe score can be used to set thresholds for screening data when analyzing “all published genomes” and reference data is either not available or not applicable. The scores highlighted organisms for which commonly used tools do not perform well. This information can be used to improve tools and to serve a broad group of users as more diverse organisms are sequenced. Unexpectedly, the comparison of predicted tRNAs across 15,000 high quality genomes showed that anticodons beginning with an ‘A’ (codons ending with a ‘U’) are almost non-existent, with the exception of one arginine codon (CGU); this has been noted previously in the literature for a few genomes, but not with the depth found here.


BMC Evolutionary Biology | 2014

Functional phylogenomics analysis of bacteria and archaea using consistent genome annotation with UniFam

Juanjuan Chai; Guruprasad Kora; Tae-Hyuk Ahn; Doug Hyatt; Chongle Pan

BackgroundPhylogenetic studies have provided detailed knowledge on the evolutionary mechanisms of genes and species in Bacteria and Archaea. However, the evolution of cellular functions, represented by metabolic pathways and biological processes, has not been systematically characterized. Many clades in the prokaryotic tree of life have now been covered by sequenced genomes in GenBank. This enables a large-scale functional phylogenomics study of many computationally inferred cellular functions across all sequenced prokaryotes.ResultsA total of 14,727 GenBank prokaryotic genomes were re-annotated using a new protein family database, UniFam, to obtain consistent functional annotations for accurate comparison. The functional profile of a genome was represented by the biological process Gene Ontology (GO) terms in its annotation. The GO term enrichment analysis differentiated the functional profiles between selected archaeal taxa. 706 prokaryotic metabolic pathways were inferred from these genomes using Pathway Tools and MetaCyc. The consistency between the distribution of metabolic pathways in the genomes and the phylogenetic tree of the genomes was measured using parsimony scores and retention indices. The ancestral functional profiles at the internal nodes of the phylogenetic tree were reconstructed to track the gains and losses of metabolic pathways in evolutionary history.ConclusionsOur functional phylogenomics analysis shows divergent functional profiles of taxa and clades. Such function-phylogeny correlation stems from a set of clade-specific cellular functions with low parsimony scores. On the other hand, many cellular functions are sparsely dispersed across many clades with high parsimony scores. These different types of cellular functions have distinct evolutionary patterns reconstructed from the prokaryotic tree.


Functional & Integrative Genomics | 2010

Conserved synteny at the protein family level reveals genes underlying Shewanella species' cold tolerance and predicts their novel phenotypes.

Tatiana V. Karpinets; Anna Obraztsova; Yanbing Wang; Denise Schmoyer; Guruprasad Kora; Byung H. Park; Margrethe H. Serres; Margaret F. Romine; Miriam Land; Terence B. Kothe; Jim K. Fredrickson; Kenneth H. Nealson; Edward C. Uberbacher

Bacteria of the genus Shewanella can thrive in different environments and demonstrate significant variability in their metabolic and ecophysiological capabilities including cold and salt tolerance. Genomic characteristics underlying this variability across species are largely unknown. In this study, we address the problem by a comparison of the physiological, metabolic, and genomic characteristics of 19 sequenced Shewanella species. We have employed two novel approaches based on association of a phenotypic trait with the number of the trait-specific protein families (Pfam domains) and on the conservation of synteny (order in the genome) of the trait-related genes. Our first approach is top-down and involves experimental evaluation and quantification of the species’ cold tolerance followed by identification of the correlated Pfam domains and genes with a conserved synteny. The second, a bottom-up approach, predicts novel phenotypes of the species by calculating profiles of each Pfam domain among their genomes and following pair-wise correlation of the profiles and their network clustering. Using the first approach, we find a link between cold and salt tolerance of the species and the presence in the genome of a Na+/H+ antiporter gene cluster. Other cold-tolerance-related genes include peptidases, chemotaxis sensory transducer proteins, a cysteine exporter, and helicases. Using the bottom-up approach, we found several novel phenotypes in the newly sequenced Shewanella species, including degradation of aromatic compounds by an aerobic hybrid pathway in Shewanella woodyi, degradation of ethanolamine by Shewanella benthica, and propanediol degradation by Shewanella putrefaciens CN32 and Shewanella sp. W3-18-1.


Journal of Physics: Conference Series | 2006

High performance statistical computing with parallel R: applications to biology and climate modelling

Nagiza F. Samatova; Marcia L. Branstetter; Auroop R. Ganguly; Robert L. Hettich; Shiraj Khan; Guruprasad Kora; Jiangtian Li; Xiaosong Ma; Chongle Pan; Arie Shoshani; Srikanth B. Yoginath

Ultrascale computing and high-throughput experimental technologies have enabled the production of scientific data about complex natural phenomena. With this opportunity, comes a new problem – the massive quantities of data so produced. Answers to fundamental questions about the nature of those phenomena remain largely hidden in the produced data. The goal of this work is to provide a scalable high performance statistical data analysis framework to help scientists perform interactive analyses of these raw data to extract knowledge. Towards this goal we have been developing an open source parallel statistical analysis package, called Parallel R, that lets scientists employ a wide range of statistical analysis routines on high performance shared and distributed memory architectures without having to deal with the intricacies of parallelizing these routines.


PLOS Computational Biology | 2010

Conserved synteny at the protein family level reveals genes underlying Shewanella species' cold tolerance and predicts their novel phenotypes

Tatiana V. Karpinets; Anna Obraztsova; Yanbing Wang; Denise Schmoyer; Guruprasad Kora; Byung H. Park; Margrethe H. Serres; Margaret F. Romine; Miriam Land; Terence B. Kothe; James K. Fredrickson; Kenneth H. Nealson; Edward C. Uberbacher

Bacteria of the genus Shewanella can thrive in different environments and demonstrate significant variability in their metabolic and ecophysiological capabilities including cold and salt tolerance. Genomic characteristics underlying this variability across species are largely unknown. In this study, we address the problem by a comparison of the physiological, metabolic, and genomic characteristics of 19 sequenced Shewanella species. We have employed two novel approaches based on association of a phenotypic trait with the number of the trait-specific protein families (Pfam domains) and on the conservation of synteny (order in the genome) of the trait-related genes. Our first approach is top-down and involves experimental evaluation and quantification of the species’ cold tolerance followed by identification of the correlated Pfam domains and genes with a conserved synteny. The second, a bottom-up approach, predicts novel phenotypes of the species by calculating profiles of each Pfam domain among their genomes and following pair-wise correlation of the profiles and their network clustering. Using the first approach, we find a link between cold and salt tolerance of the species and the presence in the genome of a Na+/H+ antiporter gene cluster. Other cold-tolerance-related genes include peptidases, chemotaxis sensory transducer proteins, a cysteine exporter, and helicases. Using the bottom-up approach, we found several novel phenotypes in the newly sequenced Shewanella species, including degradation of aromatic compounds by an aerobic hybrid pathway in Shewanella woodyi, degradation of ethanolamine by Shewanella benthica, and propanediol degradation by Shewanella putrefaciens CN32 and Shewanella sp. W3-18-1.


Bioinformation | 2009

Shewregdb: database and visualization environment for experimental and predicted regulatory information in Shewanella oneidensis mr-1.

Mustafa H Syed; Tatiana V. Karpinets; Michael R. Leuze; Guruprasad Kora; Margaret R. Romine; Edward C. Uberbacher

Shewanella oneidensis MR-1 is an important model organism for environmental research as it has an exceptional metabolic and respiratory versatility regulated by a complex regulatory network. We have developed a database to collect experimental and computational data relating to regulation of gene and protein expression, and, a visualization environment that enables integration of these data types. The regulatory information in the database includes predictions of DNA regulator binding sites, sigma factor binding sites, transcription units, operons, promoters, and RNA regulators including non-coding RNAs, riboswitches, and different types of terminators. Availability http://shewanella-knowledgebase.org:8080/Shewanella/gbrowserLanding.jsp


Analytical Chemistry | 2006

ProRata: A quantitative proteomics program for accurate protein abundance ratio estimation with confidence interval evaluation

Chongle Pan; Guruprasad Kora; W. Hayes McDonald; Dave L Tabb; Nathan C. VerBerkmoes; Gregory B. Hurst; Dale A. Pelletier; Nagiza F. Samatova; Robert L. Hettich


Analytical Chemistry | 2006

Robust Estimation of Peptide Abundance Ratios and Rigorous Scoring of Their Variability and Bias in Quantitative Shotgun Proteomics

Chongle Pan; Guruprasad Kora; Dave L Tabb; Dale A. Pelletier; W. Hayes McDonald; Gregory B. Hurst; Robert L. Hettich; Nagiza F. Samatova


Journal of Parallel and Distributed Computing | 2011

Transparent runtime parallelization of the R scripting language

Jiangtian Li; Xiaosong Ma; Srikanth B. Yoginath; Guruprasad Kora; Nagiza F. Samatova

Collaboration


Dive into the Guruprasad Kora's collaboration.

Top Co-Authors

Avatar

Nagiza F. Samatova

North Carolina State University

View shared research outputs
Top Co-Authors

Avatar

Chongle Pan

Oak Ridge National Laboratory

View shared research outputs
Top Co-Authors

Avatar

Srikanth B. Yoginath

Oak Ridge National Laboratory

View shared research outputs
Top Co-Authors

Avatar

Paul Breimyer

North Carolina State University

View shared research outputs
Top Co-Authors

Avatar

Edward C. Uberbacher

Oak Ridge National Laboratory

View shared research outputs
Top Co-Authors

Avatar

Jiangtian Li

North Carolina State University

View shared research outputs
Top Co-Authors

Avatar

Miriam Land

Oak Ridge National Laboratory

View shared research outputs
Top Co-Authors

Avatar

Robert L. Hettich

Oak Ridge National Laboratory

View shared research outputs
Top Co-Authors

Avatar

Tatiana V. Karpinets

Oak Ridge National Laboratory

View shared research outputs
Top Co-Authors

Avatar

Xiaosong Ma

North Carolina State University

View shared research outputs
Researchain Logo
Decentralizing Knowledge