Usha Muppirala
Iowa State University
Network
Latest external collaboration on country level. Dive into details by clicking on the dots.
Publication
Featured researches published by Usha Muppirala.
BMC Bioinformatics | 2011
Usha Muppirala; Vasant G. Honavar; Drena Dobbs
BackgroundRNA-protein interactions (RPIs) play important roles in a wide variety of cellular processes, ranging from transcriptional and post-transcriptional regulation of gene expression to host defense against pathogens. High throughput experiments to identify RNA-protein interactions are beginning to provide valuable information about the complexity of RNA-protein interaction networks, but are expensive and time consuming. Hence, there is a need for reliable computational methods for predicting RNA-protein interactions.ResultsWe propose RPISeq, a family of classifiers for predicting R NA-p rotein i nteractions using only seq uence information. Given the sequences of an RNA and a protein as input, RPIseq predicts whether or not the RNA-protein pair interact. The RNA sequence is encoded as a normalized vector of its ribonucleotide 4-mer composition, and the protein sequence is encoded as a normalized vector of its 3-mer composition, based on a 7-letter reduced alphabet representation. Two variants of RPISeq are presented: RPISeq-SVM, which uses a Support Vector Machine (SVM) classifier and RPISeq-RF, which uses a Random Forest classifier. On two non-redundant benchmark datasets extracted from the Protein-RNA Interface Database (PRIDB), RPISeq achieved an AUC (Area Under the Receiver Operating Characteristic (ROC) curve) of 0.96 and 0.92. On a third dataset containing only mRNA-protein interactions, the performance of RPISeq was competitive with that of a published method that requires information regarding many different features (e.g., mRNA half-life, GO annotations) of the putative RNA and protein partners. In addition, RPISeq classifiers trained using the PRIDB data correctly predicted the majority (57-99%) of non-coding RNA-protein interactions in NPInter-derived networks from E. coli, S. cerevisiae, D. melanogaster, M. musculus, and H. sapiens.ConclusionsOur experiments with RPISeq demonstrate that RNA-protein interactions can be reliably predicted using only sequence-derived information. RPISeq offers an inexpensive method for computational construction of RNA-protein interaction networks, and should provide useful insights into the function of non-coding RNAs. RPISeq is freely available as a web-based server at http://pridb.gdcb.iastate.edu/RPISeq/.
Archive | 2017
Carla M. Mann; Usha Muppirala; Drena Dobbs
Experimental methods for identifying protein(s) bound by a specific promoter-associated RNA (paRNA) of interest can be expensive, difficult, and time-consuming. This chapter describes a general computational framework for identifying potential binding partners in RNA-protein complexes or RNA-protein interaction networks. Protocols for using three web-based tools to predict RNA-protein interaction partners are outlined. Also, tables listing additional webservers and software tools for predicting RNA-protein interactions, as well as databases that contain valuable information about known RNA-protein complexes and recognition sites for RNA-binding proteins, are provided. Although only one of the tools described, lncPro, was designed expressly to identify proteins that bind long noncoding RNAs (including paRNAs), all three approaches can be applied to predict potential binding partners for both coding and noncoding RNAs (ncRNAs).
bioRxiv | 2018
Rick E. Masonbrink; Thomas R. Maier; Usha Muppirala; Arun S. Seetharam; Etienne Lord; Parijat S. Juvale; Jeremy Schmutz; Nathan T. Johnson; Dmitry Korkin; Melissa G. Mitchum; Benjamin Mimee; Sebastian Eves-van den Akker; Matthew E. Hudson; Andrew J. Severin; Thomas J. Baum
Heterodera glycines, commonly referred to as the soybean cyst nematode (SCN), is an obligatory and sedentary plant parasite that causes over a billion-dollar yield loss to soybean production annually. Although there are genetic determinants that render soybean plants resistant to certain nematode genotypes, resistant soybean cultivars are increasingly ineffective because their multi-year usage has selected for virulent H. glycines populations. The parasitic success of H. glycines relies on the comprehensive re-engineering of an infection site into a syncytium, as well as the long-term suppression of host defense to ensure syncytial viability. At the forefront of these complex molecular interactions are effectors, the proteins secreted by H. glycines into host root tissues. The mechanisms of effector acquisition, diversification, and selection need to be understood before effective control strategies can be developed, but the lack of an annotated genome has been a major roadblock. Here, we use PacBio long-read technology to assemble a H. glycines genome of 738 contigs into 123Mb with annotations for 29,769 genes. The genome contains significant numbers of repeats (34%), tandem duplicates (18.7Mb), and horizontal gene transfer events (151 genes). Using previously published effector sequences, the newly generated H. glycines genome, and comparisons to other nematode genomes, we investigate the evolutionary mechanisms responsible for the emergence and diversification of effector genes.
bioRxiv | 2018
Hamid Bagheri; Usha Muppirala; Andrew J. Severin; Hridesh Rajan
Background Creating a computational infrastructure to analyze the wealth of information contained in data repositories that scales well is difficult due to significant barriers in organizing, extracting and analyzing relevant data. Shared Data Science Infrastructures like Boa can be used to more efficiently process and parse data contained in large data repositories. The main features of Boa are inspired from existing languages for data intensive computing and can easily integrate data from biological data repositories. Results Here, we present an implementation of Boa for Genomic research (BoaG) on a relatively small data repository: RefSeq’s 97,716 annotation (GFF) and assembly (FASTA) files and metadata. We used BoaG to query the entire RefSeq dataset and gain insight into the RefSeq genome assemblies and gene model annotations and show that assembly quality using the same assembler varies depending on species. Conclusions In order to keep pace with our ability to produce biological data, innovative methods are required. The Shared Data Science Infrastructure, BoaG, can provide greater access to researchers to efficiently explore data in ways previously not possible for anyone but the most well funded research groups. We demonstrate the efficiency of BoaG to explore the RefSeq database of genome assemblies and annotations to identify interesting features of gene annotation as a proof of concept for much larger datasets.
The Plant Cell | 2018
Renu Srivastava; Zhaoxia Li; Giulia Russo; Jie Tang; Ran Bi; Usha Muppirala; Sivanandan Chudalayandi; Andrew J. Severin; Mingze He; Samuel I. Vaitkevicius; Carolyn J. Lawrence-Dill; Peng Liu; Ann E. Stapleton; Diane C. Bassham; Federica Brandizzi; Stephen H. Howell
Persistent ER stress in maize activates a gene expression program interwoven among cellular events that transition from cell survival to cell death. The unfolded protein response (UPR) is a highly conserved response that protects plants from adverse environmental conditions. The UPR is elicited by endoplasmic reticulum (ER) stress, in which unfolded and misfolded proteins accumulate within the ER. Here, we induced the UPR in maize (Zea mays) seedlings to characterize the molecular events that occur over time during persistent ER stress. We found that a multiphasic program of gene expression was interwoven among other cellular events, including the induction of autophagy. One of the earliest phases involved the degradation by regulated IRE1-dependent RNA degradation (RIDD) of RNA transcripts derived from a family of peroxidase genes. RIDD resulted from the activation of the promiscuous ribonuclease activity of ZmIRE1 that attacks the mRNAs of secreted proteins. This was followed by an upsurge in expression of the canonical UPR genes indirectly driven by ZmIRE1 due to its splicing of Zmbzip60 mRNA to make an active transcription factor that directly upregulates many of the UPR genes. At the peak of UPR gene expression, a global wave of RNA processing led to the production of many aberrant UPR gene transcripts, likely tempering the ER stress response. During later stages of ER stress, ZmIRE1’s activity declined, as did the expression of survival modulating genes, Bax inhibitor1 and Bcl-2-associated athanogene7, amid a rising tide of cell death. Thus, in response to persistent ER stress, maize seedlings embark on a course of gene expression and cellular events progressing from adaptive responses to cell death.
Frontiers in Microbiology | 2015
Kathy T. Mou; Usha Muppirala; Andrew J. Severin; Tyson A. Clark; Matthew Boitano; Paul J. Plummer
New Phytologist | 2016
Jason B. Noon; Mingsheng Qi; Danielle N. Sill; Usha Muppirala; Sebastian Eves-van den Akker; Thomas R. Maier; Drena Dobbs; Melissa G. Mitchum; Tarek Hewezi; Thomas J. Baum
Journal of Computer Science & Systems Biology | 2013
Usha Muppirala; Benjamin A. Lewis; Drena Dobbs
BMC Genomics | 2015
Alessandra Lanubile; Usha Muppirala; Andrew J. Severin; Adriano Marocco; Gary P. Munkvold
BMC Genomics | 2015
Tian Lin; Coralie C. Lashbrook; Sung-Ki Cho; Nathaniel M. Butler; Pooja Sharma; Usha Muppirala; Andrew J. Severin; David J. Hannapel