Network


Latest external collaboration on country level. Dive into details by clicking on the dots.

Hotspot


Dive into the research topics where Nitish K. Mishra is active.

Publication


Featured researches published by Nitish K. Mishra.


BMC Bioinformatics | 2010

AntiBP2: improved version of antibacterial peptide prediction

Sneh Lata; Nitish K. Mishra; Gajendra P. S. Raghava

BackgroundAntibacterial peptides are one of the effecter molecules of innate immune system. Over the last few decades several antibacterial peptides have successfully approved as drug by FDA, which has prompted an interest in these antibacterial peptides. In our recent study we analyzed 999 antibacterial peptides, which were collected from Antibacterial Peptide Database (APD). We have also developed methods to predict and classify these antibacterial peptides using Support Vector Machine (SVM).ResultsDuring analysis we observed that certain residues are preferred over other in antibacterial peptide, particularly at the N and C terminus. These observation and increased data of antibacterial peptide in APD encouraged us to again develop a new and more robust method for predicting antibacterial peptides in protein from their amino acid sequence or given peptide have antibacterial properties or not. First, the binary patterns of the 15 N terminus residues were used for predicting antibacterial peptide using SVM and achieved accuracy of 85.46% with 0.705 Mathews Correlation Coefficient (MCC). Then we used the binary pattern of 15 C terminus residues and achieved accuracy of 85.05% with 0.701 MCC, latter on we developed prediction method by combining N & C terminus and achieved an accuracy of 91.64% with 0.831 MCC. Finally we developed SVM based model using amino acid composition of whole peptide and achieved 92.14% accuracy with MCC 0.843. In this study we used five-fold cross validation technique to develop all these models and tested the performance of these models on an independent dataset. We further classify antibacterial peptides according to their sources and achieved an overall accuracy of 98.95%. We further classify antibacterial peptides in their respective family and got a satisfactory result.ConclusionAmong antibacterial peptides, there is preference for certain residues at N and C terminus, which helps to discriminate them from non-antibacterial peptides. Amino acid composition of antibacterial peptides helps to demarcate them from non-antibacterial peptide and their further classification in source and family. Antibp2 will be helpful in discovering efficacious antibacterial peptide, which we hope will be helpful against antibiotics resistant bacteria. We also developed user friendly web server for the biological community.


BMC Pharmacology | 2010

Prediction of cytochrome P450 isoform responsible for metabolizing a drug molecule

Nitish K. Mishra; Sandhya Agarwal; Gajendra P. S. Raghava

BackgroundDifferent isoforms of Cytochrome P450 (CYP) metabolized different types of substrates (or drugs molecule) and make them soluble during biotransformation. Therefore, fate of any drug molecule depends on how they are treated or metabolized by CYP isoform. There is a need to develop models for predicting substrate specificity of major isoforms of P450, in order to understand whether a given drug will be metabolized or not. This paper describes an in-silico method for predicting the metabolizing capability of major isoforms (e.g. CYP 3A4, 2D6, 1A2, 2C9 and 2C19).ResultsAll models were trained and tested on 226 approved drug molecules. Firstly, 2392 molecular descriptors for each drug molecule were calculated using various softwares. Secondly, best 41 descriptors were selected using general and genetic algorithm. Thirdly, Support Vector Machine (SVM) based QSAR models were developed using 41 best descriptors and achieved an average accuracy of 86.02%, evaluated using fivefold cross-validation. We have also evaluated the performance of our model on an independent dataset of 146 drug molecules and achieved average accuracy 70.55%. In addition, SVM based models were developed using 26 Chemistry Development Kit (CDK) molecular descriptors and achieved an average accuracy of 86.60%.ConclusionsThis study demonstrates that SVM based QSAR model can predict substrate specificity of major CYP isoforms with high accuracy. These models can be used to predict isoform responsible for metabolizing a drug molecule. Thus these models can used to understand whether a molecule will be metabolized or not. This is possible to develop highly accurate models for predicting substrate specificity of major isoforms using CDK descriptors. A web server MetaPred has been developed for predicting metabolizing isoform of a drug molecule http://crdd.osdd.net/raghava/metapred/.


PLOS ONE | 2014

Prediction of Membrane Transport Proteins and Their Substrate Specificities Using Primary Sequence Information

Nitish K. Mishra; Junil Chang; Patrick Xuechun Zhao

Background Membrane transport proteins (transporters) move hydrophilic substrates across hydrophobic membranes and play vital roles in most cellular functions. Transporters represent a diverse group of proteins that differ in topology, energy coupling mechanism, and substrate specificity as well as sequence similarity. Among the functional annotations of transporters, information about their transporting substrates is especially important. The experimental identification and characterization of transporters is currently costly and time-consuming. The development of robust bioinformatics-based methods for the prediction of membrane transport proteins and their substrate specificities is therefore an important and urgent task. Results Support vector machine (SVM)-based computational models, which comprehensively utilize integrative protein sequence features such as amino acid composition, dipeptide composition, physico-chemical composition, biochemical composition, and position-specific scoring matrices (PSSM), were developed to predict the substrate specificity of seven transporter classes: amino acid, anion, cation, electron, protein/mRNA, sugar, and other transporters. An additional model to differentiate transporters from non-transporters was also developed. Among the developed models, the biochemical composition and PSSM hybrid model outperformed other models and achieved an overall average prediction accuracy of 76.69% with a Mathews correlation coefficient (MCC) of 0.49 and a receiver operating characteristic area under the curve (AUC) of 0.833 on our main dataset. This model also achieved an overall average prediction accuracy of 78.88% and MCC of 0.41 on an independent dataset. Conclusions Our analyses suggest that evolutionary information (i.e., the PSSM) and the AAIndex are key features for the substrate specificity prediction of transport proteins. In comparison, similarity-based methods such as BLAST, PSI-BLAST, and hidden Markov models do not provide accurate predictions for the substrate specificity of membrane transport proteins. TrSSP: The Transporter Substrate Specificity Prediction Server, a web server that implements the SVM models developed in this paper, is freely available at http://bioinfo.noble.org/TrSSP.


BMC Bioinformatics | 2010

Prediction of FAD interacting residues in a protein from its primary sequence using evolutionary information

Nitish K. Mishra; Gajendra P. S. Raghava

BackgroundFlavin binding proteins (FBP) plays a critical role in several biological functions such as electron transport system (ETS). These flavoproteins contain very tightly bound, sometimes covalently, flavin adenine dinucleotide (FAD) or flavin mono nucleotide (FMN). The interaction between flavin nucleotide and amino acids of flavoprotein is essential for their functionality. Thus identification of FAD interacting residues in a FBP is an important step for understanding their function and mechanism.ResultsIn this study, we describe models developed for predicting FAD interacting residues using 15, 17 and 19 window pattern. Support vector machine (SVM) based models have been developed using binary pattern of amino acid sequence of protein and achieved maximum accuracy 69.65% with Mathews Correlation Coefficient (MCC) 0.39 and Area Under Curve (AUC) 0.773. The performance of these models have been improved significantly from 69.65% to 82.86% with MCC 0.66 and AUC 0.904, when evolutionary information is used as input in SVM. The evolutionary information was generated in form of position specific score matrix (PSSM) profile by using PSI-BLAST at e-value 0.001. All models were developed on 198 non-redundant FAD binding protein chains containing 5172 FAD interacting residues and evaluated using fivefold cross-validation technique.ConclusionThis study suggests that evolutionary information of 17 amino acid patterns perform best for FAD interacting residues prediction. We also developed a web server which predicts FAD interacting residues in a protein which is freely available for academics.


PLOS ONE | 2011

Identification of mannose interacting residues using local composition.

Sandhya Agarwal; Nitish K. Mishra; Harinder Singh; Gajendra P. S. Raghava

Background Mannose binding proteins (MBPs) play a vital role in several biological functions such as defense mechanisms. These proteins bind to mannose on the surface of a wide range of pathogens and help in eliminating these pathogens from our body. Thus, it is important to identify mannose interacting residues (MIRs) in order to understand mechanism of recognition of pathogens by MBPs. Results This paper describes modules developed for predicting MIRs in a protein. Support vector machine (SVM) based models have been developed on 120 mannose binding protein chains, where no two chains have more than 25% sequence similarity. SVM models were developed on two types of datasets: 1) main dataset consists of 1029 mannose interacting and 1029 non-interacting residues, 2) realistic dataset consists of 1029 mannose interacting and 10320 non-interacting residues. In this study, firstly, we developed standard modules using binary and PSSM profile of patterns and got maximum MCC around 0.32. Secondly, we developed SVM modules using composition profile of patterns and achieved maximum MCC around 0.74 with accuracy 86.64% on main dataset. Thirdly, we developed a model on a realistic dataset and achieved maximum MCC of 0.62 with accuracy 93.08%. Based on this study, a standalone program and web server have been developed for predicting mannose interacting residues in proteins (http://www.imtech.res.in/raghava/premier/). Conclusions Compositional analysis of mannose interacting and non-interacting residues shows that certain types of residues are preferred in mannose interaction. It was also observed that residues around mannose interacting residues have a preference for certain types of residues. Composition of patterns/peptide/segment has been used for predicting MIRs and achieved reasonable high accuracy. It is possible that this novel strategy may be effective to predict other types of interacting residues. This study will be useful in annotating the function of protein as well as in understanding the role of mannose in the immune system.


Letters in Drug Design & Discovery | 2011

Prediction of specificity and cross-reactivity of kinase inhibitors

Nitish K. Mishra; Gajendra P. S. Raghava

Designing kinase inhibitors is always an area of interest because kinases are involved in many diseases. In the last one decade a large number of kinase inhibitors have been launched successfully; six inhibitors have been approved by FDA and more are under clinical trials. Cross-reactivity or off-target is one of the major problems in designing inhibitors against protein kinases; as human, have more than 500 kinases with high sequence similarity. In this study an attempt has been made to develop a model for predicting specificity and cross-reactivity of kinase inhibitors. The dataset used for test- ing and training consists of binding affinities of 20 chemical kinase inhibitors with protein kinases. We developed QSAR based SVM models for predicting binding affinity of an inhibitor against protein kinases using most relevant 5,10 and 15 structure descriptors and achieving average correlation of 0.64, 0.488 and 0.442 respectively. In or- der to predict specificity and cross-reactivity of an inhibitor, we developed 16 QSAR based SVM models for 16 protein kinases; one model for each kinase. We achieved average correlation 0.719 between actual and predicted binding affinity using kinase specific models. Based on the above study a web server DMKPred has been developed for predicting binding affinity of a drug molecule with 16 kinases. The SVM based model used in this study can be used to predict kinase spe- cific inhibitors. This study will be useful for designing kinase specific inhibitors.


Protein and Peptide Letters | 2007

Support Vector Machine Based Prediction of Glutathione S-Transferase Proteins

Nitish K. Mishra; Manish Kumar; Gajendra P. S. Raghava

Glutathione S-transferase (GST) proteins play vital role in living organism that includes detoxification of exogenous and endogenous chemicals, survivability during stress condition. This paper describes a method developed for predicting GST proteins. We have used a dataset of 107 GST and 107 non-GST proteins for training and the performance of the method was evaluated with five-fold cross-validation technique. First a SVM based method has been developed using amino acid and dipeptide composition and achieved the maximum accuracy of 91.59% and 95.79% respectively. In addition we developed a SVM based method using tripeptide composition and achieved maximum accuracy 97.66% which is better than accuracy achieved by HMM based searching (96.26%). Based on above study a web-server GSTPred has been developed (http://www.imtech.res.in/raghava/gstpred/).


PLOS ONE | 2015

Exome Analysis Reveals Differentially Mutated Gene Signatures of Stage, Grade and Subtype in Breast Cancers

You Li; Xiaosheng Wang; Suleyman Vural; Nitish K. Mishra; Kenneth H. Cowan; Chittibabu Guda

Breast cancers exhibit highly heterogeneous molecular profiles. Although gene expression profiles have been used to predict the risks and prognostic outcomes of breast cancers, the high variability of gene expression limits its clinical application. In contrast, genetic mutation profiles would be more advantageous than gene expression profiles because genetic mutations can be stably detected and the mutational heterogeneity widely exists in breast cancer genomes. We analyzed 98 breast cancer whole exome samples that were sorted into three subtypes, two grades and two stages. The sum deleterious effect of all mutations in each gene was scored to identify differentially mutated genes (DMGs) for this case-control study. DMGs were corroborated using extensive published knowledge. Functional consequences of deleterious SNVs on protein structure and function were also investigated. Genes such as ERBB2, ESP8, PPP2R4, KIAA0922, SP4, CENPJ, PRCP and SELP that have been experimentally or clinically verified to be tightly associated with breast cancer prognosis are among the DMGs identified in this study. We also identified some genes such as ARL6IP5, RAET1E, and ANO7 that could be crucial for breast cancer development and prognosis. Further, SNVs such as rs1058808, rs2480452, rs61751507, rs79167802, rs11540666, and rs2229437 that potentially influence protein functions are observed at significantly different frequencies in different comparison groups. Protein structure modeling revealed that many non-synonymous SNVs have a deleterious effect on protein stability, structure and function. Mutational profiling at gene- and SNV-level revealed differential patterns within each breast cancer comparison group, and the gene signatures correlate with expected prognostic characteristics of breast cancer classes. Some of the genes and SNVs identified in this study show high promise and are worthy of further investigation by experimental studies.


Oncotarget | 2017

Genome-wide DNA methylation analysis reveals molecular subtypes of pancreatic cancer

Nitish K. Mishra; Chittibabu Guda

Pancreatic cancer (PC) is the fourth leading cause of cancer deaths in the United States with a five-year patient survival rate of only 6%. Early detection and treatment of this disease is hampered due to lack of reliable diagnostic and prognostic markers. Recent studies have shown that dynamic changes in the global DNA methylation and gene expression patterns play key roles in the PC development; hence, provide valuable insights for better understanding the initiation and progression of PC. In the current study, we used DNA methylation, gene expression, copy number, mutational and clinical data from pancreatic patients. We independently investigated the DNA methylation and differential gene expression profiles between normal and tumor samples and correlated methylation levels with gene expression patterns. We observed a total of ~23-thousand differentially methylated CpG sites (Δβ≥0.1) between normal and tumor samples, where majority of the CpG sites are hypermethylated in PC, and this phenomenon is more prominent in the 5′UTRs and promoter regions compared to the gene bodies. Differential methylation is observed in genes associated with the homeobox domain, cell division and differentiation, cytoskeleton, epigenetic regulation and development, pancreatic development and pancreatic signaling and pancreatic cancer core signaling pathways. Correlation analysis suggests that methylation in the promoter region and 5′UTR has mostly negative correlations with gene expression while gene body and 3′UTR associated methylation has positive correlations. Regulatory element analysis suggests that HOX cluster and histone core proteins are upstream regulators of hypomethylation, while SMAD4, STAT4, STAT5B and zinc finger proteins (ZNF) are upstream regulators of hypermethylation. Non-negative matrix factorization (NMF) clustering of differentially methylated sites generated three clusters in PCs suggesting the existence of distinct molecular subtypes. Cluster 1 and cluster 2 showed samples enriched with clinical phenotypes like neoplasm histological grade and pathologic T-stage T3, respectively, while cluster 3 showed the enrichment of samples with neoplasm histological grade G1. To the best of our knowledge, this is the first genome-scale methylome analysis of PC data from TCGA. Our clustering analysis provides a strong basis for future work on the molecular subtyping of epigenetic regulation in pancreatic cancer.


bioRxiv | 2018

Placental epigenetics for evaluation of fetal congenital heart defects: Ventricular septal defect (VSD)

Ray O. Bahado-Singh; Samet Albayrak; Rita Zafra; Alosh Baraa; Avinash M. Veerappa; Deepthi Mahishi; Nazia Saiyed; Nitish K. Mishra; Chittibabu Guda; Rouba Ali-Fehmi; Uppala Radhakrishna

Ventricular Septal Defect (VSD), the most common congenital heart defect, is characterized by a hole in the septum between the right and left ventricles. The pathogenesis of VSD is unknown in most clinical cases. There is a paucity of data relevant to epigenetic changes in VSD. The placenta is a fetal tissue and is a potentially useful surrogate for the evaluation of fetal organ development. To understand epigenetic mechanisms that may play a role in the development of VSD, a genome-wide DNA methylation assay of the placentas of 8 term subjects with isolated VSD and no known or suspected genetic syndromes and 10 normal controls was performed using the Illumina HumanMethylation450 BeadChip assay. The study identified a total of 80 highly accurate potential epigenomic markers in 80 genes for the detection of VSD; area under the receiver operating characteristic curve (AUC ROC) = 1.0 with significant 95% CI (FDR) p-values < 0.05. The biological processes and functions for these differentially methylated genes are known to be associated with heart development or heart disease, including cardiac ventricle development (HEY2, ISL1), heart looping (SRF), cardiac muscle cell differentiation (ACTC1, HEY2), cardiac septum development (ISL1), heart morphogenesis (SRF, HEY2, ISL1, HEYL), Notch signaling pathway (HEY2, HEYL), cardiac chamber development (ISL1), and cardiac muscle tissue development (ACTC1, ISL1). The study also identified eight microRNA genes that have the potential to be biomarkers for the early detection of VSD including miR-191, miR-548F1, miR-148A, miR-423, miR-92B, miR-611, miR-2110, and miR-548H4. To our knowledge this is the first report in which placental analysis has been used for determining the pathogenesis of and predicting CHD.

Collaboration


Dive into the Nitish K. Mishra's collaboration.

Top Co-Authors

Avatar

Gajendra P. S. Raghava

Indraprastha Institute of Information Technology

View shared research outputs
Top Co-Authors

Avatar

Chittibabu Guda

University of Nebraska Medical Center

View shared research outputs
Top Co-Authors

Avatar
Top Co-Authors

Avatar

Rita Zafra

Icahn School of Medicine at Mount Sinai

View shared research outputs
Top Co-Authors

Avatar
Top Co-Authors

Avatar
Top Co-Authors

Avatar
Top Co-Authors

Avatar

Jagat Singh Chauhan

Council of Scientific and Industrial Research

View shared research outputs
Top Co-Authors

Avatar

Nazia Saiyed

Nirma University of Science and Technology

View shared research outputs
Top Co-Authors

Avatar

Alosh Baraa

Wayne State University

View shared research outputs
Researchain Logo
Decentralizing Knowledge