Network


Latest external collaboration on country level. Dive into details by clicking on the dots.

Hotspot


Dive into the research topics where M. Michael Gromiha is active.

Publication


Featured researches published by M. Michael Gromiha.


Nucleic Acids Research | 2006

CUPSAT: prediction of protein stability upon point mutations

Vijaya Parthiban; M. Michael Gromiha; Dietmar Schomburg

CUPSAT (Cologne University Protein Stability Analysis Tool) is a web tool to analyse and predict protein stability changes upon point mutations (single amino acid mutations). This program uses structural environment specific atom potentials and torsion angle potentials to predict ΔΔG, the difference in free energy of unfolding between wild-type and mutant proteins. It requires the protein structure in Protein Data Bank format and the location of the residue to be mutated. The output consists information about mutation site, its structural features (solvent accessibility, secondary structure and torsion angles), and comprehensive information about changes in protein stability for 19 possible substitutions of a specific amino acid mutation. Additionally, it also analyses the ability of the mutated amino acids to adapt the observed torsion angles. Results were tested on 1538 mutations from thermal denaturation and 1603 mutations from chemical denaturation experiments. Several validation tests (split-sample, jack-knife and k-fold) were carried out to ensure the reliability, accuracy and transferability of the prediction method that gives >80% prediction accuracy for most of these validation tests. Thus, the program serves as a valuable tool for the analysis of protein design and stability. The tool is accessible from the link .


Bioinformatics | 2004

Analysis and prediction of DNA-binding proteins and their binding residues based on composition, sequence and structural information

Shandar Ahmad; M. Michael Gromiha; Akinori Sarai

MOTIVATION Though vitally important to cell function, the mechanism of protein-DNA binding has not yet been completely understood. We therefore analysed the relationship between DNA binding and protein sequence composition, solvent accessibility and secondary structure. Using non-redundant databases of transcription factors and protein-DNA complexes, neural network models were developed to utilize the information present in this relationship to predict DNA-binding proteins and their binding residues. RESULTS Sequence composition was found to provide sufficient information to predict the probability of its binding to DNA with nearly 69% sensitivity at 64% accuracy for the considered proteins; sequence neighbourhood and solvent accessibility information were sufficient to make binding site predictions with 40% sensitivity at 79% accuracy. Detailed analysis of binding residues shows that some three- and five-residue segments frequently bind to DNA and that solvent accessibility plays a major role in binding. Although, binding behaviour was not associated with any particular secondary structure, there were interesting exceptions at the residue level. Over-representation of some residues in the binding sites was largely lost at the total sequence level, but a different kind of compositional preference was observed in DNA-binding proteins.


Nucleic Acids Research | 2006

ProTherm and ProNIT: thermodynamic databases for proteins and protein–nucleic acid interactions

M. D. Shaji Kumar; K. Abdulla Bava; M. Michael Gromiha; Ponraj Prabakaran; Koji Kitajima; Hatsuho Uedaira; Akinori Sarai

ProTherm and ProNIT are two thermodynamic databases that contain experimentally determined thermodynamic parameters of protein stability and protein–nucleic acid interactions, respectively. The current versions of both the databases have considerably increased the total number of entries and enhanced search interface with added new fields, improved search, display and sorting options. As on September 2005, ProTherm release 5.0 contains 17 113 entries from 771 proteins, retrieved from 1497 scientific articles (∼20% increase in data from the previous version). ProNIT release 2.0 contains 4900 entries from 273 research articles, representing 158 proteins. Both databases can be queried using WWW interfaces. Both quick search and advanced search are provided on this web page to facilitate easy retrieval and display of the data from these databases. ProTherm is freely available online at and ProNIT at .


Nucleic Acids Research | 2000

ProTherm, version 2.0: thermodynamic database for proteins and mutants

M. Michael Gromiha; Jianghong An; Hidetoshi Kono; Motohisa Oobatake; Hatsuho Uedaira; Ponraj Prabakaran; Akinori Sarai

Release 4.0 of ProTherm, thermodynamic database for proteins and mutants, contains approximately 14,500 numerical data (approximately 450% of the first version) of several thermodynamic parameters along with experimental methods and conditions, and structural, functional and literature information. The sequence and structural information of proteins is connected with thermodynamic data through links between entries in Protein Data Bank, Protein Information Resource and SWISS-PROT and the data in ProTherm. We have separated the Gibbs free energy change obtained at extrapolated temperature from the data on denaturation temperature measured by the thermal denaturation method. We have added the statistics of amino acid replacements and links to homologous structures to each protein. Further, we have improved the search and display options to enhance search capability through the web interface. ProTherm is freely available at http://gibk26. bse.kyutech.ac.jp/jouhou/Protherm/protherm.html.


BMC Bioinformatics | 2004

ASAView: Database and tool for solvent accessibility representation in proteins

Shandar Ahmad; M. Michael Gromiha; Hamed Fawareh; Akinori Sarai

BackgroundAccessible surface area (ASA) or solvent accessibility of amino acids in a protein has important implications. Knowledge of surface residues helps in locating potential candidates of active sites. Therefore, a method to quickly see the surface residues in a two dimensional model would help to immediately understand the population of amino acid residues on the surface and in the inner core of the proteins.ResultsASAView is an algorithm, an application and a database of schematic representations of solvent accessibility of amino acid residues within proteins. A characteristic two-dimensional spiral plot of solvent accessibility provides a convenient graphical view of residues in terms of their exposed surface areas. In addition, sequential plots in the form of bar charts are also provided. Online plots of the proteins included in the entire Protein Data Bank (PDB), are provided for the entire protein as well as their chains separately.ConclusionsThese graphical plots of solvent accessibility are likely to provide a quick view of the overall topological distribution of residues in proteins. Chain-wise computation of solvent accessibility is also provided.


Proteins | 2003

Real value prediction of solvent accessibility from amino acid sequence

Shandar Ahmad; M. Michael Gromiha; Akinori Sarai

The solvent accessibility of amino acid residues has been predicted in the past by classifying them into exposure states with varying thresholds. This classification provides a wide range of values for the accessible surface area (ASA) within which a residue may fall. Thus far, no attempt has been made to predict real values of ASA from the sequence information without a priori classification into exposure states. Here, we present a new method with which to predict real value ASAs for residues, based on neighborhood information. Our real value prediction neural network could estimate the ASA for four different nonhomologous, nonredundant data sets of varying size, with 18.0–19.5% mean absolute error, defined as per residue absolute difference between the predicted and experimental values of relative ASA. Correlation between the predicted and experimental values ranged from 0.47 to 0.50. It was observed that the ASA of a residue could be predicted within a 23.7% mean absolute error, even when no information about its neighbors is included. Prediction of real values answers the issue of arbitrary choice of ASA state thresholds, and carries more information than category prediction. Prediction error for each residue type strongly correlates with the variability in its experimental ASA values. Proteins 2003;50:629–635.


Biophysical Chemistry | 1999

Important amino acid properties for enhanced thermostability from mesophilic to thermophilic proteins

M. Michael Gromiha; Motohisa Oobatake; Akinori Sarai

Understanding the role of various interactions in enhancing the thermostability of proteins is important not only for clarifying the mechanism of protein stability but also for designing stable proteins. In this work, we have analyzed the thermostability of 16 different families by comparing mesophilic and thermophilic proteins with 48 various physicochemical, energetic and conformational properties. We found that the increase in shape, s (location of branch point in side chain) increases the thermostability, whereas, an opposite trend is observed for Gibbs free energy change of hydration for native proteins, GhN, in 14 families. A good correlation is observed between these two properties and the simultaneous increases of -GhN and s is necessary to enhance the thermostability from mesophile to thermophile. The increase in shape, which tends to increase with increasing number of carbon atoms both for polar and non-polar residues, may generate more packing and compactness, and the position of beta and higher order branches may be important for better packing. On the other hand, the increase in -GhN in thermophilic proteins increases the solubility of the proteins. This tendency counterbalances the increases in insolubility and unfolding heat capacity change due to the increase in the number of carbon atoms. Thus, the present results suggest that the stability of thermophilic proteins may be achieved by a balance between better packing and solubility.


Proteins | 2008

Prediction of RNA binding sites in a protein using SVM and PSSM profile

Manish Kumar; M. Michael Gromiha; Gajendra P. S. Raghava

RNA‐binding proteins (RBPs) play key roles in post‐transcriptional control of gene expression, which, along with transcriptional regulation, is a major way to regulate patterns of gene expression during development. Thus, the identification and prediction of RNA binding sites is an important step in comprehensive understanding of how RBPs control organism development. Combining evolutionary information and support vector machine (SVM), we have developed an improved method for predicting RNA binding sites or RNA interacting residues in a protein sequence. The prediction models developed in this study have been trained and tested on 86 RNA binding protein chains and evaluated using fivefold cross validation technique. First, a SVM model was developed that achieved a maximum Matthews correlation coefficient (MCC) of 0.31. The performance of this SVM model further improved the MCC from 0.31 to 0.45, when multiple sequence alignment in the form of PSSM profiles was used as input to the SVM, which is far better than the maximum MCC achieved by previous methods (0.41) on the same dataset. In addition, SVM models were also developed on an alternative dataset that contained 107 RBP chains. Utilizing PSSM as input information to the SVM, the training/testing on this alternate dataset achieved a maximum MCC of 0.32. Conclusively, the prediction performance of SVM models developed in this study is better than the existing methods on the same datasets. A web server ‘Pprint’ was also developed for predicting RNA binding residues in a protein sequence which is freely available at http://www.imtech.res.in/raghava/pprint/. Proteins 2008.


BMC Bioinformatics | 2007

Identification of DNA-binding proteins using support vector machines and evolutionary profiles

Manish Kumar; M. Michael Gromiha; Gajendra P. S. Raghava

BackgroundIdentification of DNA-binding proteins is one of the major challenges in the field of genome annotation, as these proteins play a crucial role in gene-regulation. In this paper, we developed various SVM modules for predicting DNA-binding domains and proteins. All models were trained and tested on multiple datasets of non-redundant proteins.ResultsSVM models have been developed on DNAaset, which consists of 1153 DNA-binding and equal number of non DNA-binding proteins, and achieved the maximum accuracy of 72.42% and 71.59% using amino acid and dipeptide compositions, respectively. The performance of SVM model improved from 72.42% to 74.22%, when evolutionary information in form of PSSM profiles was used as input instead of amino acid composition. In addition, SVM models have been developed on DNAset, which consists of 146 DNA-binding and 250 non-binding chains/domains, and achieved the maximum accuracy of 79.80% and 86.62% using amino acid composition and PSSM profiles. The SVM models developed in this study perform better than existing methods on a blind dataset.ConclusionA highly accurate method has been developed for predicting DNA-binding proteins using SVM and PSSM profiles. This is the first study in which evolutionary information in form of PSSM profiles has been used successfully for predicting DNA-binding proteins. A web-server DNAbinder has been developed for identifying DNA-binding proteins and domains from query amino acid sequences http://www.imtech.res.in/raghava/dnabinder/.


Nucleic Acids Research | 2006

FOLD-RATE: prediction of protein folding rates from amino acid sequence

M. Michael Gromiha; A. Mary Thangakani; Samuel Selvaraj

We have developed a web server, FOLD-RATE, for predicting the folding rates of proteins from their amino acid sequences. The relationship between amino acid properties and protein folding rates has been systematically analyzed and a statistical method based on linear regression technique has been proposed for predicting the folding rate of proteins. We found that the classification of proteins into different structural classes shows an excellent correlation between amino acid properties and folding rates of two and three-state proteins. Consequently, different regression equations have been developed for proteins belonging to all-α, all-β and mixed class. We observed an excellent agreement between predicted and experimentally observed folding rates of proteins; the correlation coefficients are, 0.99, 0.97 and 0.90, respectively, for all-α, all-β and mixed class proteins. The prediction server is freely available at .

Collaboration


Dive into the M. Michael Gromiha's collaboration.

Top Co-Authors

Avatar
Top Co-Authors

Avatar

Akinori Sarai

Kyushu Institute of Technology

View shared research outputs
Top Co-Authors

Avatar

Ponraj Prabakaran

Kyushu Institute of Technology

View shared research outputs
Top Co-Authors

Avatar

Hatsuho Uedaira

National Institute of Advanced Industrial Science and Technology

View shared research outputs
Top Co-Authors

Avatar

Hidetoshi Kono

Japan Atomic Energy Agency

View shared research outputs
Top Co-Authors

Avatar

Makiko Suwa

National Institute of Advanced Industrial Science and Technology

View shared research outputs
Top Co-Authors

Avatar

Kazuhiko Fukui

National Institute of Advanced Industrial Science and Technology

View shared research outputs
Top Co-Authors

Avatar
Top Co-Authors

Avatar
Top Co-Authors

Avatar

Jianghong An

Scripps Research Institute

View shared research outputs
Researchain Logo
Decentralizing Knowledge