Network


Latest external collaboration on country level. Dive into details by clicking on the dots.

Hotspot


Dive into the research topics where Reiji Teramoto is active.

Publication


Featured researches published by Reiji Teramoto.


Journal of Chemical Information and Modeling | 2007

Supervised consensus scoring for docking and virtual screening

Reiji Teramoto; Hiroaki Fukunishi

Docking programs are widely used to discover novel ligands efficiently and can predict protein-ligand complex structures with reasonable accuracy and speed. However, there is an emerging demand for better performance from the scoring methods. Consensus scoring (CS) methods improve the performance by compensating for the deficiencies of each scoring function. However, conventional CS and existing scoring functions have the same problems, such as a lack of protein flexibility, inadequate treatment of salvation, and the simplistic nature of the energy function used. Although there are many problems in current scoring functions, we focus our attention on the incorporation of unbound ligand conformations. To address this problem, we propose supervised consensus scoring (SCS), which takes into account protein-ligand binding process using unbound ligand conformations with supervised learning. An evaluation of docking accuracy for 100 diverse protein-ligand complexes shows that SCS outperforms both CS and 11 scoring functions (PLP, F-Score, LigScore, DrugScore, LUDI, X-Score, AutoDock, PMF, G-Score, ChemScore, and D-score). The success rates of SCS range from 89% to 91% in the range of rmsd < 2 A, while those of CS range from 80% to 85%, and those of the scoring functions range from 26% to 76%. Moreover, we also introduce a method for judging whether a compound is active or inactive with the appropriate criterion for virtual screening. SCS performs quite well in docking accuracy and is presumably useful for screening large-scale compound databases before predicting binding affinity.


Biochimica et Biophysica Acta | 2008

Protein expression profile characteristic to hepatocellular carcinoma revealed by 2D-DIGE with supervised learning

Reiji Teramoto; Hirotaka Minagawa; Masao Honda; Kenji Miyazaki; Yo Tabuse; Kenichi Kamijo; Teruyuki Ueda; Shuichi Kaneko

Hepatocellular carcinoma (HCC) is one of the most common and aggressive human malignancies. Although several major risks related to HCC, e.g., hepatitis B and/or hepatitis C virus infection, aflatoxin B1 exposure, alcohol drinking and genetic defects have been revealed, the molecular mechanisms leading to the initiation and progression of HCC have not been clarified. To reduce the mortality and improve the effectiveness of therapy, it is important to detect the proteins which are associated with tumor progression and may be useful as potential therapeutic or diagnosis targets. However, previous studies have not yet revealed the associations among HCC cells, histological grade and AFP. Here, we performed two-dimensional difference gel electrophoresis (2D-DIGE) combined with MS for 18 HCC patients. To focus not on individual proteins but on multiple proteins associated with pathogenesis, we introduce the supervised feature selection based on stochastic gradient boosting (SGB) for identifying protein spots that discriminate HCC/non HCC, histological grade of moderate/well and high alpha-fetoprotein (AFP)/low AFP level without arbitrariness. We detected 18, 25 and 27 protein spots associated with HCC, histological grade and AFP level, respectively. We confirmed that SGB is able to identify the known HCC-related proteins, e.g., heat shock proteins, carbonic anhydrase 2. Moreover, we identified the differentially expressed proteins associated with histological grade of HCC and AFP level and found that aldo-keto reductase 1B10 (AKR1B10) is related to well differentiated HCC, keratin 8 (KRT8) is related to both histological grade and AFP level and protein disulfide isomerase-associated 3 (PDIA3) is associated with both HCC and AFP level. Our pilot study provides new insights on understanding the pathogenesis of HCC, histological grade and AFP level.


Journal of Chemical Information and Modeling | 2008

Consensus scoring with feature selection for structure-based virtual screening.

Reiji Teramoto; Hiroaki Fukunishi

The evaluation of ligand conformations is a crucial aspect of structure-based virtual screening, and scoring functions play significant roles in it. While consensus scoring (CS) generally improves enrichment by compensating for the deficiencies of each scoring function, the strategy of how individual scoring functions are selected remains a challenging task when few known active compounds are available. To address this problem, we propose feature selection-based consensus scoring (FSCS), which performs supervised feature selection with docked native ligand conformations to select complementary scoring functions. We evaluated the enrichments of five scoring functions (F-Score, D-Score, PMF, G-Score, and ChemScore), FSCS, and RCS (rank-by-rank consensus scoring) for four different target proteins: acetylcholine esterase (AChE), thrombin (thrombin), phosphodiesterase 5 (PDE5), and peroxisome proliferator-activated receptor gamma (PPARgamma). The results indicated that FSCS was able to select the complementary scoring functions and enhance ligand enrichments and that it outperformed RCS and the individual scoring functions for all target proteins. They also indicated that the performances of the single scoring functions were strongly dependent on the target protein. An especially favorable result with implications for practical drug screening is that FSCS performs well even if only one 3D structure of the protein-ligand complex is known. Moreover, we found that one can infer which scoring functions significantly enrich active compounds by using feature selection before actual docking and that the selected scoring functions are complementary.


Journal of Chemical Information and Modeling | 2008

Bootstrap-based consensus scoring method for protein-ligand docking.

Hiroaki Fukunishi; Reiji Teramoto; Toshikazu Takada; Jiro Shimada

To improve the performance of a single scoring function used in a protein-ligand docking program, we developed a bootstrap-based consensus scoring (BBCS) method, which is based on ensemble learning. BBCS combines multiple scorings, each of which has the same function form but different energy-parameter sets. These multiple energy-parameter sets are generated in two steps: (1) generation of training sets by a bootstrap method and (2) optimization of energy-parameter set by a Z-score approach, which is based on energy landscape theory as used in protein folding, against each training set. In this study, we applied BBCS to the FlexX scoring function. Using given 50 complexes, we generated 100 training sets and obtained 100 optimized energy-parameter sets. These parameter sets were tested against 48 complexes different from the training sets. BBCS was shown to be an improvement over single scoring when using a parameter set optimized by the same Z-score approach. Comparing BBCS with the original FlexX scoring function, we found that (1) the success rate of recognizing the crystal structure at the top relative to decoys increased from 33.3% to 52.1% and that (2) the rank of the crystal structure improved for 54.2% of the complexes and worsened for none. We also found that BBCS performed better than conventional consensus scoring (CS).


Journal of Chemical Information and Modeling | 2008

Structure-Based Virtual Screening with Supervised Consensus Scoring: Evaluation of Pose Prediction and Enrichment Factors

Reiji Teramoto; Hiroaki Fukunishi

Since the evaluation of ligand conformations is a crucial aspect of structure-based virtual screening, scoring functions play significant roles in it. However, it is known that a scoring function does not always work well for all target proteins. When one cannot know which scoring function works best against a target protein a priori, there is no standard scoring method to know it even if 3D structure of a target protein-ligand complex is available. Therefore, development of the method to achieve high enrichments from given scoring functions and 3D structure of protein-ligand complex is a crucial and challenging task. To address this problem, we applied SCS (supervised consensus scoring), which employs a rough linear correlation between the binding free energy and the root-mean-square deviation (rmsd) of a native ligand conformations and incorporates protein-ligand binding process with docked ligand conformations using supervised learning, to virtual screening. We evaluated both the docking poses and enrichments of SCS and five scoring functions (F-Score, G-Score, D-Score, ChemScore, and PMF) for three different target proteins: thymidine kinase (TK), thrombin (thrombin), and peroxisome proliferator-activated receptor gamma (PPARgamma). Our enrichment studies show that SCS is competitive or superior to a best single scoring function at the top ranks of screened database. We found that the enrichments of SCS could be limited by a best scoring function, because SCS is obtained on the basis of the five individual scoring functions. Therefore, it is concluded that SCS works very successfully from our results. Moreover, from docking pose analysis, we revealed the connection between enrichment and average centroid distance of top-scored docking poses. Since SCS requires only one 3D structure of protein-ligand complex, SCS will be useful for identifying new ligands.


Journal of Chemical Information and Modeling | 2007

Supervised Scoring Models with Docked Ligand Conformations for Structure-Based Virtual Screening

Reiji Teramoto; Hiroaki Fukunishi

Protein-ligand docking programs have been used to efficiently discover novel ligands for target proteins from large-scale compound databases. However, better scoring methods are needed. Generally, scoring functions are optimized by means of various techniques that affect their fitness for reproducing X-ray structures and protein-ligand binding affinities. However, these scoring functions do not always work well for all target proteins. A scoring function should be optimized for a target protein to enhance enrichment for structure-based virtual screening. To address this problem, we propose the supervised scoring model (SSM), which takes into account the protein-ligand binding process using docked ligand conformations with supervised learning for optimizing scoring functions against a target protein. SSM employs a rough linear correlation between binding free energy and the root mean square deviation of a native ligand for predicting binding energy. We applied SSM to the FlexX scoring function, that is, F-Score, with five different target proteins: thymidine kinase (TK), estrogen receptor (ER), acetylcholine esterase (AChE), phosphodiesterase 5 (PDE5), and peroxisome proliferator-activated receptor gamma (PPARgamma). For these five proteins, SSM always enhanced enrichment better than F-Score, exhibiting superior performance that was particularly remarkable for TK, AChE, and PPARgamma. We also demonstrated that SSM is especially good at enhancing enrichments of the top ranks of screened compounds, which is useful in practical drug screening.


Statistical Applications in Genetics and Molecular Biology | 2009

Balanced Gradient Boosting from Imbalanced Data for Clinical Outcome Prediction

Reiji Teramoto

In clinical outcome prediction, such as disease diagnosis and prognosis, it is often assumed that the class, e.g., disease and control, is equally distributed. However, in practice we often encounter biological or clinical data whose class distribution is highly skewed. Since standard supervised learning algorithms intend to maximize the overall prediction accuracy, a prediction model tends to show a strong bias toward the majority class when it is trained on such imbalanced data. Therefore, the class distribution should be incorporated appropriately to learn from imbalanced data. To address this practically important problem, we proposed balanced gradient boosting (BalaBoost) which reformulates gradient boosting to avoid the overfitting to the majority class and is sensitive to the minority class by making use of the equal class distribution instead of the empirical class distribution. We applied BalaBoost to cancer tissue diagnosis based on miRNA expression data, premature death prediction for diabetes patients based on biochemical and clinical variables and tumor grade prediction of renal cell carcinoma based on tumor marker expressions whose class distribution is highly skewed. Experimental results showed that BalaBoost outperformed the representative supervised learning algorithms, i.e., gradient boosting, Random Forests and Support Vector Machine. Our results led us to the conclusion that BalaBoost is promising for clinical outcome prediction from imbalanced data.


Computational Biology and Chemistry | 2008

Brief Communication: Prediction of Alzheimer's diagnosis using semi-supervised distance metric learning with label propagation

Reiji Teramoto

Alzheimers disease (AD) is the most common form of dementia and leads to irreversible neurogenerative damage of the brain. However, the current diagnostic tools have poor sensitivity, especially for the early stages of AD and do not allow for diagnosis until AD has lead to irreversible brain damage. Therefore, it is crucial that AD is detected as early as possible. Although it is very hard, laborious and time-consuming to gather many AD and non-AD labeled samples, gathering unlabeled samples is easier than labeled samples. Since standard learning algorithms learn a diagnosis model from labeled samples only, they require many labeled samples and do not work well when the number of training samples is small. Therefore, it is very desirable to develop a predictive learning method to achieve high performance using both labeled samples and unlabeled samples. To address these problems, we propose semi-supervised distance metric learning using Random Forests with label propagation (SRF-LP) which incorporates labeled data for obtaining good metrics and propagates labels based on them. Experimental results showed that SRF-LP outperformed standard supervised learning algorithms, i.e., RF, SVM, Adaboost and CART and reached 93.1% accuracy at a maximum. Especially, SRF-LP largely outperformed when the number of training samples is very small. Our results also suggested that SRF-LP exhibits a synergistic effect of semi-supervised distance metric learning and label propagation.


Biochemical and Biophysical Research Communications | 2008

Comparative proteomic and transcriptomic profiling of the human hepatocellular carcinoma.

Hirotaka Minagawa; Masao Honda; Kenji Miyazaki; Yo Tabuse; Reiji Teramoto; Taro Yamashita; Ryuhei Nishino; Hajime Takatori; Teruyuki Ueda; Ken’ichi Kamijo; Shuichi Kaneko


Journal of Chemical Information and Modeling | 2008

Hidden Active Information in a Random Compound Library : Extraction Using a Pseudo-Structure-Activity Relationship Model

Hiroaki Fukunishi; Reiji Teramoto; Jiro Shimada

Collaboration


Dive into the Reiji Teramoto's collaboration.

Researchain Logo
Decentralizing Knowledge