Network


Latest external collaboration on country level. Dive into details by clicking on the dots.

Hotspot


Dive into the research topics where Sheng-You Huang is active.

Publication


Featured researches published by Sheng-You Huang.


Proteins | 2006

Ensemble docking of multiple protein structures: considering protein structural variations in molecular docking.

Sheng-You Huang; Xiaoqin Zou

One approach to incorporate protein flexibility in molecular docking is the use of an ensemble consisting of multiple protein structures. Sequentially docking each ligand into a large number of protein structures is computationally too expensive to allow large‐scale database screening. It is challenging to achieve a good balance between docking accuracy and computational efficiency. In this work, we have developed a fast, novel docking algorithm utilizing multiple protein structures, referred to as ensemble docking, to account for protein structural variations. The algorithm can simultaneously dock a ligand into an ensemble of protein structures and automatically select an optimal protein structure that best fits the ligand by optimizing both ligand coordinates and the conformational variable m, where m represents the m‐th structure in the protein ensemble. The docking algorithm was validated on 10 protein ensembles containing 105 crystal structures and 87 ligands in terms of binding mode and energy score predictions. A success rate of 93% was obtained with the criterion of root‐mean‐square deviation <2.5 Å if the top five orientations for each ligand were considered, comparable to that of sequential docking in which scores for individual docking are merged into one list by re‐ranking, and significantly better than that of single rigid‐receptor docking (75% on average). Similar trends were also observed in binding score predictions and enrichment tests of virtual database screening. The ensemble docking algorithm is computationally efficient, with a computational time comparable to that for docking a ligand into a single protein structure. In contrast, the computational time for the sequential docking method increases linearly with the number of protein structures in the ensemble. The algorithm was further evaluated using a more realistic ensemble in which the corresponding bound protein structures of inhibitors were excluded. The results show that ensemble docking successfully predicts the binding modes of the inhibitors, and discriminates the inhibitors from a set of noninhibitors with similar chemical properties. Although multiple experimental structures were used in the present work, our algorithm can be easily applied to multiple protein conformations generated by computational methods, and helps improve the efficiency of other existing multiple protein structure(MPS)‐based methods to accommodate protein flexibility. Proteins 2007.


Physical Chemistry Chemical Physics | 2010

Scoring functions and their evaluation methods for protein–ligand docking: recent advances and future directions

Sheng-You Huang; Sam Z. Grinter; Xiaoqin Zou

The scoring function is one of the most important components in structure-based drug design. Despite considerable success, accurate and rapid prediction of protein-ligand interactions is still a challenge in molecular docking. In this perspective, we have reviewed three basic types of scoring functions (force-field, empirical, and knowledge-based) and the consensus scoring technique that are used for protein-ligand docking. The commonly-used assessment criteria and publicly available protein-ligand databases for performance evaluation of the scoring functions have also been presented and discussed. We end with a discussion of the challenges faced by existing scoring functions and possible future directions for developing improved scoring functions.


International Journal of Molecular Sciences | 2010

Advances and Challenges in Protein-Ligand Docking

Sheng-You Huang; Xiaoqin Zou

Molecular docking is a widely-used computational tool for the study of molecular recognition, which aims to predict the binding mode and binding affinity of a complex formed by two or more constituent molecules with known structures. An important type of molecular docking is protein-ligand docking because of its therapeutic applications in modern structure-based drug design. Here, we review the recent advances of protein flexibility, ligand sampling, and scoring functions—the three important aspects in protein-ligand docking. Challenges and possible future directions are discussed in the Conclusion.


Journal of Molecular Biology | 2011

Community-wide assessment of protein-interface modeling suggests improvements to design methodology

Sarel J. Fleishman; Timothy A. Whitehead; Eva Maria Strauch; Jacob E. Corn; Sanbo Qin; Huan-Xiang Zhou; Julie C. Mitchell; Omar Demerdash; Mayuko Takeda-Shitaka; Genki Terashi; Iain H. Moal; Xiaofan Li; Paul A. Bates; Martin Zacharias; Hahnbeom Park; Jun Su Ko; Hasup Lee; Chaok Seok; Thomas Bourquard; Julie Bernauer; Anne Poupon; Jérôme Azé; Seren Soner; Şefik Kerem Ovali; Pemra Ozbek; Nir Ben Tal; Turkan Haliloglu; Howook Hwang; Thom Vreven; Brian G. Pierce

The CAPRI (Critical Assessment of Predicted Interactions) and CASP (Critical Assessment of protein Structure Prediction) experiments have demonstrated the power of community-wide tests of methodology in assessing the current state of the art and spurring progress in the very challenging areas of protein docking and structure prediction. We sought to bring the power of community-wide experiments to bear on a very challenging protein design problem that provides a complementary but equally fundamental test of current understanding of protein-binding thermodynamics. We have generated a number of designed protein-protein interfaces with very favorable computed binding energies but which do not appear to be formed in experiments, suggesting that there may be important physical chemistry missing in the energy calculations. A total of 28 research groups took up the challenge of determining what is missing: we provided structures of 87 designed complexes and 120 naturally occurring complexes and asked participants to identify energetic contributions and/or structural features that distinguish between the two sets. The community found that electrostatics and solvation terms partially distinguish the designs from the natural complexes, largely due to the nonpolar character of the designed interactions. Beyond this polarity difference, the community found that the designed binding surfaces were, on average, structurally less embedded in the designed monomers, suggesting that backbone conformational rigidity at the designed surface is important for realization of the designed function. These results can be used to improve computational design strategies, but there is still much to be learned; for example, one designed complex, which does form in experiments, was classified by all metrics as a nonbinder.


Journal of Computational Chemistry | 2006

An iterative knowledge-based scoring function to predict protein–ligand interactions: I. Derivation of interaction potentials

Sheng-You Huang; Xiaoqin Zou

Using a novel iterative method, we have developed a knowledge‐based scoring function (ITScore) to predict protein–ligand interactions. The pair potentials for ITScore were derived from a training set of 786 protein–ligand complex structures in the Protein Data Bank. Twenty‐six atom types were used based on the atom type category of the SYBYL software. The iterative method circumvents the long‐standing reference state problem in the derivation of knowledge‐based scoring functions. The basic idea is to improve pair potentials by iteration until they correctly discriminate experimentally determined binding modes from decoy ligand poses for the ligand‐protein complexes in the training set. The iterative method is efficient and normally converges within 20 iterative steps. The scoring function based on the derived potentials was tested on a diverse set of 140 protein–ligand complexes for affinity prediction, yielding a high correlation coefficient of 0.74. Because ITScore uses SYBYL‐defined atom types, this scoring function is easy to use for molecular files prepared by SYBYL or converted by software such as BABEL.


Journal of Computational Chemistry | 2006

An iterative knowledge‐based scoring function to predict protein–ligand interactions: II. Validation of the scoring function

Sheng-You Huang; Xiaoqin Zou

We have developed an iterative knowledge‐based scoring function (ITScore) to describe protein–ligand interactions. Here, we assess ITScore through extensive tests on native structure identification, binding affinity prediction, and virtual database screening. Specifically, ITScore was first applied to a test set of 100 protein–ligand complexes constructed by Wang et al. (J Med Chem 2003, 46, 2287), and compared with 14 other scoring functions. The results show that ITScore yielded a high success rate of 82% on identifying native‐like binding modes under the criterion of rmsd ≤2 Å for each top‐ranked ligand conformation. The success rate increased to 98% if the top five conformations were considered for each ligand. In the case of binding affinity prediction, ITScore also obtained a good correlation for this test set (R = 0.65). Next, ITScore was used to predict binding affinities of a second diverse test set of 77 protein–ligand complexes prepared by Muegge and Martin (J Med Chem 1999, 42, 791), and compared with four other widely used knowledge‐based scoring functions. ITScore yielded a high correlation of R2 = 0.65 (or R = 0.81) in the affinity prediction. Finally, enrichment tests were performed with ITScore against four target proteins using the compound databases constructed by Jacobsson et al. (J Med Chem 2003, 46, 5781). The results were compared with those of eight other scoring functions. ITScore yielded high enrichments in all four database screening tests. ITScore can be easily combined with the existing docking programs for the use of structure‐based drug design.


Proteins | 2008

An iterative knowledge‐based scoring function for protein–protein recognition

Sheng-You Huang; Xiaoqin Zou

Using an efficient iterative method, we have developed a distance‐dependent knowledge‐based scoring function to predict protein–protein interactions. The function, referred to as ITScore‐PP, was derived using the crystal structures of a training set of 851 protein–protein dimeric complexes containing true biological interfaces. The key idea of the iterative method for deriving ITScore‐PP is to improve the interatomic pair potentials by iteration, until the pair potentials can distinguish true binding modes from decoy modes for the protein–protein complexes in the training set. The iterative method circumvents the challenging reference state problem in deriving knowledge‐based potentials. The derived scoring function was used to evaluate the ligand orientations generated by ZDOCK 2.1 and the native ligand structures on a diverse set of 91 protein–protein complexes. For the bound test cases, ITScore‐PP yielded a success rate of 98.9% if the top 10 ranked orientations were considered. For the more realistic unbound test cases, the corresponding success rate was 40.7%. Furthermore, for faster orientational sampling purpose, several residue‐level knowledge‐based scoring functions were also derived following the similar iterative procedure. Among them, the scoring function that uses the side‐chain center of mass (SCM) to represent a residue, referred to as ITScore‐PP(SCM), showed the best performance and yielded success rates of 71.4% and 30.8% for the bound and unbound cases, respectively, when the top 10 orientations were considered. ITScore‐PP was further tested using two other published protein–protein docking decoy sets, the ZDOCK decoy set and the RosettaDock decoy set. In addition to binding mode prediction, the binding scores predicted by ITScore‐PP also correlated well with the experimentally determined binding affinities, yielding a correlation coefficient of R = 0.71 on a test set of 74 protein–protein complexes with known affinities. ITScore‐PP is computationally efficient. The average run time for ITScore‐PP was about 0.03 second per orientation (including optimization) on a personal computer with 3.2 GHz Pentium IV CPU and 3.0 GB RAM. The computational speed of ITScore‐PP(SCM) is about an order of magnitude faster than that of ITScore‐PP. ITScore‐PP and/or ITScore‐PP(SCM) can be combined with efficient protein docking software to study protein–protein recognition. Proteins 2008.


Proteins | 2013

Community-wide evaluation of methods for predicting the effect of mutations on protein-protein interactions

Rocco Moretti; Sarel J. Fleishman; Rudi Agius; Mieczyslaw Torchala; Paul A. Bates; Panagiotis L. Kastritis; João Garcia Lopes Maia Rodrigues; Mikael Trellet; Alexandre M. J. J. Bonvin; Meng Cui; Marianne Rooman; Dimitri Gillis; Yves Dehouck; Iain H. Moal; Miguel Romero-Durana; Laura Pérez-Cano; Chiara Pallara; Brian Jimenez; Juan Fernández-Recio; Samuel Coulbourn Flores; Michael S. Pacella; Krishna Praneeth Kilambi; Jeffrey J. Gray; Petr Popov; Sergei Grudinin; Juan Esquivel-Rodriguez; Daisuke Kihara; Nan Zhao; Dmitry Korkin; Xiaolei Zhu

Community‐wide blind prediction experiments such as CAPRI and CASP provide an objective measure of the current state of predictive methodology. Here we describe a community‐wide assessment of methods to predict the effects of mutations on protein–protein interactions. Twenty‐two groups predicted the effects of comprehensive saturation mutagenesis for two designed influenza hemagglutinin binders and the results were compared with experimental yeast display enrichment data obtained using deep sequencing. The most successful methods explicitly considered the effects of mutation on monomer stability in addition to binding affinity, carried out explicit side‐chain sampling and backbone relaxation, evaluated packing, electrostatic, and solvation effects, and correctly identified around a third of the beneficial mutations. Much room for improvement remains for even the best techniques, and large‐scale fitness landscapes should continue to provide an excellent test bed for continued evaluation of both existing and new prediction methodologies. Proteins 2013; 81:1980–1987.


Journal of Chemical Information and Modeling | 2010

Inclusion of solvation and entropy in the knowledge-based scoring function for protein-ligand interactions.

Sheng-You Huang; Xiaoqin Zou

The effects of solvation and entropy play a critical role in determining the binding free energy in protein-ligand interactions. Despite the good balance between speed and accuracy, no current knowledge-based scoring functions account for the effects of solvation and configurational entropy explicitly due to the difficulty in deriving the corresponding pair potentials and the resulting double counting problem. In the present work, we have included the solvation effect and configurational entropy in the knowledge-based scoring function by an iterative method. The newly developed scoring function has yielded a success rate of 91% in identifying near-native binding modes with Wang et al.s benchmark of 100 diverse protein-ligand complexes. The results have been compared with the results of 15 other scoring functions for validation purpose. In binding affinity prediction, our scoring function has yielded a correlation of R(2) = 0.76 between the predicted binding scores and the experimentally measured binding affinities on the PMF validation sets of 77 diverse complexes. The results have been compared with R(2) of four other well-known knowledge-based scoring functions. Finally, our scoring function was also validated on the large PDBbind database of 1299 protein-ligand complexes and yielded a correlation coefficient of 0.474. The present computational model can be applied to other scoring functions to account for solvation and entropic effects.


Protein Science | 2006

Efficient molecular docking of NMR structures: Application to HIV-1 protease

Sheng-You Huang; Xiaoqin Zou

Docking ligands into an ensemble of NMR conformers is essential to structure‐based drug discovery if only NMR structures are available for the target. However, sequentially docking ligands into each NMR conformer through standard single‐receptor‐structure docking, referred to as sequential docking, is computationally expensive for large‐scale database screening because of the large number of NMR conformers involved. Recently, we developed an efficient ensemble docking algorithm to consider protein structural variations in ligand binding. The algorithm simultaneously docks ligands into an ensemble of protein structures and achieves comparable performance to sequential docking without significant increase in computational time over single‐structure docking. Here, we applied this algorithm to docking with NMR structures. The HIV‐1 protease was used for validation in terms of docking accuracy and virtual screening. Ensemble docking of the NMR structures identified 91% of the known inhibitors under the criterion of RMSD < 2.0 Å for the best‐scored conformation, higher than the average success rate of single docking of individual crystal structures (66%). In the virtual screening test, on average, ensemble docking of the NMR structures obtained higher enrichments than single‐structure docking of the crystal structures. In contrast, docking of either the NMR minimized average structure or a single NMR conformer performed less satisfactorily on both binding mode prediction and virtual screening, indicating that a single NMR structure may not be suitable for docking calculations. The success of ensemble docking of the NMR structures suggests an efficient alternative method for standard single docking of crystal structures and for considering protein flexibility.

Collaboration


Dive into the Sheng-You Huang's collaboration.

Top Co-Authors

Avatar
Top Co-Authors

Avatar

Xiaoqin Zou

University of Missouri

View shared research outputs
Top Co-Authors

Avatar
Top Co-Authors

Avatar
Top Co-Authors

Avatar
Top Co-Authors

Avatar

Yumeng Yan

Huazhong University of Science and Technology

View shared research outputs
Top Co-Authors

Avatar
Top Co-Authors

Avatar
Top Co-Authors

Avatar

Di Zhang

Huazhong University of Science and Technology

View shared research outputs
Top Co-Authors

Avatar
Researchain Logo
Decentralizing Knowledge