Network


Latest external collaboration on country level. Dive into details by clicking on the dots.

Hotspot


Dive into the research topics where L. Y. Han is active.

Publication


Featured researches published by L. Y. Han.


Nucleic Acids Research | 2003

SVM-Prot: web-based support vector machine software for functional classification of a protein from its primary sequence

C. Z. Cai; L. Y. Han; Zhi Liang Ji; Xi Chen; Yu Zong Chen

Prediction of protein function is of significance in studying biological processes. One approach for function prediction is to classify a protein into functional family. Support vector machine (SVM) is a useful method for such classification, which may involve proteins with diverse sequence distribution. We have developed a web-based software, SVMProt, for SVM classification of a protein into functional family from its primary sequence. SVMProt classification system is trained from representative proteins of a number of functional families and seed proteins of Pfam curated protein families. It currently covers 54 functional families and additional families will be added in the near future. The computed accuracy for protein family classification is found to be in the range of 69.1-99.6%. SVMProt shows a certain degree of capability for the classification of distantly related proteins and homologous proteins of different function and thus may be used as a protein function prediction tool that complements sequence alignment methods. SVMProt can be accessed at http://jing.cz3.nus.edu.sg/cgi-bin/svmprot.cgi.


Proteins | 2004

Enzyme family classification by support vector machines

C. Z. Cai; L. Y. Han; Zhi Liang Ji; Yu Zong Chen

One approach for facilitating protein function prediction is to classify proteins into functional families. Recent studies on the classification of G‐protein coupled receptors and other proteins suggest that a statistical learning method, Support vector machines (SVM), may be potentially useful for protein classification into functional families. In this work, SVM is applied and tested on the classification of enzymes into functional families defined by the Enzyme Nomenclature Committee of IUBMB. SVM classification system for each family is trained from representative enzymes of that family and seed proteins of Pfam curated protein families. The classification accuracy for enzymes from 46 families and for non‐enzymes is in the range of 50.0% to 95.7% and 79.0% to 100% respectively. The corresponding Matthews correlation coefficient is in the range of 54.1% to 96.1%. Moreover, 80.3% of the 8,291 correctly classified enzymes are uniquely classified into a specific enzyme family by using a scoring function, indicating that SVM may have certain level of unique prediction capability. Testing results also suggest that SVM in some cases is capable of classification of distantly related enzymes and homologous enzymes of different functions. Effort is being made to use a more comprehensive set of enzymes as training sets and to incorporate multi‐class SVM classification systems to further enhance the unique prediction accuracy. Our results suggest the potential of SVM for enzyme family classification and for facilitating protein function prediction. Our software is accessible at http://jing.cz3.nus.edu.sg/cgi‐bin/svmprot.cgi. Proteins 2004.


Pharmacological Reviews | 2006

Therapeutic Targets: Progress of Their Exploration and Investigation of Their Characteristics

C. J. Zheng; L. Y. Han; Chun Wei Yap; Zhi Liang Ji; Z. W. Cao; Yu Zong Chen

Modern drug discovery is primarily based on the search and subsequent testing of drug candidates acting on a preselected therapeutic target. Progress in genomics, protein structure, proteomics, and disease mechanisms has led to a growing interest in and effort for finding new targets and more effective exploration of existing targets. The number of reported targets of marketed and investigational drugs has significantly increased in the past 8 years. There are 1535 targets collected in the therapeutic target database compared with ∼500 targets reported in a 1996 review. Knowledge of these targets is helpful for molecular dissection of the mechanism of action of drugs and for predicting features that guide new drug design and the search for new targets. This article summarizes the progress of target exploration and investigates the characteristics of the currently explored targets to analyze their sequence, structure, family representation, pathway association, tissue distribution, and genome location features for finding clues useful for searching for new targets. Possible “rules” to guide the search for druggable proteins and the feasibility of using a statistical learning method for predicting druggable proteins directly from their sequences are discussed.


Nucleic Acids Research | 2010

Update of TTD: Therapeutic Target Database

Feng Zhu; Bu-Cong Han; Pankaj Kumar; Xianghui Liu; Xiao Hua Ma; Xiaona Wei; Lu Huang; YangFan Guo; L. Y. Han; C. J. Zheng; Yu Zong Chen

Increasing numbers of proteins, nucleic acids and other molecular entities have been explored as therapeutic targets, hundreds of which are targets of approved and clinical trial drugs. Knowledge of these targets and corresponding drugs, particularly those in clinical uses and trials, is highly useful for facilitating drug discovery. Therapeutic Target Database (TTD) has been developed to provide information about therapeutic targets and corresponding drugs. In order to accommodate increasing demand for comprehensive knowledge about the primary targets of the approved, clinical trial and experimental drugs, numerous improvements and updates have been made to TTD. These updates include information about 348 successful, 292 clinical trial and 1254 research targets, 1514 approved, 1212 clinical trial and 2302 experimental drugs linked to their primary targets (3382 small molecule and 649 antisense drugs with available structure and sequence), new ways to access data by drug mode of action, recursive search of related targets or drugs, similarity target and drug searching, customized and whole data download, standardized target ID, and significant increase of data (1894 targets, 560 diseases and 5028 drugs compared with the 433 targets, 125 diseases and 809 drugs in the original release described in previous paper). This database can be accessed at http://bidd.nus.edu.sg/group/cjttd/TTD.asp.


British Journal of Pharmacology | 2009

Database of traditional Chinese medicine and its application to studies of mechanism and to prescription validation

Xi Chen; Hufeng Zhou; Y B Liu; J. F. Wang; H. Li; Choong Yong Ung; L. Y. Han; Z. W. Cao; Yu Zong Chen

Traditional Chinese Medicine (TCM) is widely practised and is viewed as an attractive alternative to conventional medicine. Quantitative information about TCM prescriptions, constituent herbs and herbal ingredients is necessary for studying and exploring TCM.


Drug Discovery Today | 2009

Synergistic therapeutic actions of herbal ingredients and their mechanisms from molecular interaction and network perspectives.

Xiao Hua Ma; C. J. Zheng; L. Y. Han; B. Xie; Jia Jia; Z. W. Cao; Yixue X. Li; Yu Zong Chen

Opinions about the therapeutic efficacy of medicinal herbs differ significantly. Some reported herbal efficacies at low doses of active ingredients suggest a need for investigating whether these are because of placebo or multi-ingredient synergistic effects. This review discusses the opinions, methods and outcomes of herbal synergism investigations and analyzes indications from 48 in vivo tests and 106 rigorous clinical trials. Analyses of ingredient-mediated interactions at molecular and pathway levels indicate multi-ingredient synergism in 27 of the 39 reported cases of herbal synergism with available ingredient information. Synergistic actions may be responsible for the therapeutic efficacy of a substantial number of herbal products and their mechanisms may be studied by analyzing ingredient-mediated molecular interactions and network regulation.


Journal of Molecular Graphics & Modelling | 2008

A support vector machines approach for virtual screening of active compounds of single and multiple mechanisms from large libraries at an improved hit-rate and enrichment factor.

L. Y. Han; Xiao Hua Ma; Honghuang Lin; Jia Jia; Feng Zhu; Y. Xue; Ze Rong Li; Z. W. Cao; Zhi Liang Ji; Yu Zong Chen

Support vector machines (SVM) and other machine-learning (ML) methods have been explored as ligand-based virtual screening (VS) tools for facilitating lead discovery. While exhibiting good hit selection performance, in screening large compound libraries, these methods tend to produce lower hit-rate than those of the best performing VS tools, partly because their training-sets contain limited spectrum of inactive compounds. We tested whether the performance of SVM can be improved by using training-sets of diverse inactive compounds. In retrospective database screening of active compounds of single mechanism (HIV protease inhibitors, DHFR inhibitors, dopamine antagonists) and multiple mechanisms (CNS active agents) from large libraries of 2.986 million compounds, the yields, hit-rates, and enrichment factors of our SVM models are 52.4-78.0%, 4.7-73.8%, and 214-10,543, respectively, compared to those of 62-95%, 0.65-35%, and 20-1200 by structure-based VS and 55-81%, 0.2-0.7%, and 110-795 by other ligand-based VS tools in screening libraries of >or=1 million compounds. The hit-rates are comparable and the enrichment factors are substantially better than the best results of other VS tools. 24.3-87.6% of the predicted hits are outside the known hit families. SVM appears to be potentially useful for facilitating lead discovery in VS of large compound libraries.


Journal of Chemical Information and Modeling | 2006

PEARLS: program for energetic analysis of receptor-ligand system.

L. Y. Han; Honghuang Lin; Ze-Rong Li; C. J. Zheng; Zhi Wei Cao; B. Xie; Yu Zong Chen

Analysis of the energetics of small molecule ligand-protein, ligand-nucleic acid, and protein-nucleic acid interactions facilitates the quantitative understanding of molecular interactions that regulate the function and conformation of proteins. It has also been extensively used for ranking potential new ligands in virtual drug screening. We developed a Web-based software, PEARLS (Program for Energetic Analysis of Ligand-Receptor Systems), for computing interaction energies of ligand-protein, ligand-nucleic acid, protein-nucleic acid, and ligand-protein-nucleic acid complexes from their 3D structures. AMBER molecular force field, Morse potential, and empirical energy functions are used to compute the van der Waals, electrostatic, hydrogen bond, metal-ligand bonding, and water-mediated hydrogen bond energies between the binding molecules. The change in the solvation free energy of molecular binding is estimated by using an empirical solvation free energy model. Contribution from ligand conformational entropy change is also estimated by a simple model. The computed free energy for a number of PDB ligand-receptor complexes were studied and compared to experimental binding affinity. A substantial degree of correlation between the computed free energy and experimental binding affinity was found, which suggests that PEARLS may be useful in facilitating energetic analysis of ligand-protein, ligand-nucleic acid, and protein-nucleic acid interactions. PEARLS can be accessed at http://ang.cz3.nus.edu.sg/cgi-bin/prog/rune.pl.


BMC Bioinformatics | 2006

Prediction of the functional class of metal-binding proteins from sequence derived physicochemical properties by support vector machine approach

Honghuang Lin; L. Y. Han; Hua Zhang; C. J. Zheng; B. Xie; Zhi Wei Cao; Yu Zong Chen

Metal-binding proteins play important roles in structural stability, signaling, regulation, transport, immune response, metabolism control, and metal homeostasis. Because of their functional and sequence diversity, it is desirable to explore additional methods for predicting metal-binding proteins irrespective of sequence similarity. This work explores support vector machines (SVM) as such a method. SVM prediction systems were developed by using 53,333 metal-binding and 147,347 non-metal-binding proteins, and evaluated by an independent set of 31,448 metal-binding and 79,051 non-metal-binding proteins. The computed prediction accuracy is 86.3%, 81.6%, 83.5%, 94.0%, 81.2%, 85.4%, 77.6%, 90.4%, 90.9%, 74.9% and 78.1% for calcium-binding, cobalt-binding, copper-binding, iron-binding, magnesium-binding, manganese-binding, nickel-binding, potassium-binding, sodium-binding, zinc-binding, and all metal-binding proteins respectively. The accuracy for the non-member proteins of each class is 88.2%, 99.9%, 98.1%, 91.4%, 87.9%, 94.5%, 99.2%, 99.9%, 99.9%, 98.0%, and 88.0% respectively. Comparable accuracies were obtained by using a different SVM kernel function. Our method predicts 67% of the 87 metal-binding proteins non-homologous to any protein in the Swissprot database and 85.3% of the 333 proteins of known metal-binding domains as metal-binding. These suggest the usefulness of SVM for facilitating the prediction of metal-binding proteins. Our software can be accessed at the SVMProt server http://jing.cz3.nus.edu.sg/cgi-bin/svmprot.cgi.


Proteins | 2005

Prediction of transporter family from protein sequence by support vector machine approach

Honghuang Lin; L. Y. Han; C. Z. Cai; Zhi Liang Ji; Yu Zong Chen

Transporters play key roles in cellular transport and metabolic processes, and in facilitating drug delivery and excretion. These proteins are classified into families based on the transporter classification (TC) system. Determination of the TC family of transporters facilitates the study of their cellular and pharmacological functions. Methods for predicting TC family without sequence alignments or clustering are particularly useful for studying novel transporters whose function cannot be determined by sequence similarity. This work explores the use of a machine learning method, support vector machines (SVMs), for predicting the family of transporters from their sequence without the use of sequence similarity. A total of 10,636 transporters in 13 TC subclasses, 1914 transporters in eight TC families, and 168,341 nontransporter proteins are used to train and test the SVM prediction system. Testing results by using a separate set of 4351 transporters and 83,151 nontransporter proteins show that the overall accuracy for predicting members of these TC subclasses and families is 83.4% and 88.0%, respectively, and that of nonmembers is 99.3% and 96.6%, respectively. The accuracies for predicting members and nonmembers of individual TC subclasses are in the range of 70.7–96.1% and 97.6–99.9%, respectively, and those of individual TC families are in the range of 60.6–97.1% and 91.5–99.4%, respectively. A further test by using 26,139 transmembrane proteins outside each of the 13 TC subclasses shows that 90.4–99.6% of these are correctly predicted. Our study suggests that the SVM is potentially useful for facilitating functional study of transporters irrespective of sequence similarity. Proteins 2006.

Collaboration


Dive into the L. Y. Han's collaboration.

Top Co-Authors

Avatar

Yu Zong Chen

National University of Singapore

View shared research outputs
Top Co-Authors

Avatar

C. J. Zheng

National University of Singapore

View shared research outputs
Top Co-Authors

Avatar
Top Co-Authors

Avatar

B. Xie

National University of Singapore

View shared research outputs
Top Co-Authors

Avatar
Top Co-Authors

Avatar

Juan Cui

University of Nebraska–Lincoln

View shared research outputs
Top Co-Authors

Avatar

Z. W. Cao

National University of Singapore

View shared research outputs
Top Co-Authors

Avatar

Chun Wei Yap

National University of Singapore

View shared research outputs
Top Co-Authors

Avatar

Hua Zhang

Nanyang Technological University

View shared research outputs
Top Co-Authors

Avatar

C. Z. Cai

National University of Singapore

View shared research outputs
Researchain Logo
Decentralizing Knowledge