Network


Latest external collaboration on country level. Dive into details by clicking on the dots.

Hotspot


Dive into the research topics where Qian-Nan Hu is active.

Publication


Featured researches published by Qian-Nan Hu.


Bioinformatics | 2013

ChemoPy: freely available python package for computational biology and chemoinformatics

Dong-Sheng Cao; Qing-Song Xu; Qian-Nan Hu; Yi-Zeng Liang

MOTIVATION Molecular representation for small molecules has been routinely used in QSAR/SAR, virtual screening, database search, ranking, drug ADME/T prediction and other drug discovery processes. To facilitate extensive studies of drug molecules, we developed a freely available, open-source python package called chemoinformatics in python (ChemoPy) for calculating the commonly used structural and physicochemical features. It computes 16 drug feature groups composed of 19 descriptors that include 1135 descriptor values. In addition, it provides seven types of molecular fingerprint systems for drug molecules, including topological fingerprints, electro-topological state (E-state) fingerprints, MACCS keys, FP4 keys, atom pairs fingerprints, topological torsion fingerprints and Morgan/circular fingerprints. By applying a semi-empirical quantum chemistry program MOPAC, ChemoPy can also compute a large number of 3D molecular descriptors conveniently. AVAILABILITY The python package, ChemoPy, is freely available via http://code.google.com/p/pychem/downloads/list, and it runs on Linux and MS-Windows. SUPPLEMENTARY INFORMATION Supplementary data are available at Bioinformatics online.


Analytica Chimica Acta | 2012

Large-scale prediction of drug-target interactions using protein sequences and drug topological structures

Dong-Sheng Cao; Shao Liu; Qing-Song Xu; Hongmei Lu; Jian-Hua Huang; Qian-Nan Hu; Yi-Zeng Liang

The identification of interactions between drugs and target proteins plays a key role in the process of genomic drug discovery. It is both consuming and costly to determine drug-target interactions by experiments alone. Therefore, there is an urgent need to develop new in silico prediction approaches capable of identifying these potential drug-target interactions in a timely manner. In this article, we aim at extending current structure-activity relationship (SAR) methodology to fulfill such requirements. In some sense, a drug-target interaction can be regarded as an event or property triggered by many influence factors from drugs and target proteins. Thus, each interaction pair can be represented theoretically by using these factors which are based on the structural and physicochemical properties simultaneously from drugs and proteins. To realize this, drug molecules are encoded with MACCS substructure fingerings representing existence of certain functional groups or fragments; and proteins are encoded with some biochemical and physicochemical properties. Four classes of drug-target interaction networks in humans involving enzymes, ion channels, G-protein-coupled receptors (GPCRs) and nuclear receptors, are independently used for establishing predictive models with support vector machines (SVMs). The SVM models gave prediction accuracy of 90.31%, 88.91%, 84.68% and 83.74% for four datasets, respectively. In conclusion, the results demonstrate the ability of our proposed method to predict the drug-target interactions, and show a general compatibility between the new scheme and current SAR methodology. They open the way to a host of new investigations on the diversity analysis and prediction of drug-target interactions.


Analytica Chimica Acta | 2011

In silico classification of human maximum recommended daily dose based on modified random forest and substructure fingerprint

Dong-Sheng Cao; Qian-Nan Hu; Qing-Song Xu; Yan-Ning Yang; Jian-Chao Zhao; Hongmei Lu; Liang-Xiao Zhang; Yi-Zeng Liang

A modified random forest (RF) algorithm, as a novel machine learning technique, was developed to estimate the maximum recommended daily dose (MRDD) of a large and diverse pharmaceutical dataset for phase I human trials using substructure fingerprint descriptors calculated from simple molecular structure alone. This type of novel molecular descriptors encodes molecular structure in a series of binary bits that represent the presence or absence of particular substructures in the molecule and thereby can accurately and directly depict a series of local information hidden in this molecule. Two model validation approaches, 5-fold cross-validation and an independent validation set, were used for assessing the prediction capability of our models. The results obtained in this study indicate that the modified RF gave prediction accuracy of 80.45%, sensitivity of 75.08%, specificity of 84.85% for 5-fold cross-validation, and prediction accuracy of 80.5%, sensitivity of 76.47%, specificity of 83.48% for independent validation set, respectively, which are as a whole better than those by the original RF. At the same time, the important substructure fingerprints, recognized by the RF technique, gave some insights into the structure features related to toxicity of pharmaceuticals. This could help provide intuitive understanding for medicinal chemists.


PLOS ONE | 2013

Genome-Scale Screening of Drug-Target Associations Relevant to Ki Using a Chemogenomics Approach

Dong-Sheng Cao; Yi-Zeng Liang; Zhe Deng; Qian-Nan Hu; Min He; Qing-Song Xu; Guang Hua Zhou; Liu-Xia Zhang; Zixin Deng; Shao Liu

The identification of interactions between drugs and target proteins plays a key role in genomic drug discovery. In the present study, the quantitative binding affinities of drug-target pairs are differentiated as a measurement to define whether a drug interacts with a protein or not, and then a chemogenomics framework using an unbiased set of general integrated features and random forest (RF) is employed to construct a predictive model which can accurately classify drug-target pairs. The predictability of the model is further investigated and validated by several independent validation sets. The built model is used to predict drug-target associations, some of which were confirmed by comparing experimental data from public biological resources. A drug-target interaction network with high confidence drug-target pairs was also reconstructed. This network provides further insight for the action of drugs and targets. Finally, a web-based server called PreDPI-Ki was developed to predict drug-target interactions for drug discovery. In addition to providing a high-confidence list of drug-target associations for subsequent experimental investigation guidance, these results also contribute to the understanding of drug-target interactions. We can also see that quantitative information of drug-target associations could greatly promote the development of more accurate models. The PreDPI-Ki server is freely available via: http://sdd.whu.edu.cn/dpiki.


Sar and Qsar in Environmental Research | 2012

In silico toxicity prediction by support vector machine and SMILES representation-based string kernel

Dong-Sheng Cao; Jian-Chao Zhao; Yan-Ning Yang; C.-X. Zhao; Jun Yan; Shao Liu; Qian-Nan Hu; Qing-Song Xu; Yi-Zeng Liang

There is a great need to assess the harmful effects or toxicities of chemicals to which man is exposed. In the present paper, the simplified molecular input line entry specification (SMILES) representation-based string kernel, together with the state-of-the-art support vector machine (SVM) algorithm, were used to classify the toxicity of chemicals from the US Environmental Protection Agency Distributed Structure-Searchable Toxicity (DSSTox) database network. In this method, the molecular structure can be directly encoded by a series of SMILES substrings that represent the presence of some chemical elements and different kinds of chemical bonds (double, triple and stereochemistry) in the molecules. Thus, SMILES string kernel can accurately and directly measure the similarities of molecules by a series of local information hidden in the molecules. Two model validation approaches, five-fold cross-validation and independent validation set, were used for assessing the predictive capability of our developed models. The results obtained indicate that SVM based on the SMILES string kernel can be regarded as a very promising and alternative modelling approach for potential toxicity prediction of chemicals.


Analytica Chimica Acta | 2011

A novel kernel Fisher discriminant analysis: constructing informative kernel by decision tree ensemble for metabolomics data analysis.

Dong-Sheng Cao; Mao-Mao Zeng; Lunzhao Yi; Bing Wang; Qing-Song Xu; Qian-Nan Hu; Liang-Xiao Zhang; Hongmei Lu; Yi-Zeng Liang

Large amounts of data from high-throughput metabolomics experiments become commonly more and more complex, which brings an enormous amount of challenges to existing statistical modeling. Thus there is a need to develop statistically efficient approach for mining the underlying metabolite information contained by metabolomics data under investigation. In the work, we developed a novel kernel Fisher discriminant analysis (KFDA) algorithm by constructing an informative kernel based on decision tree ensemble. The constructed kernel can effectively encode the similarities of metabolomics samples between informative metabolites/biomarkers in specific parts of the measurement space. Simultaneously, informative metabolites or potential biomarkers can be successfully discovered by variable importance ranking in the process of building kernel. Moreover, KFDA can also deal with nonlinear relationship in the metabolomics data by such a kernel to some extent. Finally, two real metabolomics datasets together with a simulated data were used to demonstrate the performance of the proposed approach through the comparison of different approaches.


Journal of Biomolecular Screening | 2009

COMDECOM: Predicting the Lifetime of Screening Compounds in DMSO Solution

Emrin Zitha-Bovens; Peter Maas; Dick Wife; Johan Tijhuis; Qian-Nan Hu; Thomas Kleinöder; Johann Gasteiger

The technological evolution of the 1990s in both combinatorial chemistry and high-throughput screening created the demand for rapid access to the compound deck to support the screening process. The common strategy within the pharmaceutical industry is to store the screening library in DMSO solution. Several studies have shown that a percentage of these compounds decompose in solution, varying from a few percent of the total to a substantial part of the library. In the COMDECOM (COMpound DECOMposition) project, the compound stability of screening compounds in DMSO solution is monitored in an accelerated thermal, hydrolytic, and oxidative decomposition program. A large database with stability data is collected, and from this database, a predictive model is being developed. The aim of this program is to build an algorithm that can flag compounds that are likely to decompose—information that is considered to be of utmost importance (e.g., in the compound acquisition process and when evaluation screening results of library compounds, as well as in the determination of optimal storage conditions).


Journal of Chemometrics | 2012

Computer‐aided prediction of toxicity with substructure pattern and random forest

Dong-Sheng Cao; Yan-Ning Yang; Jian-Chao Zhao; Jun Yan; Shao Liu; Qian-Nan Hu; Qing-Song Xu; Yi-Zend Liang

Toxicity of chemicals induced by different factors is an important consideration, especially during the drug research and development process. Thus, there is urgent need to develop computationally effective models that can predict the toxicity or adverse effects of chemicals for a specific class of chemicals. In this study, random forest (RF) was used to classify five toxicity data sets from Distributed Structure‐Searchable Toxicity database network, using substructure fingerprints calculated directly from simple molecular structure. Three model validation approaches, out‐of‐bag validation incorporated in RF, fivefold cross‐validation, and an independent validation set, were used for assessing the prediction capability of our models. The chemical space analysis of data sets was explored by multidimensional scaling plots, and outlying molecules were also detected by the proximity measure in RF. At the same time, the important substructure fingerprints, recognized by the RF technique, gave some insights into the structure features related to toxicity of chemicals. The results obtained showed that these in silico classification models with substructure patterns and RF are applicable for potential toxicity prediction of chemical compounds. Copyright


BMC Systems Biology | 2015

Predicting target-ligand interactions using protein ligand-binding site and ligand substructures

Caihua Wang; Juan Liu; Fei Luo; Zixing Deng; Qian-Nan Hu

BackgroundCell proliferation, differentiation, Gene expression, metabolism, immunization and signal transduction require the participation of ligands and targets. It is a great challenge to identify rules governing molecular recognition between chemical topological substructures of ligands and the binding sites of the targets.MethodsWe suppose that the ligand-target interactions are determined by ligand substructures as well as the physical-chemical properties of the binding sites. Therefore, we propose a fragment interaction model (FIM) to describe the interactions between ligands and targets, with the purpose of facilitating the chemical interpretation of ligand-target binding. First we extract target-ligand complexes from sc-PDB database, based on which, we get the target binding sites and the ligands. Then we represent each binding site as a fragment vector based on a target fragment dictionary that is composed of 199 clusters (denoted as fragements in this work) obtained by clustering 4200 trimers according to their physical-chemical properties. And then, we represent each ligand as a substructure vector based on a dictionary containing 747 substructures. Finally, we build the FIM by generating the interaction matrix M (representing the fragment interaction network), and the FIM can later be used for predicting unknown ligand-target interactions as well as providing the binding details of the interactions.ResultsThe five-fold cross validation results show that the proposed model can get higher AUC score (92%) than three prevalence algorithms CS-PD (80%), BLM-NII (85%) and RF (85%), demonstrating the remarkable predictive ability of FIM. We also show that the ligand binding sites (local information) overweight the sequence similarities (global information) in ligand-target binding, and introducing too much global information would be harmful to the predictive ability. Moreover, The derived fragment interaction network can provide the chemical insights on the interactions.ConclusionsThe target and ligand bindings are local events, and the local information dominate the binding ability. Though integrating of the global information can promote the predictive ability, the role is very limited. The fragment interaction network is helpful for understanding the mechanism of the ligand-target interaction.


Bioinformatics | 2011

RxnFinder: biochemical reaction search engines using molecular structures, molecular fragments and reaction similarity

Qian-Nan Hu; Zhe Deng; Huanan Hu; Dong−Sheng Cao; Yi−Zeng Liang

SUMMARY Biochemical reactions play a key role to help sustain life and allow cells to grow. RxnFinder was developed to search biochemical reactions from KEGG reaction database using three search criteria: molecular structures, molecular fragments and reaction similarity. RxnFinder is helpful to get reference reactions for biosynthesis and xenobiotics metabolism. AVAILABILITY RxnFinder is freely available via: http://sdd.whu.edu.cn/rxnfinder. CONTACT [email protected].

Collaboration


Dive into the Qian-Nan Hu's collaboration.

Top Co-Authors

Avatar

Yi-Zeng Liang

Central South University

View shared research outputs
Top Co-Authors

Avatar
Top Co-Authors

Avatar

Dong-Sheng Cao

Central South University

View shared research outputs
Top Co-Authors

Avatar

Liang-Xiao Zhang

Dalian Institute of Chemical Physics

View shared research outputs
Top Co-Authors

Avatar

Jun Yan

Chinese Academy of Sciences

View shared research outputs
Top Co-Authors

Avatar
Top Co-Authors

Avatar
Top Co-Authors

Avatar

Xiao-Ling Peng

Hong Kong Baptist University

View shared research outputs
Top Co-Authors

Avatar
Top Co-Authors

Avatar

Kai-Tai Fang

United International College

View shared research outputs
Researchain Logo
Decentralizing Knowledge