Jiayu Gong
East China University of Science and Technology
Network
Latest external collaboration on country level. Dive into details by clicking on the dots.
Publication
Featured researches published by Jiayu Gong.
Nucleic Acids Research | 2010
Xiaofeng Liu; Sisheng Ouyang; Biao Yu; Yabo Liu; Kai Huang; Jiayu Gong; Siyuan Zheng; Zhihua Li; Honglin Li; Hualiang Jiang
In silico drug target identification, which includes many distinct algorithms for finding disease genes and proteins, is the first step in the drug discovery pipeline. When the 3D structures of the targets are available, the problem of target identification is usually converted to finding the best interaction mode between the potential target candidates and small molecule probes. Pharmacophore, which is the spatial arrangement of features essential for a molecule to interact with a specific target receptor, is an alternative method for achieving this goal apart from molecular docking method. PharmMapper server is a freely accessed web server designed to identify potential target candidates for the given small molecules (drugs, natural products or other newly discovered compounds with unidentified binding targets) using pharmacophore mapping approach. PharmMapper hosts a large, in-house repertoire of pharmacophore database (namely PharmTargetDB) annotated from all the targets information in TargetBank, BindingDB, DrugBank and potential drug target database, including over 7000 receptor-based pharmacophore models (covering over 1500 drug targets information). PharmMapper automatically finds the best mapping poses of the query molecule against all the pharmacophore models in PharmTargetDB and lists the top N best-fitted hits with appropriate target annotations, as well as respective molecule’s aligned poses are presented. Benefited from the highly efficient and robust triangle hashing mapping method, PharmMapper bears high throughput ability and only costs 1 h averagely to screen the whole PharmTargetDB. The protocol was successful in finding the proper targets among the top 300 pharmacophore candidates in the retrospective benchmarking test of tamoxifen. PharmMapper is available at http://59.78.96.61/pharmmapper.
Bioinformatics | 2013
Jiayu Gong; Chaoqian Cai; Xiaofeng Liu; Xin Ku; Hualiang Jiang; Daqi Gao; Honglin Li
SUMMARY ChemMapper is an online platform to predict polypharmacology effect and mode of action for small molecules based on 3D similarity computation. ChemMapper collects >350 000 chemical structures with bioactivities and associated target annotations (as well as >3 000 000 non-annotated compounds for virtual screening). Taking the user-provided chemical structure as the query, the top most similar compounds in terms of 3D similarity are returned with associated pharmacology annotations. ChemMapper is designed to provide versatile services in a variety of chemogenomics, drug repurposing, polypharmacology, novel bioactive compounds identification and scaffold hopping studies. AVAILABILITY http://lilab.ecust.edu.cn/chemmapper/. CONTACT [email protected] or [email protected] SUPPLEMENTARY INFORMATION Supplementary data are available at Bioinformatics online.
BMC Bioinformatics | 2013
Chuanxin Zou; Jiayu Gong; Honglin Li
BackgroundDNA-binding proteins (DNA-BPs) play a pivotal role in both eukaryotic and prokaryotic proteomes. There have been several computational methods proposed in the literature to deal with the DNA-BPs, many informative features and properties were used and proved to have significant impact on this problem. However the ultimate goal of Bioinformatics is to be able to predict the DNA-BPs directly from primary sequence.ResultsIn this work, the focus is how to transform these informative features into uniform numeric representation appropriately and improve the prediction accuracy of our SVM-based classifier for DNA-BPs. A systematic representation of some selected features known to perform well is investigated here. Firstly, four kinds of protein properties are obtained and used to describe the protein sequence. Secondly, three different feature transformation methods (OCTD, AC and SAA) are adopted to obtain numeric feature vectors from three main levels: Global, Nonlocal and Local of protein sequence and their performances are exhaustively investigated. At last, the mRMR-IFS feature selection method and ensemble learning approach are utilized to determine the best prediction model. Besides, the optimal features selected by mRMR-IFS are illustrated based on the observed results which may provide useful insights for revealing the mechanisms of protein-DNA interactions. For five-fold cross-validation over the DNAdset and DNAaset, we obtained an overall accuracy of 0.940 and 0.811, MCC of 0.881 and 0.614 respectively.ConclusionsThe good results suggest that it can efficiently develop an entirely sequence-based protocol that transforms and integrates informative features from different scales used by SVM to predict DNA-BPs accurately. Moreover, a novel systematic framework for sequence descriptor-based protein function prediction is proposed here.
Journal of Chemical Information and Modeling | 2016
Xia Wang; Chenxu Pan; Jiayu Gong; Xiaofeng Liu; Honglin Li
PharmMapper is a web server for drug target identification by reversed pharmacophore matching the query compound against an annotated pharmacophore model database, which provides a computational polypharmacology prediction approach for drug repurposing and side effect risk evaluation. But due to the inherent nondiscriminative feature of the simple fit scores used for prediction results ranking, the signal/noise ratio of the prediction results is high, posing a challenge for predictive reliability. In this paper, we improved the predictive accuracy of PharmMapper by generating a ligand-target pairwise fit score matrix from profiling all the annotated pharmacophore models against corresponding ligands in the original complex structures that were used to extract these pharmacophore models. The matrix reflects the noise baseline of fit score distribution of the background database, thus enabling estimation of the probability of finding a given target randomly with the calculated ligand-pharmacophore fit score. Two retrospective tests were performed which confirmed that the probability-based ranking score outperformed the simple fit score in terms of identification of both known drug targets and adverse drug reaction related off-targets.
Journal of Cheminformatics | 2014
Xia Wang; Haipeng Chen; Feng Yang; Jiayu Gong; Shiliang Li; Jianfeng Pei; Xiaofeng Liu; Hualiang Jiang; Luhua Lai; Honglin Li
BackgroundThe progress in computer-aided drug design (CADD) approaches over the past decades accelerated the early-stage pharmaceutical research. Many powerful standalone tools for CADD have been developed in academia. As programs are developed by various research groups, a consistent user-friendly online graphical working environment, combining computational techniques such as pharmacophore mapping, similarity calculation, scoring, and target identification is needed.ResultsWe presented a versatile, user-friendly, and efficient online tool for computer-aided drug design based on pharmacophore and 3D molecular similarity searching. The web interface enables binding sites detection, virtual screening hits identification, and drug targets prediction in an interactive manner through a seamless interface to all adapted packages (e.g., Cavity, PocketV.2, PharmMapper, SHAFTS). Several commercially available compound databases for hit identification and a well-annotated pharmacophore database for drug targets prediction were integrated in iDrug as well. The web interface provides tools for real-time molecular building/editing, converting, displaying, and analyzing. All the customized configurations of the functional modules can be accessed through featured session files provided, which can be saved to the local disk and uploaded to resume or update the history work.ConclusionsiDrug is easy to use, and provides a novel, fast and reliable tool for conducting drug design experiments. By using iDrug, various molecular design processing tasks can be submitted and visualized simply in one browser without installing locally any standalone modeling softwares. iDrug is accessible free of charge at http://lilab.ecust.edu.cn/idrug.
Measurement Science and Technology | 2012
Daqi Gao; Jiuming Ji; Jiayu Gong; Chaoqian Cai
This paper sets up an improved electronic nose with an automatic sampling mode, large volumetric vapors and constant temperature for headspace vapors and gas sensor array. In order to facilitate the fast recovery and good repeatability of gas sensors, the steps taken include (A) short-time contact with odors measured; (B) long-time purification using environmental air; (C) exact calibration using clean air before sampling. We employ multiple single-output perceptrons to discriminate and quantify multiple kinds of odors. This task is first regarded as multiple two-class discrimination problems and then multiple quantification problems, and accomplished by multiple single-output perceptrons followed by multiple single-output perceptrons. The experimental results for measuring and quantifying 12 kinds of volatile organic compounds with changing concentrations show that the type of electronic nose with a hierarchical perceptron model has a simple structure, easy operation, good repeatability and good discrimination and quantification performance.
Journal of Molecular Modeling | 2012
Chaoqian Cai; Jiayu Gong; Xiaofeng Liu; Hualiang Jiang; Daqi Gao; Honglin Li
AbstractA novel molecular shape similarity comparison method, namely SHeMS, derived from spherical harmonic (SH) expansion, is presented in this study. Through weight optimization using genetic algorithms for a customized reference set, the optimal combination of weights for the translationally and rotationally invariant (TRI) SH shape descriptor, which can specifically and effectively distinguish overall and detailed shape features according to the molecular surface, is obtained for each molecule. This method features two key aspects: firstly, the SH expansion coefficients from different bands are weighted to calculate similarity, leading to a distinct contribution of overall and detailed features to the final score, and thus can be better tailored for each specific system under consideration. Secondly, the reference set for optimization can be totally configured by the user, which produces great flexibility, allowing system-specific and customized comparisons. The directory of useful decoys (DUD) database was adopted to validate and test our method, and principal component analysis (PCA) reveals that SH descriptors for shape comparison preserve sufficient information to separate actives from decoys. The results of virtual screening indicate that the proposed method based on optimal SH descriptor weight combinations represents a great improvement in performance over original SH (OSH) and ultra-fast shape recognition (USR) methods, and is comparable to many other popular methods. Through combining efficient shape similarity comparison with SH expansion method, and other aspects such as chemical and pharmacophore features, SHeMS can play a significant role in this field and can be applied practically to virtual screening by means of similarity comparison with 3D shapes of known active compounds or the binding pockets of target proteins. FigureSchematic diagram of spherical harmonic (SH) based weighted similarity calculation. First, molecular surfaces are projected to groups of SH producing a series of projection coefficients which are used to calculate SH descriptors. Then, a genetic algorithm based searching process will be carried out, producing a group of optimal weights which can best separate actives from negatives. Finally, combining the SH descriptor and corresponding weights, a similarity score was calculated and used to rank the candidate molecules
Journal of Chemical Theory and Computation | 2011
Li Liu; Xiaofeng Liu; Jiayu Gong; Hualiang Jiang; Honglin Li
All-atom normal mode analysis (NMA) is an efficient way to predict the collective motions in a given macromolecule, which is essential for the understanding of protein biological function and drug design. However, the calculations are limited in time scale mainly because the required diagonalization of the Hessian matrix by Householder-QR transformation is a computationally exhausting task. In this paper, we demonstrate the parallel computing power of the graphics processing unit (GPU) in NMA by mapping Householder-QR transformation onto GPU using Compute Unified Device Architecture (CUDA). The results revealed that the GPU-accelerated all-atom NMA could reduce the runtime of diagonalization significantly and achieved over 20× speedup over CPU-based NMA. In addition, we analyzed the influence of precision on both the performance and the accuracy of GPU. Although the performance of GPU with double precision is weaker than that with single precision in theory, more accurate results and an acceptable speedup of double precision were obtained in our approach by reducing the data transfer time to a minimum. Finally, the inherent drawbacks of GPU and the corresponding solution to deal with the limitation in computational scale are also discussed in this study.
Journal of Chemical Information and Modeling | 2013
Chaoqian Cai; Jiayu Gong; Xiaofeng Liu; Daqi Gao; Honglin Li
In this study, a Gaussian volume overlap and chemical feature based molecular similarity metric was devised, and a downhill simplex searching was carried out to evaluate the corresponding similarity. By representing the shapes of both the candidate small molecules and the binding site with chemical features and comparing the corresponding Gaussian volumes overlaps, the active compounds could be identified. These two aspects compose the proposed method named SimG which supports both structure-based and ligand-based strategies. The validity of the proposed method was examined by analyzing the similarity score variation between actives and decoys as well as correlation among distinct reference methods. A retrospective virtual screening test was carried out on DUD data sets, demonstrating that the performance of structure-based shape matching virtual screening in DUD data sets is substantially dependent on some physical properties, especially the solvent-exposure extent of the binding site: The enrichments of targets with less solvent-exposed binding sites generally exceeds that of the one with more solvent-exposed binding sites and even surpasses the corresponding ligand-based virtual screening.
Combinatorial Chemistry & High Throughput Screening | 2016
Shiliang Li; Xiaojuan Yu; Chuanxin Zou; Jiayu Gong; Xiujie Liu; Honglin Li
Identification of potential druggable targets utilizing protein-protein interactions network (PPIN) has been emerging as a hotspot in drug discovery and development research. However, it remains unclear whether the currently used PPIN topological properties are enough to discriminate the drug targets from non-drug targets. In this study, three-step classification models using different network topological properties were designed and implemented using support vector machine (SVM) to compare the enrichment of known drug targets from non-targets. Surprisingly, none of the models was able to identify more than 75% of the true targets in the test set. It appears that the currently used simple PPIN topological properties are not likely robust enough for prediction of potential drug targets with high confidence, which also echoes similarly unsatisfying prediction data reported previously. However, we proposed that quality and quantity improvement of the protein-protein interactions (PPI) data for model training will help increasing the prediction accuracy.