Network


Latest external collaboration on country level. Dive into details by clicking on the dots.

Hotspot


Dive into the research topics where Zhisong He is active.

Publication


Featured researches published by Zhisong He.


PLOS ONE | 2010

Predicting drug-target interaction networks based on functional groups and biological features.

Zhisong He; Jian Zhang; Xiao-He Shi; Le-Le Hu; Xiangyin Kong; Yu-Dong Cai; Kuo-Chen Chou

Background Study of drug-target interaction networks is an important topic for drug development. It is both time-consuming and costly to determine compound-protein interactions or potential drug-target interactions by experiments alone. As a complement, the in silico prediction methods can provide us with very useful information in a timely manner. Methods/Principal Findings To realize this, drug compounds are encoded with functional groups and proteins encoded by biological features including biochemical and physicochemical properties. The optimal feature selection procedures are adopted by means of the mRMR (Maximum Relevance Minimum Redundancy) method. Instead of classifying the proteins as a whole family, target proteins are divided into four groups: enzymes, ion channels, G-protein- coupled receptors and nuclear receptors. Thus, four independent predictors are established using the Nearest Neighbor algorithm as their operation engine, with each to predict the interactions between drugs and one of the four protein groups. As a result, the overall success rates by the jackknife cross-validation tests achieved with the four predictors are 85.48%, 80.78%, 78.49%, and 85.66%, respectively. Conclusion/Significance Our results indicate that the network prediction system thus established is quite promising and encouraging.


PLOS ONE | 2010

Analysis and Prediction of the Metabolic Stability of Proteins Based on Their Sequential Features, Subcellular Locations and Interaction Networks

Tao Huang; Xiao-He Shi; Ping Wang; Zhisong He; Kai-Yan Feng; Le-Le Hu; Xiangyin Kong; Yixue Li; Yu-Dong Cai; Kuo-Chen Chou

The metabolic stability is a very important idiosyncracy of proteins that is related to their global flexibility, intramolecular fluctuations, various internal dynamic processes, as well as many marvelous biological functions. Determination of proteins metabolic stability would provide us with useful information for in-depth understanding of the dynamic action mechanisms of proteins. Although several experimental methods have been developed to measure proteins metabolic stability, they are time-consuming and more expensive. Reported in this paper is a computational method, which is featured by (1) integrating various properties of proteins, such as biochemical and physicochemical properties, subcellular locations, network properties and protein complex property, (2) using the mRMR (Maximum Relevance & Minimum Redundancy) principle and the IFS (Incremental Feature Selection) procedure to optimize the prediction engine, and (3) being able to identify proteins among the four types: “short”, “medium”, “long”, and “extra-long” half-life spans. It was revealed through our analysis that the following seven characters played major roles in determining the stability of proteins: (1) KEGG enrichment scores of the protein and its neighbors in network, (2) subcellular locations, (3) polarity, (4) amino acids composition, (5) hydrophobicity, (6) secondary structure propensity, and (7) the number of protein complexes the protein involved. It was observed that there was an intriguing correlation between the predicted metabolic stability of some proteins and the real half-life of the drugs designed to target them. These findings might provide useful insights for designing protein-stability-relevant drugs. The computational method can also be used as a large-scale tool for annotating the metabolic stability for the avalanche of protein sequences generated in the post-genomic age.


PLOS ONE | 2010

Prediction of deleterious non-synonymous SNPs based on protein interaction network and hybrid properties.

Tao Huang; Ping Wang; Zhi-Qiang Ye; Heng Xu; Zhisong He; Kai-Yan Feng; Le-Le Hu; Weiren Cui; Kai Wang; Xiao Dong; Lu Xie; Xiangyin Kong; Yu-Dong Cai; Yixue Li

Non-synonymous SNPs (nsSNPs), also known as Single Amino acid Polymorphisms (SAPs) account for the majority of human inherited diseases. It is important to distinguish the deleterious SAPs from neutral ones. Most traditional computational methods to classify SAPs are based on sequential or structural features. However, these features cannot fully explain the association between a SAP and the observed pathophysiological phenotype. We believe the better rationale for deleterious SAP prediction should be: If a SAP lies in the protein with important functions and it can change the protein sequence and structure severely, it is more likely related to disease. So we established a method to predict deleterious SAPs based on both protein interaction network and traditional hybrid properties. Each SAP is represented by 472 features that include sequential features, structural features and network features. Maximum Relevance Minimum Redundancy (mRMR) method and Incremental Feature Selection (IFS) were applied to obtain the optimal feature set and the prediction model was Nearest Neighbor Algorithm (NNA). In jackknife cross-validation, 83.27% of SAPs were correctly predicted when the optimized 263 features were used. The optimized predictor with 263 features was also tested in an independent dataset and the accuracy was still 80.00%. In contrast, SIFT, a widely used predictor of deleterious SAPs based on sequential features, has a prediction accuracy of 71.05% on the same dataset. In our study, network features were found to be most important for accurate prediction and can significantly improve the prediction performance. Our results suggest that the protein interaction context could provide important clues to help better illustrate SAPs functional association. This research will facilitate the post genome-wide association studies.


RNA | 2014

Conserved expression of lincRNA during human and macaque prefrontal cortex development and maturation

Zhisong He; Hindrike Bammann; Dingding Han; Gangcai Xie; Philipp Khaitovich

The current annotation of the human genome includes more than 12,000 long intergenic noncoding RNAs (lincRNA). While a handful of lincRNA have been shown to play important regulatory roles, the functionality of most remains unclear. Here, we examined the expression conservation and putative functionality of lincRNA in human and macaque prefrontal cortex (PFC) development and maturation. We analyzed transcriptome sequence (RNA-seq) data from 38 human and 40 macaque individuals covering the entire postnatal development interval. Using the human data set, we detected the expression of 5835 lincRNA annotated in GENCODE and further identified 1888 novel lincRNA. Most of these lincRNA show low DNA sequence conservation, as well as low expression levels. Remarkably, developmental expression patterns of these lincRNA were as conserved between humans and macaques as those of protein-coding genes. Transfection of development-associated lincRNA into human SH-SY5Y cells affected gene expression, indicating their regulatory potential. In brain, expression of these putative target genes correlated with the expression of the corresponding lincRNA during human and macaque PFC development. These results support the potential functionality of lincRNA in primate PFC development.


eLife | 2016

Differences and similarities between human and chimpanzee neural progenitors during cerebral cortex development

Felipe Mora-Bermúdez; Farhath Badsha; Sabina Kanton; J. Gray Camp; Benjamin Vernot; Kathrin Köhler; Birger Voigt; Keisuke Okita; Tomislav Maricic; Zhisong He; R Lachmann; Svante Pääbo; Barbara Treutlein; Wieland B. Huttner

Human neocortex expansion likely contributed to the remarkable cognitive abilities of humans. This expansion is thought to primarily reflect differences in proliferation versus differentiation of neural progenitors during cortical development. Here, we have searched for such differences by analysing cerebral organoids from human and chimpanzees using immunohistofluorescence, live imaging, and single-cell transcriptomics. We find that the cytoarchitecture, cell type composition, and neurogenic gene expression programs of humans and chimpanzees are remarkably similar. Notably, however, live imaging of apical progenitor mitosis uncovered a lengthening of prometaphase-metaphase in humans compared to chimpanzees that is specific to proliferating progenitors and not observed in non-neural cells. Consistent with this, the small set of genes more highly expressed in human apical progenitors points to increased proliferative capacity, and the proportion of neurogenic basal progenitors is lower in humans. These subtle differences in cortical progenitors between humans and chimpanzees may have consequences for human neocortex evolution. DOI: http://dx.doi.org/10.7554/eLife.18683.001


Genome Biology | 2015

Evaluating intra- and inter-individual variation in the human placental transcriptome.

David A. Hughes; Martin Kircher; Zhisong He; Song Guo; Genevieve L Fairbrother; Carlos S. Moreno; Philipp Khaitovich; Mark Stoneking

BackgroundGene expression variation is a phenotypic trait of particular interest as it represents the initial link between genotype and other phenotypes. Analyzing how such variation apportions among and within groups allows for the evaluation of how genetic and environmental factors influence such traits. It also provides opportunities to identify genes and pathways that may have been influenced by non-neutral processes. Here we use a population genetics framework and next generation sequencing to evaluate how gene expression variation is apportioned among four human groups in a natural biological tissue, the placenta.ResultsWe estimate that on average, 33.2%, 58.9%, and 7.8% of the placental transcriptome is explained by variation within individuals, among individuals, and among human groups, respectively. Additionally, when technical and biological traits are included in models of gene expression they each account for roughly 2% of total gene expression variation. Notably, the variation that is significantly different among groups is enriched in biological pathways associated with immune response, cell signaling, and metabolism. Many biological traits demonstrate correlated changes in expression in numerous pathways of potential interest to clinicians and evolutionary biologists. Finally, we estimate that the majority of the human placental transcriptome exhibits expression profiles consistent with neutrality; the remainder are consistent with stabilizing selection, directional selection, or diversifying selection.ConclusionsWe apportion placental gene expression variation into individual, population, and biological trait factors and identify how each influence the transcriptome. Additionally, we advance methods to associate expression profiles with different forms of selection.


Protein and Peptide Letters | 2013

A Sequence-based Approach for Predicting Protein Disordered Regions

Tao Huang; Zhisong He; Weiren Cui; Yu-Dong Cai; Xiao-He Shi; Le-Le Hu; Kuo-Chen Chou

Protein disordered regions are associated with some critical cellular functions such as transcriptional regulation, translation and cellular signal transduction, and they are responsible for various diseases. Although experimental methods have been developed to determine these regions, they are time-consuming and expensive. Therefore, it is highly desired to develop computational methods that can provide us with this kind information in a rapid and inexpensive manner. Here we propose a sequence-based computational approach for predicting protein disordered regions by means of the Nearest Neighbor algorithm, in which conservation, amino acid factor and secondary structure status of each amino acid in a fixed-length sliding window are taken as the encoding features. Also, the feature selection based on mRMR (maximum Relevancy Minimum Redundancy) is applied to obtain an optimal 51-feature set that includes 39 conservation features and 12 secondary structure features. With the optimal 51 features, our predictor yielded quite promising MCC (Mathews correlation coefficients): 0.371 on a rigorous benchmark dataset tested by 5-fold cross-validation and 0.219 on an independent test dataset. Our results suggest that conservation and secondary structure play important roles in intrinsically disordered proteins.


PLOS ONE | 2012

Cooperativity among short amyloid stretches in long amyloidogenic sequences.

Le-Le Hu; Weiren Cui; Zhisong He; Xiao-He Shi; Kai-Yan Feng; Buyong Ma; Yu-Dong Cai

Amyloid fibrillar aggregates of polypeptides are associated with many neurodegenerative diseases. Short peptide segments in protein sequences may trigger aggregation. Identifying these stretches and examining their behavior in longer protein segments is critical for understanding these diseases and obtaining potential therapies. In this study, we combined machine learning and structure-based energy evaluation to examine and predict amyloidogenic segments. Our feature selection method discovered that windows consisting of long amino acid segments of ∼30 residues, instead of the commonly used short hexapeptides, provided the highest accuracy. Weighted contributions of an amino acid at each position in a 27 residue window revealed three cooperative regions of short stretch, resemble the β-strand-turn-β-strand motif in A-βpeptide amyloid and β-solenoid structure of HET-s(218–289) prion (C). Using an in-house energy evaluation algorithm, the interaction energy between two short stretches in long segment is computed and incorporated as an additional feature. The algorithm successfully predicted and classified amyloid segments with an overall accuracy of 75%. Our study revealed that genome-wide amyloid segments are not only dependent on short high propensity stretches, but also on nearby residues.


Medicinal Chemistry | 2010

Using Compound Similarity and Functional Domain Composition for Prediction of Drug-Target Interaction Networks

Lei Chen; Zhisong He; Tao Huang; Yu-Dong Cai

Study of interactions between drugs and target proteins is an essential step in genomic drug discovery. It is very hard to determine the compound-protein interactions or drug-target interactions by experiment alone. As supplementary, effective prediction model using machine learning or data mining methods can provide much help. In this study, a prediction method based on Nearest Neighbor Algorithm and a novel metric, which was obtained by combining compound similarity and functional domain composition, was proposed. The target proteins were divided into the following groups: enzymes, ion channels, G protein-coupled receptors, and nuclear receptors. As a result, four predictors with the optimal parameters were established. The overall prediction accuracies, evaluated by jackknife cross-validation test, for four groups of target proteins are 90.23%, 94.74%, 97.80%, and 97.51%, respectively, indicating that compound similarity and functional domain composition are very effective to predict drug-target interaction networks.


Nature Neuroscience | 2017

Comprehensive transcriptome analysis of neocortical layers in humans, chimpanzees and macaques

Zhisong He; Dingding Han; Olga Efimova; Patricia Guijarro; Qianhui Yu; Anna Oleksiak; Shasha Jiang; K. V. Anokhin; Boris M. Velichkovsky; Stefan Grünewald; Philipp Khaitovich

While human cognitive abilities are clearly unique, underlying changes in brain organization and function remain unresolved. Here we characterized the transcriptome of the cortical layers and adjacent white matter in the prefrontal cortexes of humans, chimpanzees and rhesus macaques using unsupervised sectioning followed by RNA sequencing. More than 20% of detected genes were expressed predominantly in one layer, yielding 2,320 human layer markers. While the bulk of the layer markers were conserved among species, 376 switched their expression to another layer in humans. By contrast, only 133 of such changes were detected in the chimpanzee brain, suggesting acceleration of cortical reorganization on the human evolutionary lineage. Immunohistochemistry experiments further showed that human-specific expression changes were not limited to neurons but affected a broad spectrum of cortical cell types. Thus, despite apparent histological conservation, human neocortical organization has undergone substantial changes affecting more than 5% of its transcriptome.

Collaboration


Dive into the Zhisong He's collaboration.

Top Co-Authors

Avatar
Top Co-Authors

Avatar
Top Co-Authors

Avatar
Top Co-Authors

Avatar

Xiao-He Shi

Shanghai Jiao Tong University

View shared research outputs
Top Co-Authors

Avatar

Kai-Yan Feng

University of Manchester

View shared research outputs
Top Co-Authors

Avatar

Tao Huang

Chinese Academy of Sciences

View shared research outputs
Top Co-Authors

Avatar
Top Co-Authors

Avatar

Qianhui Yu

CAS-MPG Partner Institute for Computational Biology

View shared research outputs
Top Co-Authors

Avatar

Weiren Cui

CAS-MPG Partner Institute for Computational Biology

View shared research outputs
Top Co-Authors

Avatar

Haipeng Li

CAS-MPG Partner Institute for Computational Biology

View shared research outputs
Researchain Logo
Decentralizing Knowledge