Ping-an He
Zhejiang Sci-Tech University
Publication
Featured research published by Ping-an He.
Proteins | 2008
Yuhua Yao; Qi Dai; Chun Li; Ping-an He; Xuying Nan; Yaozhou Zhang
On the basis of a selected pair of physicochemical properties of amino acids, we introduce a dynamic 2D graphical representation of protein sequences. Then, we introduce and compare two numerical characterizations of protein graphs as descriptors to analyze the nine ND5 proteins. The approach is simple, convenient, and fast. Proteins 2008.
Journal of Computational Chemistry | 2009
Yuhua Yao; Qi Dai; Ling Li; Xuying Nan; Ping-an He; Yaozhou Zhang
A two-dimensional (2D) graphical representation of protein sequences based on six physicochemical properties of amino acids is outlined. The numerical characterization of protein graphs is given as descriptors of protein sequences. It is useful not only for the comparative study of proteins but also for encoding innate information about the structure of proteins. The coefficient of determination is proposed as a new similarity/dissimilarity measure. Finally, a simple example highlights the behavior of the new similarity/dissimilarity measure on protein sequences taken from the ND6 (NADH dehydrogenase subunit 6) proteins of eight different species. The results demonstrate that the approach is convenient, fast, and efficient.
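As a rough illustration of a coefficient-of-determination similarity measure, the sketch below computes R² as the squared Pearson correlation between two equal-length descriptor vectors. The abstract does not give the exact definition used in the paper, so this formula and the function name `coefficient_of_determination` are assumptions.

```python
def coefficient_of_determination(x, y):
    """Squared Pearson correlation of two equal-length numeric vectors.

    Assumption: R^2 is taken as the squared Pearson correlation; the
    paper's exact formulation is not stated in the abstract.
    """
    if len(x) != len(y) or len(x) < 2:
        raise ValueError("vectors must have equal length of at least 2")
    mean_x = sum(x) / len(x)
    mean_y = sum(y) / len(y)
    # Centered cross-product and sums of squares.
    sxy = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    sxx = sum((a - mean_x) ** 2 for a in x)
    syy = sum((b - mean_y) ** 2 for b in y)
    if sxx == 0 or syy == 0:
        raise ValueError("correlation is undefined for a constant vector")
    return (sxy * sxy) / (sxx * syy)
```

Under this definition, values near 1 indicate descriptor vectors that are almost linearly related, and values near 0 indicate little linear relationship.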
Journal of Computational Chemistry | 2010
Ping-an He; Yan-Ping Zhang; Yuhua Yao; Yi-Fa Tang; Xuying Nan
Based on the chaos game representation, a 2D graphical representation of protein sequences is introduced in which the 20 amino acids are arranged in a cyclic order according to their physicochemical properties. The Euclidean distances between corresponding amino acids in the 2D graphical representations are computed to find matching (or conserved) fragments of amino acids between two proteins. In addition, the cumulative distance between the 2D graphical representations is defined to compare the similarity of proteins. An examination of the similarity among the ND5 protein sequences of nine species shows the utility of our approach.
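The chaos game walk and the point-wise distances described above can be sketched as follows. This is a minimal sketch under stated assumptions: the cyclic order `ORDER` is an alphabetical placeholder (the abstract does not list the property-based order), the 20 residues are placed on a regular 20-gon on the unit circle, and the contraction `ratio` of 0.5 is the classic chaos game choice, not a value confirmed by the source.

```python
import math

# Placeholder cyclic order (alphabetical one-letter codes). The paper orders
# residues by physicochemical properties, which the abstract does not list.
ORDER = "ACDEFGHIKLMNPQRSTVWY"
VERTEX = {
    aa: (math.cos(2 * math.pi * i / 20), math.sin(2 * math.pi * i / 20))
    for i, aa in enumerate(ORDER)
}

def cgr_points(seq, ratio=0.5):
    """Start at the origin and move `ratio` of the way toward the
    vertex of each residue in turn, recording every visited point."""
    x, y = 0.0, 0.0
    points = []
    for aa in seq:
        vx, vy = VERTEX[aa]
        x += ratio * (vx - x)
        y += ratio * (vy - y)
        points.append((x, y))
    return points

def pointwise_distances(seq_a, seq_b):
    """Euclidean distance between corresponding CGR points, truncated
    to the length of the shorter sequence."""
    pairs = zip(cgr_points(seq_a), cgr_points(seq_b))
    return [math.dist(p, q) for p, q in pairs]

def cumulative_distance(seq_a, seq_b):
    """Sum of the point-wise distances (one possible cumulative measure)."""
    return sum(pointwise_distances(seq_a, seq_b))
```

Runs of small values in `pointwise_distances` would then suggest fragments where the two sequences follow similar paths, which is one way to read the "matching fragments" idea in the abstract.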
Journal of Theoretical Biology | 2012
Ping-an He; Dan Li; Yanping Zhang; Xin Wang; Yuhua Yao
Based on the order of the 6-bit binary Gray code, a cyclic order of the 20 amino acids is introduced. A novel 3D graphical representation of protein sequences is proposed, following the chaos game representation (CGR) of DNA sequences. Furthermore, a mathematical descriptor is suggested to characterize the graphical representation curve. The efficiency of our approach is illustrated by comparing the similarities/dissimilarities among the ND5 protein sequences of nine different species. Correlation and significance analysis, comparing both our results and those of other graphical representations with the ClustalW results, shows the utility of our approach.
Journal of Computational Chemistry | 2008
Yuhua Yao; Qi Dai; Xuying Nan; Ping-an He; Zuoming Nie; Songping Zhou; Yaozhou Zhang
On the basis of a class of 2D graphical representations of DNA sequences, sensitivity analysis has been performed, showing the high capability of the proposed representations to take into account small modifications of the DNA sequences. The sensitivity analysis also indicates that the absolute differences of the leading eigenvalues of the L/L matrices associated with DNA increase as the number of base mutations increases. In addition, we conclude that the similarity analysis method based on correlation angles eliminates the effects of DNA sequence length better than the method using Euclidean distances. As an application, the similarities/dissimilarities among the coding sequences of the first exon of the β‐globin gene of different species have been examined by our method, and the reasonable results verify its validity.
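One way to see why an angle-based measure is less sensitive to sequence length than a Euclidean one is to compare the two on descriptor vectors that differ only in scale. The sketch below assumes that the correlation angle is the angle between two descriptor vectors (the arccosine of their cosine similarity); the abstract does not spell out the definition, and the function names are illustrative.

```python
import math

def correlation_angle(u, v):
    """Angle in radians between two equal-length vectors.

    Assumption: 'correlation angle' means arccos of the cosine similarity.
    """
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    if norm_u == 0 or norm_v == 0:
        raise ValueError("angle is undefined for a zero vector")
    # Clamp to [-1, 1] to guard against floating-point overshoot.
    cosine = max(-1.0, min(1.0, dot / (norm_u * norm_v)))
    return math.acos(cosine)

def euclidean_distance(u, v):
    """Plain Euclidean distance between two equal-length vectors."""
    return math.dist(u, v)

# Two descriptor vectors with the same direction but different magnitude,
# as might arise when magnitudes grow with sequence length (illustrative values).
short_seq = [1.0, 2.0, 3.0]
long_seq = [2.0, 4.0, 6.0]
angle = correlation_angle(short_seq, long_seq)
distance = euclidean_distance(short_seq, long_seq)
```

Here `angle` is essentially zero while `distance` is clearly positive, so the angle treats the two vectors as similar regardless of their overall magnitude.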
Journal of Theoretical Biology | 2014
Yuhua Yao; Shoujiang Yan; Jianning Han; Qi Dai; Ping-an He
In this paper, a dynamic 3D graphical representation of protein sequences is introduced based on three physicochemical properties of amino acids. The coordinates of the graph have direct biological significance, which could reflect the innate structure of the proteins. The principal moments of inertia and the ranges of the axis coordinates are extracted as a novel mixed descriptor for the comparison of protein primary sequences. The Euclidean distance between the normalized descriptor vectors, which avoids the influence of differences in the lengths of the protein sequences under consideration, is employed as a quantitative measure of protein similarity. Finally, we take the nine ND5 (NADH dehydrogenase subunit 5) proteins as an example to illustrate the effectiveness of our approach.
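The principal moments of inertia of a 3D curve can be sketched as below. This is only an illustration under stated assumptions: each curve point is given unit mass, the tensor is taken about the centroid, and the eigenvalues come from the standard closed form for a symmetric 3x3 matrix. How the paper builds the curve from the three properties, and how it combines the moments with the coordinate ranges and normalizes them, is not given in the abstract and is not reproduced here.

```python
import math

def inertia_tensor(points):
    """Inertia tensor of unit-mass 3D points about their centroid."""
    if not points:
        raise ValueError("at least one point is required")
    n = len(points)
    cx = sum(p[0] for p in points) / n
    cy = sum(p[1] for p in points) / n
    cz = sum(p[2] for p in points) / n
    ixx = iyy = izz = ixy = ixz = iyz = 0.0
    for px, py, pz in points:
        x, y, z = px - cx, py - cy, pz - cz
        ixx += y * y + z * z
        iyy += x * x + z * z
        izz += x * x + y * y
        ixy -= x * y
        ixz -= x * z
        iyz -= y * z
    return [[ixx, ixy, ixz], [ixy, iyy, iyz], [ixz, iyz, izz]]

def principal_moments(points):
    """Eigenvalues of the inertia tensor in ascending order."""
    a = inertia_tensor(points)
    p1 = a[0][1] ** 2 + a[0][2] ** 2 + a[1][2] ** 2
    if p1 == 0.0:
        # Already diagonal: the diagonal entries are the eigenvalues.
        return sorted([a[0][0], a[1][1], a[2][2]])
    q = (a[0][0] + a[1][1] + a[2][2]) / 3.0
    p2 = sum((a[i][i] - q) ** 2 for i in range(3)) + 2.0 * p1
    p = math.sqrt(p2 / 6.0)
    # B = (A - q*I) / p has eigenvalues 2*cos(angle) for three angles.
    b = [[(a[i][j] - (q if i == j else 0.0)) / p for j in range(3)]
         for i in range(3)]
    det = (b[0][0] * (b[1][1] * b[2][2] - b[1][2] * b[2][1])
           - b[0][1] * (b[1][0] * b[2][2] - b[1][2] * b[2][0])
           + b[0][2] * (b[1][0] * b[2][1] - b[1][1] * b[2][0]))
    r = max(-1.0, min(1.0, det / 2.0))
    phi = math.acos(r) / 3.0
    e1 = q + 2.0 * p * math.cos(phi)
    e3 = q + 2.0 * p * math.cos(phi + 2.0 * math.pi / 3.0)
    e2 = 3.0 * q - e1 - e3
    return sorted([e1, e2, e3])
```

The three returned values could serve as part of a descriptor vector; dividing by a length-dependent factor before taking Euclidean distances would be one way to approximate the normalization the abstract mentions.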
BMC Bioinformatics | 2013
Qi Dai; Yan Li; Xiaoqing Liu; Yuhua Yao; Yunjie Cao; Ping-an He
Background: Many content-based statistical features of secondary structural elements (CBF-PSSEs) have been proposed and have achieved promising results in protein structural class prediction, but until now the position distribution of the successive occurrences of an element in predicted secondary structure sequences has not been used. It is necessary to extract appropriate position-based features of the secondary structural elements for the prediction task. Results: We proposed some position-based features of predicted secondary structural elements (PBF-PSSEs) and assessed their intrinsic ability relative to the available CBF-PSSEs, which not only offers a systematic and quantitative experimental assessment of these statistical features, but also naturally complements the available comparison of the CBF-PSSEs. We also analyzed the performance of the CBF-PSSEs combined with the PBF-PSSE and further constructed a new combined feature set, PBF11CBF-PSSE. Based on these experiments, novel and valuable guidelines for the use of PBF-PSSEs and CBF-PSSEs were obtained. Conclusions: PBF-PSSEs and CBF-PSSEs have a compelling impact on protein structural class prediction. When combined with the PBF-PSSE, most of the CBF-PSSEs show a large improvement in prediction accuracy, so the PBF-PSSEs and the CBF-PSSEs should be used together to make significant and complementary contributions to protein structural class prediction. In addition, the performance of the proposed PBF-PSSE is extremely sensitive to the choice of the parameter k. In summary, our quantitative analysis verifies that exploring the position information of predicted secondary structural elements is a promising way to improve protein structural class prediction.
Biochimie | 2014
Junru Wang; Yan Li; Xiaoqing Liu; Qi Dai; Yuhua Yao; Ping-an He
Since the introduction of PseAAs and functional domains, promising results have been achieved in protein structural class prediction, but some challenges still exist in the representation of the PseAA structural correlation and structural domains. This paper proposes a high-accuracy prediction method using novel PseAA structural properties and secondary structural patterns, reflecting the long-range and local structural properties of the PseAAs and certain compact structural domains. The proposed method was tested against competing prediction methods in four experiments. The experimental results indicate that the proposed method achieved the best performance. Its overall accuracies for datasets 25 PDB, D640, FC699 and 1189 are 88.8%, 90.9%, 96.4% and 87.4%, which are 4.5%, 7.6%, 2% and 3.9% higher than those of the existing best-performing method. This understanding can be used to guide the development of more powerful methods for protein structural class prediction. The software and supplementary material are freely available at http://bioinfo.zstu.edu.cn/PseAA-SSP.
PLOS ONE | 2017
Qi Dai; Lili Geng; Minjia Lu; Weibo Jin; Xuying Nan; Ping-an He; Yuhua Yao
Although domesticated tomato was derived from wild tomato by cultivation, there are many differences between cultivated and wild tomato, such as shape, physiological function and life history. Many studies show that wild tomato has better salt resistance and drought resistance. In addition, the fruit of domesticated tomato is larger and more nutritious than that of wild tomato. These differences are closely related to differentially expressed genes. We identified 126 up-regulated and 87 down-regulated differentially expressed genes in cultivated tomato and wild tomato by RNA-Seq. These differentially expressed genes may be associated with salt resistance, drought resistance and fruit nutrition. They also further highlight the large-scale reconstruction between wild and cultivated species. In this paper, we mainly study GO enrichment analysis and pathway analysis of the differentially expressed genes. After GO and pathway enrichment analysis, a set of significantly enriched GO annotations and pathways was identified for the differentially expressed genes. We also identified long non-coding RNAs and mRNAs in the two species and analyzed their essential features. In addition, we constructed a co-expression network of long non-coding RNAs and mRNAs, annotated mRNAs associated with long non-coding RNAs as target genes, and inferred the regulatory function of the long non-coding RNAs. Overall, our results reveal the effects of artificial and natural selection on tomato transcripts, providing a scientific basis for future tomato research.
BioMed Research International | 2014
Yusheng Bai; Tingting Ma; Yuhua Yao; Qi Dai; Ping-an He
A new mathematical descriptor is proposed based on a 3D graphical representation. Using this method, we construct phylogenetic trees of nine proteins of the H7N9 influenza virus to analyze the origin of H7N9. The results show that the evolutionary route of H7N9 avian influenza is from America through Europe to Asia. Furthermore, two samples collected from the environment in Nanjing and Zhejiang and one sample collected from chicken are the sources of the H7N9 influenza virus that infected humans in China.