Network


Latest external collaboration on country level. Dive into details by clicking on the dots.

Hotspot


Dive into the research topics where Xiaoqi Zheng is active.

Publication


Featured researches published by Xiaoqi Zheng.


Amino Acids | 2012

Accurate prediction of protein structural class using auto covariance transformation of PSI-BLAST profiles

Taigang Liu; Xingbo Geng; Xiaoqi Zheng; Rensuo Li; Jun Wang

Computational prediction of protein structural class based solely on sequence data remains a challenging problem in protein science. Existing methods differ in the protein sequence representation models and prediction engines adopted. In this study, a powerful feature extraction method, which combines position-specific score matrix (PSSM) with auto covariance (AC) transformation, is introduced. Thus, a sample protein is represented by a series of discrete components, which could partially incorporate the long-range sequence order information and evolutionary information reflected from the PSI-BLAST profile. To verify the performance of our method, jackknife cross-validation tests are performed on four widely used benchmark datasets. Comparison of our results with existing methods shows that our method provides the state-of-the-art performance for structural class prediction. A Web server that implements the proposed method is freely available at http://202.194.133.5/xinxi/AAC_PSSM_AC/index.htm.


Biochimie | 2010

Prediction of protein structural class for low-similarity sequences using support vector machine and PSI-BLAST profile

Taigang Liu; Xiaoqi Zheng; Jun Wang

Knowledge of structural class plays an important role in understanding protein folding patterns. In this study, a simple and powerful computational method, which combines support vector machine with PSI-BLAST profile, is proposed to predict protein structural class for low-similarity sequences. The evolution information encoding in the PSI-BLAST profiles is converted into a series of fixed-length feature vectors by extracting amino acid composition and dipeptide composition from the profiles. The resulting vectors are then fed to a support vector machine classifier for the prediction of protein structural class. To evaluate the performance of the proposed method, jackknife cross-validation tests are performed on two widely used benchmark datasets, 1189 (containing 1092 proteins) and 25PDB (containing 1673 proteins) with sequence similarity lower than 40% and 25%, respectively. The overall accuracies attain 70.7% and 72.9% for 1189 and 25PDB datasets, respectively. Comparison of our results with other methods shows that our method is very promising to predict protein structural class particularly for low-similarity datasets and may at least play an important complementary role to existing methods.


Protein and Peptide Letters | 2012

Prediction of protein subcellular multi-localization based on the general form of Chou's pseudo amino acid composition.

Liqi Li; Yuan Zhang; Lingyun Zou; Yue Zhou; Xiaoqi Zheng

Many proteins bear multi-locational characteristics, and this phenomenon is closely related to biological function. However, most of the existing methods can only deal with single-location proteins. Therefore, an automatic and reliable ensemble classifier for protein subcellular multi-localization is needed. We propose a new ensemble classifier combining the KNN (K-nearest neighbour) and SVM (support vector machine) algorithms to predict the subcellular localization of eukaryotic, Gram-negative bacterial and viral proteins based on the general form of Chous pseudo amino acid composition, i.e., GO (gene ontology) annotations, dipeptide composition and AmPseAAC (Amphiphilic pseudo amino acid composition). This ensemble classifier was developed by fusing many basic individual classifiers through a voting system. The overall prediction accuracies obtained by the KNN-SVM ensemble classifier are 95.22, 93.47 and 80.72% for the eukaryotic, Gram-negative bacterial and viral proteins, respectively. Our prediction accuracies are significantly higher than those by previous methods and reveal that our strategy better predicts subcellular locations of multi-location proteins.


Proceedings of the National Academy of Sciences of the United States of America | 2017

Simplified and representative bacterial community of maize roots

Ben Niu; Joseph N. Paulson; Xiaoqi Zheng; Roberto Kolter

Significance Many species of microbes colonize plants as members of complex communities. The high complexity of such plant microbial communities poses great difficulty for any experimental analyses aimed at understanding the principles underlying such microbe–plant interactions. In this work, we assembled a greatly simplified, yet representative, synthetic bacterial model community that allowed us to study the community assembly dynamics and function on axenic maize seedlings. This model community interfered with the growth of a plant pathogenic fungus, thus protecting the plant. This model system will prove to be a useful system for future research on plant–microbe interactions. Plant-associated microbes are important for the growth and health of their hosts. As a result of numerous prior studies, we know that host genotypes and abiotic factors influence the composition of plant microbiomes. However, the high complexity of these communities challenges detailed studies to define experimentally the mechanisms underlying the dynamics of community assembly and the beneficial effects of such microbiomes on plant hosts. In this work, from the distinctive microbiota assembled by maize roots, through host-mediated selection, we obtained a greatly simplified synthetic bacterial community consisting of seven strains (Enterobacter cloacae, Stenotrophomonas maltophilia, Ochrobactrum pituitosum, Herbaspirillum frisingense, Pseudomonas putida, Curtobacterium pusillum, and Chryseobacterium indologenes) representing three of the four most dominant phyla found in maize roots. By using a selective culture-dependent method to track the abundance of each strain, we investigated the role that each plays in community assembly on roots of axenic maize seedlings. Only the removal of E. cloacae led to the complete loss of the community, and C. pusillum took over. This result suggests that E. cloacae plays the role of keystone species in this model ecosystem. In planta and in vitro, this model community inhibited the phytopathogenic fungus Fusarium verticillioides, indicating a clear benefit to the host. Thus, combined with the selective culture-dependent quantification method, our synthetic seven-species community representing the root microbiome has the potential to serve as a useful system to explore how bacterial interspecies interactions affect root microbiome assembly and to dissect the beneficial effects of the root microbiota on hosts under laboratory conditions in the future.


Genome Biology | 2014

MethylPurify: tumor purity deconvolution and differential methylation detection from single tumor DNA methylomes

Xiaoqi Zheng; Qian Zhao; Hua-Jun Wu; Wei Li; Haiyun Wang; Clifford A. Meyer; Qian Alvin Qin; Han Xu; Chongzhi Zang; Peng Jiang; Fuqiang Li; Yong Hou; Jianxing He; Jun Wang; Peng Zhang; Yong Zhang; Xiaole Shirley Liu

We propose a statistical algorithm MethylPurify that uses regions with bisulfite reads showing discordant methylation levels to infer tumor purity from tumor samples alone. MethylPurify can identify differentially methylated regions (DMRs) from individual tumor methylome samples, without genomic variation information or prior knowledge from other datasets. In simulations with mixed bisulfite reads from cancer and normal cell lines, MethylPurify correctly inferred tumor purity and identified over 96% of the DMRs. From patient data, MethylPurify gave satisfactory DMR calls from tumor methylome samples alone, and revealed potential missed DMRs by tumor to normal comparison due to tumor heterogeneity.


Biochimie | 2014

Prediction of bacterial protein subcellular localization by incorporating various features into Chou's PseAAC and a backward feature selection approach

Liqi Li; Sanjiu Yu; Weidong Xiao; Yongsheng Li; Maolin Li; Lan Huang; Xiaoqi Zheng; Shiwen Zhou; Hua Yang

Information on the subcellular localization of bacterial proteins is essential for protein function prediction, genome annotation and drug design. Here we proposed a novel approach to predict the subcellular localization of bacterial proteins by fusing features from position-specific score matrix (PSSM), Gene Ontology (GO) and PROFEAT. A backward feature selection approach by linear kennel of SVM was then used to rank the integrated feature vectors and extract optimal features. Finally, SVM was applied for predicting protein subcellular locations based on these optimal features. To validate the performance of our method, we employed jackknife cross-validation tests on three low similarity datasets, i.e., M638, Gneg1456 and Gpos523. The overall accuracies of 94.98%, 93.21%, and 94.57% were achieved for these three datasets, which are higher (from 1.8% to 10.9%) than those by state-of-the-art tools. Comparison results suggest that our method could serve as a very useful vehicle for expediting the prediction of bacterial protein subcellular localization.


Amino Acids | 2012

Predicting subcellular location of apoptosis proteins with pseudo amino acid composition: approach from amino acid substitution matrix and auto covariance transformation

Xiaoqing Yu; Xiaoqi Zheng; Taigang Liu; Yongchao Dou; Jun Wang

Apoptosis proteins are very important for understanding the mechanism of programmed cell death. Obtaining information on subcellular location of apoptosis proteins is very helpful to understand the apoptosis mechanism. In this paper, based on amino acid substitution matrix and auto covariance transformation, we introduce a new sequence-based model, which not only quantitatively describes the differences between amino acids, but also partially incorporates the sequence-order information. This method is applied to predict the apoptosis proteins’ subcellular location of two widely used datasets by the support vector machine classifier. The results obtained by jackknife test are quite promising, indicating that the proposed method might serve as a potential and efficient prediction model for apoptosis protein subcellular location prediction.


PLOS Computational Biology | 2015

Predicting Anticancer Drug Responses Using a Dual-Layer Integrated Cell Line-Drug Network Model.

Naiqian Zhang; Haiyun Wang; Yun Fang; Jun Wang; Xiaoqi Zheng; X. Shirley Liu

The ability to predict the response of a cancer patient to a therapeutic agent is a major goal in modern oncology that should ultimately lead to personalized treatment. Existing approaches to predicting drug sensitivity rely primarily on profiling of cancer cell line panels that have been treated with different drugs and selecting genomic or functional genomic features to regress or classify the drug response. Here, we propose a dual-layer integrated cell line-drug network model, which uses both cell line similarity network (CSN) data and drug similarity network (DSN) data to predict the drug response of a given cell line using a weighted model. Using the Cancer Cell Line Encyclopedia (CCLE) and Cancer Genome Project (CGP) studies as benchmark datasets, our single-layer model with CSN or DSN and only a single parameter achieved a prediction performance comparable to the previously generated elastic net model. When using the dual-layer model integrating both CSN and DSN, our predicted response reached a 0.6 Pearson correlation coefficient with observed responses for most drugs, which is significantly better than the previous results using the elastic net model. We have also applied the dual-layer cell line-drug integrated network model to fill in the missing drug response values in the CGP dataset. Even though the dual-layer integrated cell line-drug network model does not specifically model mutation information, it correctly predicted that BRAF mutant cell lines would be more sensitive than BRAF wild-type cell lines to three MEK1/2 inhibitors tested.


Journal of Computational Chemistry | 2009

An information-theoretic approach to the prediction of protein structural class

Xiaoqi Zheng; Chun Li; Jun Wang

An information‐theoretical approach, which combines a sequence decomposition technique and a fuzzy clustering algorithm, is proposed for prediction of protein structural class. This approach could bypass the process of selecting and comparing sequence features as done previously. First, distances between each pair of protein sequences are estimated using a conditional decomposition technique in information theory. Then, the fuzzy k‐nearest neighbor algorithm is used to identify the structural class of a protein given as set of sample sequences. To verify the strength of our method, we choose three widely used datasets constructed by Chou and Zhou. It is shown by the Jackknife test that our approach represents an improvement in the prediction of accuracy over existing methods.


PLOS ONE | 2014

PSSP-RFE: accurate prediction of protein structural class by recursive feature extraction from PSI-BLAST profile, physical-chemical property and functional annotations.

Liqi Li; Xiang Cui; Sanjiu Yu; Yuan Zhang; Zhong Luo; Hua Yang; Yue Zhou; Xiaoqi Zheng

Protein structure prediction is critical to functional annotation of the massively accumulated biological sequences, which prompts an imperative need for the development of high-throughput technologies. As a first and key step in protein structure prediction, protein structural class prediction becomes an increasingly challenging task. Amongst most homological-based approaches, the accuracies of protein structural class prediction are sufficiently high for high similarity datasets, but still far from being satisfactory for low similarity datasets, i.e., below 40% in pairwise sequence similarity. Therefore, we present a novel method for accurate and reliable protein structural class prediction for both high and low similarity datasets. This method is based on Support Vector Machine (SVM) in conjunction with integrated features from position-specific score matrix (PSSM), PROFEAT and Gene Ontology (GO). A feature selection approach, SVM-RFE, is also used to rank the integrated feature vectors through recursively removing the feature with the lowest ranking score. The definitive top features selected by SVM-RFE are input into the SVM engines to predict the structural class of a query protein. To validate our method, jackknife tests were applied to seven widely used benchmark datasets, reaching overall accuracies between 84.61% and 99.79%, which are significantly higher than those achieved by state-of-the-art tools. These results suggest that our method could serve as an accurate and cost-effective alternative to existing methods in protein structural classification, especially for low similarity datasets.

Collaboration


Dive into the Xiaoqi Zheng's collaboration.

Top Co-Authors

Avatar

Jun Wang

Shanghai Normal University

View shared research outputs
Top Co-Authors

Avatar

Taigang Liu

Shandong Agricultural University

View shared research outputs
Top Co-Authors

Avatar
Top Co-Authors

Avatar

Liqi Li

Third Military Medical University

View shared research outputs
Top Co-Authors

Avatar

Naiqian Zhang

Shanghai Normal University

View shared research outputs
Top Co-Authors

Avatar
Top Co-Authors

Avatar

Xiaoqing Yu

Shanghai Normal University

View shared research outputs
Top Co-Authors

Avatar
Top Co-Authors

Avatar

Hua Yang

Third Military Medical University

View shared research outputs
Top Co-Authors

Avatar

Sanjiu Yu

Third Military Medical University

View shared research outputs
Researchain Logo
Decentralizing Knowledge