Network


Latest external collaboration on country level. Dive into details by clicking on the dots.

Hotspot


Dive into the research topics where Hua Tang is active.

Publication


Featured researches published by Hua Tang.


Oncotarget | 2016

iOri-Human: identify human origin of replication by incorporating dinucleotide physicochemical properties into pseudo nucleotide composition

Chang-Jian Zhang; Hua Tang; Wen-Chao Li; Hao Lin; Wei Chen; Kuo-Chen Chou

The initiation of replication is an extremely important process in DNA life cycle. Given an uncharacterized DNA sequence, can we identify where its origin of replication (ORI) is located? It is no doubt a fundamental problem in genome analysis. Particularly, with the rapid development of genome sequencing technology that results in a huge amount of sequence data, it is highly desired to develop computational methods for rapidly and effectively identifying the ORIs in these genomes. Unfortunately, by means of the existing computational methods, such as sequence alignment or kmer strategies, it could hardly achieve decent success rates. To address this problem, we developed a predictor called “iOri-Human”. Rigorous jackknife tests have shown that its overall accuracy and stability in identifying human ORIs are over 75% and 50%, respectively. In the predictor, it is through the pseudo nucleotide composition (an extension of pseudo amino acid composition) that 96 physicochemical properties for the 16 possible constituent dinucleotides have been incorporated to reflect the global sequence patterns in DNA as well as its local sequence patterns. Moreover, a user-friendly web-server for iOri-Human has been established at http://lin.uestc.edu.cn/server/iOri-Human.html, by which users can easily get their desired results without the need to through the complicated mathematics involved.


Molecular therapy. Nucleic acids | 2016

iRNA-PseU: Identifying RNA pseudouridine sites

Wei Chen; Hua Tang; Jing Ye; Hao Lin; Kuo-Chen Chou

As the most abundant RNA modification, pseudouridine plays important roles in many biological processes. Occurring at the uridine site and catalyzed by pseudouridine synthase, the modification has been observed in nearly all kinds of RNA, including transfer RNA, messenger RNA, small nuclear or nucleolar RNA, and ribosomal RNA. Accordingly, its importance to basic research and drug development is self-evident. Despite some experimental technologies have been developed to detect the pseudouridine sites, they are both time-consuming and expensive. Facing the explosive growth of RNA sequences in the postgenomic age, we are challenged to address the problem by computational approaches: For an uncharacterized RNA sequence, can we predict which of its uridine sites can be modified as pseudouridine and which ones cannot? Here a predictor called “iRNA-PseU” was proposed by incorporating the chemical properties of nucleotides and their occurrence frequency density distributions into the general form of pseudo nucleotide composition (PseKNC). It has been demonstrated via the rigorous jackknife test, independent dataset test, and practical genome-wide analysis that the proposed predictor remarkably outperforms its counterpart. For the convenience of most experimental scientists, the web-server for iRNA-PseU was established at http://lin.uestc.edu.cn/server/iRNA-PseU, by which users can easily get their desired results without the need to go through the mathematical details.


BioMed Research International | 2016

Identification of Secretory Proteins in Mycobacterium tuberculosis Using Pseudo Amino Acid Composition

Huan Yang; Hua Tang; Xin-Xin Chen; Chang-Jian Zhang; Pan-Pan Zhu; Hui Ding; Wei Chen; Hao Lin

Tuberculosis is killing millions of lives every year and on the blacklist of the most appalling public health problems. Recent findings suggest that secretory protein of Mycobacterium tuberculosis may serve the purpose of developing specific vaccines and drugs due to their antigenicity. Responding to global infectious disease, we focused on the identification of secretory proteins in Mycobacterium tuberculosis. A novel method called MycoSec was designed by incorporating g-gap dipeptide compositions into pseudo amino acid composition. Analysis of variance-based technique was applied in the process of feature selection and a total of 374 optimal features were obtained and used for constructing the final predicting model. In the jackknife test, MycoSec yielded a good performance with the area under the receiver operating characteristic curve of 0.93, demonstrating that the proposed system is powerful and robust. For users convenience, the web server MycoSec was established and an obliging manual on how to use it was provided for getting around any trouble unnecessary.


BioMed Research International | 2016

Identification of Bacterial Cell Wall Lyases via Pseudo Amino Acid Composition

Xin-Xin Chen; Hua Tang; Wen-Chao Li; Hao Wu; Wei Chen; Hui Ding; Hao Lin

Owing to the abuse of antibiotics, drug resistance of pathogenic bacteria becomes more and more serious. Therefore, it is interesting to develop a more reasonable way to solve this issue. Because they can destroy the bacterial cell structure and then kill the infectious bacterium, the bacterial cell wall lyases are suitable candidates of antibacteria sources. Thus, it is urgent to develop an accurate and efficient computational method to predict the lyases. Based on the consideration, in this paper, a set of objective and rigorous data was collected by searching through the Universal Protein Resource (the UniProt database), whereafter a feature selection technique based on the analysis of variance (ANOVA) was used to acquire optimal feature subset. Finally, the support vector machine (SVM) was used to perform prediction. The jackknife cross-validated results showed that the optimal average accuracy of 84.82% was achieved with the sensitivity of 76.47% and the specificity of 93.16%. For the convenience of other scholars, we built a free online server called Lypred. We believe that Lypred will become a practical tool for the research of cell wall lyases and development of antimicrobial agents.


IEEE/ACM Transactions on Computational Biology and Bioinformatics | 2017

Identifying sigma70 promoters with novel pseudo nucleotide composition

Hao Lin; Zhi-Yong Liang; Hua Tang; Wei Chen

Promoters are DNA regulatory elements located directly upstream or at the 5’ end of the transcription initiation site (TSS), which are in charge of gene transcription initiation. With the completion of a large number of microorganism genomics, it is urgent to predict promoters accurately in bacteria by using the computational method. In this work, a sequence-based predictor named “iPro70-PseZNC” was designed for identifying sigma70 promoters in prokaryote. In the predictor, the samples of DNA sequences are formulated by a novel pseudo nucleotide composition, called PseZNC, into which the multi-window Z-curve composition and six local DNA structural properties are incorporated. In the 5-fold cross-validation, the area under the curve of receiver operating characteristic of 0.909 was obtained on our benchmark dataset, indicating that the proposed predictor is promising and will provide an important guide in this area. Further studies showed that the performance of PseZNC is better than it of multi-window Z-curve composition. For the sake of convenience for researchers, a user-friendly online service was established and can be freely accessible at http://lin.uestc.edu.cn/server/iPro70-PseZNC. The PseZNC approach can be also extended to other DNA-related problems.


Oncotarget | 2017

Sequence-based predictive modeling to identify cancerlectins

Hong-Yan Lai; Xin-Xin Chen; Wei Chen; Hua Tang; Hao Lin

Lectins are a diverse type of glycoproteins or carbohydrate-binding proteins that have a wide distribution to various species. They can specially identify and exclusively bind to a certain kind of saccharide groups. Cancerlectins are a group of lectins that are closely related to cancer and play a major role in the initiation, survival, growth, metastasis and spread of tumor. Several computational methods have emerged to discriminate cancerlectins from non-cancerlectins, which promote the study on pathogenic mechanisms and clinical treatment of cancer. However, the predictive accuracies of most of these techniques are very limited. In this work, by constructing a benchmark dataset based on the CancerLectinDB database, a new amino acid sequence-based strategy for feature description was developed, and then the binomial distribution was applied to screen the optimal feature set. Ultimately, an SVM-based predictor was performed to distinguish cancerlectins from non-cancerlectins, and achieved an accuracy of 77.48% with AUC of 85.52% in jackknife cross-validation. The results revealed that our prediction model could perform better comparing with published predictive tools.


Journal of Biomolecular Structure & Dynamics | 2017

MethyRNA: a web server for identification of N6-methyladenosine sites

Wei Chen; Hua Tang; Hao Lin

N6-methyladenosine (m6A) is the most abundant post-transcriptional modification and has been found in the three domains of life (Cantara et al., 2011). m6A plays fundamental regulatory roles in a s...


Bioinformatics | 2016

Pro54DB: a database for experimentally verified sigma-54 promoters

Zhi-Yong Liang; Hong-Yan Lai; Huan Yang; Chang-Jian Zhang; Hui Yang; Huan-Huan Wei; Xin-Xin Chen; Ya-Wei Zhao; Zhen-Dong Su; Wen-Chao Li; En-Ze Deng; Hua Tang; Wei Chen; Hao Lin

Summary: In prokaryotes, the &sgr;54 promoters are unique regulatory elements and have attracted much attention because they are in charge of the transcription of carbon and nitrogen‐related genes and participate in numerous ancillary processes and environmental responses. All findings on &sgr;54 promoters are favorable for a better understanding of their regulatory mechanisms in gene transcription and an accurate discovery of genes missed by the wet experimental evidences. In order to provide an up‐to‐date, interactive and extensible database for &sgr;54 promoter, a free and easy accessed database called Pro54DB (&sgr;54 promoter database) was built to collect information of &sgr;54 promoter. In the current version, it has stored 210 experimental‐confirmed &sgr;54 promoters with 297 regulated genes in 43 species manually extracted from 133 publications, which is helpful for researchers in fields of bioinformatics and molecular biology. Availability and Implementation: Pro54DB is freely available on the web at http://lin.uestc.edu.cn/database/pro54db with all major browsers supported. Contacts: [email protected] or [email protected]


Biochemical and Biophysical Research Communications | 2016

Prediction of cell-penetrating peptides with feature selection techniques

Hua Tang; Zhen-Dong Su; Huan-Huan Wei; Wei Chen; Hao Lin

Cell-penetrating peptides are a group of peptides which can transport different types of cargo molecules such as drugs across plasma membrane and have been applied in the treatment of various diseases. Thus, the accurate prediction of cell-penetrating peptides with bioinformatics methods will accelerate the development of drug delivery systems. The study aims to develop a powerful model to accurately identify cell-penetrating peptides. At first, the peptides were translated into a set of vectors with the same dimension by using dipeptide compositions. Secondly, the Analysis of Variance-based technique was used to reduce the dimension of the vector and explore the optimized features. Finally, the support vector machine was utilized to discriminate cell-penetrating peptides from non-cell-penetrating peptides. The five-fold cross-validated results showed that our proposed method could achieve an overall prediction accuracy of 83.6%. Based on the proposed model, we constructed a free webserver called C2Pred (http://lin.uestc.edu.cn/server/C2Pred).


Scientific Reports | 2016

Prediction of phosphothreonine sites in human proteins by fusing different features

Ya-Wei Zhao; Hong-Yan Lai; Hua Tang; Wei Chen; Hao Lin

Phosphorylation is one of the most important protein post-translation modifications. With the rapid development of high-throughput mass spectrometry, phosphorylation site data is rapidly accumulating, which provides us an opportunity to systematically investigate and predict phosphorylation in proteins. The phosphorylation of threonine is the addition of a phosphoryl group to its polar side chains group. In this work, we statistically analyzed the distribution of the different properties including position conservation, secondary structure, accessibility and some other physicochemical properties of the residues surrounding the phosphothreonine site and non-phosphothreonine site. We found that the distributions of those features are non-symmetrical. Based on the distribution of properties, we developed a new model by using optimal window size strategy and feature selection technique. The cross-validated results show that the area under receiver operating characteristic curve reaches to 0.847, suggesting that our model may play a complementary role to other existing methods for predicting phosphothreonine site in proteins.

Collaboration


Dive into the Hua Tang's collaboration.

Top Co-Authors

Avatar

Hao Lin

University of Electronic Science and Technology of China

View shared research outputs
Top Co-Authors

Avatar

Wei Chen

North China University of Science and Technology

View shared research outputs
Top Co-Authors

Avatar

Hui Ding

University of Electronic Science and Technology of China

View shared research outputs
Top Co-Authors

Avatar

Hong-Yan Lai

University of Electronic Science and Technology of China

View shared research outputs
Top Co-Authors

Avatar

Pengmian Feng

North China University of Science and Technology

View shared research outputs
Top Co-Authors

Avatar

Wuritu Yang

University of Electronic Science and Technology of China

View shared research outputs
Top Co-Authors

Avatar

Xin-Xin Chen

University of Electronic Science and Technology of China

View shared research outputs
Top Co-Authors

Avatar

Ya-Wei Zhao

University of Electronic Science and Technology of China

View shared research outputs
Top Co-Authors

Avatar

Zhen-Dong Su

University of Electronic Science and Technology of China

View shared research outputs
Top Co-Authors

Avatar

Chang-Jian Zhang

University of Electronic Science and Technology of China

View shared research outputs
Researchain Logo
Decentralizing Knowledge