Network


Latest external collaboration on country level. Dive into details by clicking on the dots.

Hotspot


Dive into the research topics where Justin Bo Kai Hsu is active.

Publication


Featured researches published by Justin Bo Kai Hsu.


Nucleic Acids Research | 2007

ViTa: prediction of host microRNAs targets on viruses

Paul Wei-Che Hsu; Li-Zen Lin; Sheng-Da Hsu; Justin Bo Kai Hsu; Hsien-Da Huang

MicroRNAs (miRNAs) are involved in various biological processes by suppressing gene expression. A recent work has indicated that host miRNAs are also capable of regulating viral gene expression by targeting the virus genomes. To investigate regulatory relationships between host miRNAs and related viruses, we present a novel database, namely ViTa, to curate the known virus miRNA genes and the known/putative target sites of human, mice, rat and chicken miRNAs. Known miRNAs are obtained from miRBase. Virus data are collected and referred from ICTVdB, VBRC and VirGen. Experimentally validated miRNA targets on viruses were derived from literatures. Then, miRanda and TargetScan are utilized to predict miRNA targets within virus genomes. ViTa also provides the virus annotations, virus-infected tissues and tissue specificity of host miRNAs. This work also facilitates the comparisons between subtypes of viruses, such as influenza viruses, human liver viruses and the conserved regions between viruses. Both textual and graphical web interfaces are provided to facilitate the data retrieves in the ViTa database. The database is now freely available at .


Journal of Computational Chemistry | 2009

Incorporating Structural Characteristics for Identification of Protein Methylation Sites

Dray Ming Shien; Tzong-Yi Lee; Wen Chi Chang; Justin Bo Kai Hsu; Jorng Tzong Horng; Po Chiang Hsu; Ting Yuan Wang; Hsien-Da Huang

Studies over the last few years have identified protein methylation on histones and other proteins that are involved in the regulation of gene transcription. Several works have developed approaches to identify computationally the potential methylation sites on lysine and arginine. Studies of protein tertiary structure have demonstrated that the sites of protein methylation are preferentially in regions that are easily accessible. However, previous studies have not taken into account the solvent‐accessible surface area (ASA) that surrounds the methylation sites. This work presents a method named MASA that combines the support vector machine with the sequence and structural characteristics of proteins to identify methylation sites on lysine, arginine, glutamate, and asparagine. Since most experimental methylation sites are not associated with corresponding protein tertiary structures in the Protein Data Bank, the effective solvent‐accessible prediction tools have been adopted to determine the potential ASA values of amino acids in proteins. Evaluation of predictive performance by cross‐validation indicates that the ASA values around the methylation sites can improve the accuracy of prediction. Additionally, an independent test reveals that the prediction accuracies for methylated lysine and arginine are 80.8 and 85.0%, respectively. Finally, the proposed method is implemented as an effective system for identifying protein methylation sites. The developed web server is freely available at http://MASA.mbc.nctu.edu.tw/.


Journal of Computational Chemistry | 2009

Incorporating Support Vector Machine for Identifying Protein Tyrosine Sulfation Sites

Wen Chi Chang; Tzong-Yi Lee; Dray Ming Shien; Justin Bo Kai Hsu; Jorng Tzong Horng; Po Chiang Hsu; Ting Yuan Wang; Hsien-Da Huang; Rong Long Pan

Tyrosine sulfation is a post‐translational modification of many secreted and membrane‐bound proteins. It governs protein‐protein interactions that are involved in leukocyte adhesion, hemostasis, and chemokine signaling. However, the intrinsic feature of sulfated protein remains elusive and remains to be delineated. This investigation presents SulfoSite, which is a computational method based on a support vector machine (SVM) for predicting protein sulfotyrosine sites. The approach was developed to consider structural information such as concerning the secondary structure and solvent accessibility of amino acids that surround the sulfotyrosine sites. One hundred sixty‐two experimentally verified tyrosine sulfation sites were identified using UniProtKB/SwissProt release 53.0. The results of a five‐fold cross‐validation evaluation suggest that the accessibility of the solvent around the sulfotyrosine sites contributes substantially to predictive accuracy. The SVM classifier can achieve an accuracy of 94.2% in five‐fold cross validation when sequence positional weighted matrix (PWM) is coupled with values of the accessible surface area (ASA). The proposed method significantly outperforms previous methods for accurately predicting the location of tyrosine sulfation sites.


Nucleic Acids Research | 2011

RegPhos: a system to explore the protein kinase–substrate phosphorylation network in humans

Tzong-Yi Lee; Justin Bo Kai Hsu; Wen Chi Chang; Hsien-Da Huang

Protein phosphorylation catalyzed by kinases plays crucial regulatory roles in intracellular signal transduction. With the increasing number of experimental phosphorylation sites that has been identified by mass spectrometry-based proteomics, the desire to explore the networks of protein kinases and substrates is motivated. Manning et al. have identified 518 human kinase genes, which provide a starting point for comprehensive analysis of protein phosphorylation networks. In this study, a knowledgebase is developed to integrate experimentally verified protein phosphorylation data and protein–protein interaction data for constructing the protein kinase–substrate phosphorylation networks in human. A total of 21 110 experimental verified phosphorylation sites within 5092 human proteins are collected. However, only 4138 phosphorylation sites (∼20%) have the annotation of catalytic kinases from public domain. In order to fully investigate how protein kinases regulate the intracellular processes, a published kinase-specific phosphorylation site prediction tool, named KinasePhos is incorporated for assigning the potential kinase. The web-based system, RegPhos, can let users input a group of human proteins; consequently, the phosphorylation network associated with the protein subcellular localization can be explored. Additionally, time-coursed microarray expression data is subsequently used to represent the degree of similarity in the expression profiles of network members. A case study demonstrates that the proposed scheme not only identify the correct network of insulin signaling but also detect a novel signaling pathway that may cross-talk with insulin signaling network. This effective system is now freely available at http://RegPhos.mbc.nctu.edu.tw.


asia pacific bioinformatics conference | 2012

GPMiner: an integrated system for mining combinatorial cis-regulatory elements in mammalian gene group

Tzong-Yi Lee; Wen Chi Chang; Justin Bo Kai Hsu; Tzu Hao Chang; Dray Ming Shien

BackgroundSequence features in promoter regions are involved in regulating gene transcription initiation. Although numerous computational methods have been developed for predicting transcriptional start sites (TSSs) or transcription factor (TF) binding sites (TFBSs), they lack annotations for do not consider some important regulatory features such as CpG islands, tandem repeats, the TATA box, CCAAT box, GC box, over-represented oligonucleotides, DNA stability, and GC content. Additionally, the combinatorial interaction of TFs regulates the gene group that is associated with same expression pattern. To investigate gene transcriptional regulation, an integrated system that annotates regulatory features in a promoter sequence and detects co-regulation of TFs in a group of genes is needed.ResultsThis work identifies TSSs and regulatory features in a promoter sequence, and recognizes co-occurrence of cis-regulatory elements in co-expressed genes using a novel system. Three well-known TSS prediction tools are incorporated with orthologous conserved features, such as CpG islands, nucleotide composition, over-represented hexamer nucleotides, and DNA stability, to construct the novel Gene Promoter Miner (GPMiner) using a support vector machine (SVM). According to five-fold cross-validation results, the predictive sensitivity and specificity are both roughly 80%. The proposed system allows users to input a group of gene names/symbols, enabling the co-occurrence of TFBSs to be determined. Additionally, an input sequence can also be analyzed for homogeneity of experimental mammalian promoter sequences, and conserved regulatory features between homologous promoters can be observed through cross-species analysis. After identifying promoter regions, regulatory features are visualized graphically to facilitate gene promoter observations.ConclusionsThe GPMiner, which has a user-friendly input/output interface, has numerous benefits in analyzing human and mouse promoters. The proposed system is freely available at http://GPMiner.mbc.nctu.edu.tw/.


Journal of Computational Chemistry | 2010

N‐Ace: Using solvent accessibility and physicochemical properties to identify protein N‐acetylation sites

Tzong-Yi Lee; Justin Bo Kai Hsu; Feng Mao Lin; Wen Chi Chang; Po Chiang Hsu; Hsien-Da Huang

Protein acetylation, which is catalyzed by acetyltransferases, is a type of post‐translational modification and crucial to numerous essential biological processes, including transcriptional regulation, apoptosis, and cytokine signaling. As the experimental identification of protein acetylation sites is time consuming and laboratory intensive, several computational approaches have been developed for identifying the candidates of experimental validation. In this work, solvent accessibility and the physicochemical properties of proteins are utilized to identify acetylated alanine, glycine, lysine, methionine, serine, and threonine. A two‐stage support vector machine was applied to learn the computational models with combinations of amino acid sequences, and the accessible surface area and physicochemical properties of proteins. The predictive accuracy thus achieved is 5% to 14% higher than that of models trained using only amino acid sequences. Additionally, the substrate specificity of the acetylated site was investigated in detail with reference to the subcellular colocalization of acetyltransferases and acetylated proteins. The proposed method, N‐Ace, is evaluated using independent test sets in various acetylated residues and predictive accuracies of 90% were achieved, indicating that the performance of N‐Ace is comparable with that of other acetylation prediction methods. N‐Ace not only provides a user‐friendly input/output interface but also is a creative method for predicting protein acetylation sites. This novel analytical resource is now freely available at http://N‐Ace.mbc.NCTU.edu.tw/.


BMC Research Notes | 2009

A comprehensive resource for integrating and displaying protein post-translational modifications.

Tzong-Yi Lee; Justin Bo Kai Hsu; Wen Chi Chang; Ting-Yuan Wang; Po-Chiang Hsu; Hsien-Da Huang

BackgroundProtein Post-Translational Modification (PTM) plays an essential role in cellular control mechanisms that adjust protein physical and chemical properties, folding, conformation, stability and activity, thus also altering protein function.FindingsdbPTM (version 1.0), which was developed previously, aimed on a comprehensive collection of protein post-translational modifications. In this update version (dbPTM2.0), we developed a PTM database towards an expert system of protein post-translational modifications. The database comprehensively collects experimental and predictive protein PTM sites. In addition, dbPTM2.0 was extended to a knowledge base comprising the modified sites, solvent accessibility of substrate, protein secondary and tertiary structures, protein domains, protein intrinsic disorder region, and protein variations. Moreover, this work compiles a benchmark to construct evaluation datasets for computational study to identifying PTM sites, such as phosphorylated sites, glycosylated sites, acetylated sites and methylated sites.ConclusionThe current release not only provides the sequence-based information, but also annotates the structure-based information for protein post-translational modification. The interface is also designed to facilitate the access to the resource. This effective database is now freely accessible at http://dbPTM.mbc.nctu.edu.tw/.


PLOS ONE | 2011

Incorporating Evolutionary Information and Functional Domains for Identifying RNA Splicing Factors in Humans

Justin Bo Kai Hsu; Neil Arvin Bretaña; Tzong-Yi Lee; Hsien-Da Huang

Regulation of pre-mRNA splicing is achieved through the interaction of RNA sequence elements and a variety of RNA-splicing related proteins (splicing factors). The splicing machinery in humans is not yet fully elucidated, partly because splicing factors in humans have not been exhaustively identified. Furthermore, experimental methods for splicing factor identification are time-consuming and lab-intensive. Although many computational methods have been proposed for the identification of RNA-binding proteins, there exists no development that focuses on the identification of RNA-splicing related proteins so far. Therefore, we are motivated to design a method that focuses on the identification of human splicing factors using experimentally verified splicing factors. The investigation of amino acid composition reveals that there are remarkable differences between splicing factors and non-splicing proteins. A support vector machine (SVM) is utilized to construct a predictive model, and the five-fold cross-validation evaluation indicates that the SVM model trained with amino acid composition could provide a promising accuracy (80.22%). Another basic feature, amino acid dipeptide composition, is also examined to yield a similar predictive performance to amino acid composition. In addition, this work presents that the incorporation of evolutionary information and domain information could improve the predictive performance. The constructed models have been demonstrated to effectively classify (73.65% accuracy) an independent data set of human splicing factors. The result of independent testing indicates that in silico identification could be a feasible means of conducting preliminary analyses of splicing factors and significantly reducing the number of potential targets that require further in vivo or in vitro confirmation.


Journal of Computer-aided Molecular Design | 2014

Incorporating significant amino acid pairs and protein domains to predict RNA splicing-related proteins with functional roles

Justin Bo Kai Hsu; Kai-Yao Huang; Tzu-Ya Weng; Chien-Hsun Huang; Tzong-Yi Lee

Abstract Machinery of pre-mRNA splicing is carried out through the interaction of RNA sequence elements and a variety of RNA splicing-related proteins (SRPs) (e.g. spliceosome and splicing factors). Alternative splicing, which is an important post-transcriptional regulation in eukaryotes, gives rise to multiple mature mRNA isoforms, which encodes proteins with functional diversities. However, the regulation of RNA splicing is not yet fully elucidated, partly because SRPs have not yet been exhaustively identified and the experimental identification is labor-intensive. Therefore, we are motivated to design a new method for identifying SRPs with their functional roles in the regulation of RNA splicing. The experimentally verified SRPs were manually curated from research articles. According to the functional annotation of Splicing Related Gene Database, the collected SRPs were further categorized into four functional groups including small nuclear Ribonucleoprotein, Splicing Factor, Splicing Regulation Factor and Novel Spliceosome Protein. The composition of amino acid pairs indicates that there are remarkable differences among four functional groups of SRPs. Then, support vector machines (SVMs) were utilized to learn the predictive models for identifying SRPs as well as their functional roles. The cross-validation evaluation presents that the SVM models trained with significant amino acid pairs and functional domains could provide a better predictive performance. In addition, the independent testing demonstrates that the proposed method could accurately identify SRPs in mammals/plants as well as effectively distinguish between SRPs and RNA-binding proteins. This investigation provides a practical means to identifying potential SRPs and a perspective for exploring the regulation of RNA splicing.


BMC Bioinformatics | 2013

An enhanced computational platform for investigating the roles of regulatory RNA and for identifying functional RNA motifs

Tzu Hao Chang; Hsi Yuan Huang; Justin Bo Kai Hsu; Shun Long Weng; Jorng Tzong Horng; Hsien-Da Huang

Collaboration


Dive into the Justin Bo Kai Hsu's collaboration.

Top Co-Authors

Avatar

Hsien-Da Huang

National Chiao Tung University

View shared research outputs
Top Co-Authors

Avatar
Top Co-Authors

Avatar

Wen Chi Chang

National Cheng Kung University

View shared research outputs
Top Co-Authors

Avatar

Dray Ming Shien

National Central University

View shared research outputs
Top Co-Authors

Avatar

Jorng Tzong Horng

National Central University

View shared research outputs
Top Co-Authors

Avatar

Po Chiang Hsu

National Chiao Tung University

View shared research outputs
Top Co-Authors

Avatar

Sheng-Da Hsu

National Chiao Tung University

View shared research outputs
Top Co-Authors

Avatar

Ting Yuan Wang

National Chiao Tung University

View shared research outputs
Top Co-Authors

Avatar

Tzu Hao Chang

Taipei Medical University

View shared research outputs
Top Co-Authors

Avatar

Chia-Hung Chien

National Chiao Tung University

View shared research outputs
Researchain Logo
Decentralizing Knowledge