Network


Latest external collaboration on country level. Dive into details by clicking on the dots.

Hotspot


Dive into the research topics where Ka-Lok Ng is active.

Publication


Featured researches published by Ka-Lok Ng.


Computers in Biology and Medicine | 2010

Prediction of protein functions based on function-function correlation relations

Ka-Lok Ng; Jin-Shuei Ciou; Chien-Hung Huang

A protein function pair approach, based on protein-protein interaction (PPI) data, is proposed to predict protein functions. Randomization tests are performed on the PPI dataset, which resulted in a protein function correlation scoring value which is used to rank the relative importance of a function pair. It has been found that certain classes of protein functions tend to be correlated together. Scoring values of these correlation pairs allow us to predict the functionality of a protein given that it interacts with proteins having well-defined function annotations. The jackknife test is used to validate the function pair method. The protein function pair approach achieves a prediction sensitivity comparable to an approach using more sophisticated method. The main advantages of this approach are as follows: (i) a set of function-function correlation relations are derived and intuitive biological interpretation can be achieved, and (ii) its simplicity, only two parameters are needed.


International Journal of Semantic Computing | 2008

USING SCDL FOR INTEGRATING TOOLS AND DATA FOR COMPLEX BIOMEDICAL APPLICATIONS

Shu Wang; Rouh-Mei Hu; Han C. W. Hsiao; David Hecht; Ka-Lok Ng; Rong-Ming Chen; Phillip C.-Y. Sheu; Jeffrey J. P. Tsai

Current bioinformatics tools or databases are very heterogeneous in terms of data formats, database schema, and terminologies. Additionally, most biomedical databases and analysis tools are scattered across different web sites making interoperability across such different services more difficult. It is desired that these diverse databases and analysis tools be normalized, integrated and encompassed with a semantic interface such that users of biological data and tools could communicate with the system in natural language and a workflow could be automatically generated and distributed into appropriate tools. In this paper, the BioSemantic System is presented to bridge complex biological/biomedical research problems and computational solutions via semantic computing. Due to the diversity of problems in various research fields, the semantic capability description language (SCDL) plays an important role as a common language and generic form for problem formalization. Several queries as well as their corresponding SCDL descriptions are provided as examples. For complex applications, multiple SCDL queries may be connected via control structures. For these cases, we present an algorithm to map a user request to one or more existing services if they exist.


Computers in Biology and Medicine | 2013

Prediction of microRNA-regulated protein interaction pathways in Arabidopsis using machine learning algorithms

Nilubon Kurubanjerdjit; Chien-Hung Huang; Yu-Liang Lee; Jeffrey J. P. Tsai; Ka-Lok Ng

MicroRNAs are small, endogenous RNAs found in many different species and are known to have an influence on diverse biological phenomena. They also play crucial roles in plant biological processes, such as metabolism, leaf sidedness and flower development. However, the functional roles of most microRNAs are still unknown. The identification of closely related microRNAs and target genes can be an essential first step towards the discovery of their combinatorial effects on different cellular states. A lot of research has tried to discover microRNAs and target gene interactions by implementing machine learning classifiers with target prediction algorithms. However, high rates of false positives have been reported as a result of undetermined factors which will affect recognition. Therefore, integrating diverse techniques could improve the prediction. In this paper we propose identifying microRNAs target of Arabidopsis thaliana by integrating prediction scores from PITA, miRanda and RNAHybrid algorithms used as a feature vector of microRNA-target interactions, and then implementing SVM, random forest tree and neural network machine learning algorithms to make final predictions by majority voting. Furthermore, microRNA target genes are linked with their protein-protein interaction (PPI) partners. We focus on plant resistance genes and transcription factor information to provide new insights into plant pathogen interaction networks. Downstream pathways are characterized by the Jaccard coefficient, which is implemented based on Gene Ontology. The database is freely accessible at http://ppi.bioinfo.asia.edu.tw/At_miRNA/.


Iet Systems Biology | 2016

Graph theory and stability analysis of protein complex interaction networks

Chien-Hung Huang; Teng-Hung Chen; Ka-Lok Ng

Protein complexes play an essential role in many biological processes. Complexes can interact with other complexes to form protein complex interaction network (PCIN) that involves in important cellular processes. There are relatively few studies on examining the interaction topology among protein complexes; and little is known about the stability of PCIN under perturbations. We employed graph theoretical approach to reveal hidden properties and features of four species PCINs. Two main issues are addressed, (i) the global and local network topological properties, and (ii) the stability of the networks under 12 types of perturbations. According to the topological parameter classification, we identified some critical protein complexes and validated that the topological analysis approach could provide meaningful biological interpretations of the protein complex systems. Through the Kolmogorov-Smimov test, we showed that local topological parameters are good indicators to characterise the structure of PCINs. We further demonstrated the effectiveness of the current approach by performing the scalability and data normalization tests. To measure the robustness of PCINs, we proposed to consider eight topological-based perturbations, which are specifically applicable in scenarios of targeted, sustained attacks. We found that the degree-based, betweenness-based and brokering-coefficient-based perturbations have the largest effect on network stability.


BMC Systems Biology | 2014

A model comparison study of the flowering time regulatory network in Arabidopsis

Charles C. N. Wang; Pei-Chun Chang; Ka-Lok Ng; Chun-Ming Chang; Phillip C.-Y. Sheu; Jeffrey J. P. Tsai

BackgroundSeveral dynamic models of a gene regulatory network of the light-induced floral transition process in Arabidopsis have been developed to capture the behavior of gene transcription and infer predictions based on experimental observations. It has been proven that the models can make accurate and novel predictions, which generate testable hypotheses.Two major issues were addressed in this study. First, construction of dynamic models for gene regulatory networks requires the use of mathematic modeling that comprises equations of a large number of parameters. Second, the binding mechanism of the transcription factor with DNA is another factor that requires detailed modeling. The first issue was tackled by adopting an optimization algorithm, and the second was addressed by comparing the performance of three alternative modeling approaches, namely the S-system, the Michaelis-Menten model and the Mass-action model. The efficiencies of parameter estimation and modeling performance were calculated based on least square error (O(p)), mean relative error (MRE) and Akaike Information Criterion (AIC).ResultsWe compared three models to describe gene regulation of the flowering transition process in Arabidopsis. The Mass-action model is the simplest and has the least parameters. It is therefore less computation-intensive with the smallest AIC value. The disadvantage, however, is that it assumes the system is simply a second order reaction which is not the case in our study. The Michaelis-Menten model also assumes the system is homogeneous and ignores the intracellular protein transport process. The S-system model has the best performance and it does describe the diffusion effects. A disadvantage of the S-system is that it involves the most parameters. The largest AIC value also implies an over-fitting may occur in parameter estimation.ConclusionsThree dynamic models were adopted to describe the dynamics of the gene regulatory network of the flowering transition process in Arabidopsis. Based on MRE, the least square error and global sensitivity analysis, the S-system has the best performance. However, the fact that it has the highest AIC suggests an over-fitting may occur in parameter estimation. The result of this study may need to be applied carefully when modeling complex gene regulatory networks.


bioinformatics and bioengineering | 2004

A cross-species study of the protein-protein interaction networks via the random graph approach

Ka-Lok Ng; Chien-Hung Huang

We employed the random graph theory approach to analyze the protein-protein interacting database DIP, for six different species (S. cerevisiae, H. pylori, E. coli, H. sapiens, M. musculus and D. melanogaster). Two global topological parameters (node connectivity, average diameter) were used to characterize these protein-protein interaction networks (PINs). The logarithm of the connectivity distribution vs. the logarithm of connectivity plot indicates that it follows a power law behavior quite well for the six species. We also demonstrated that the interaction networks are quite robust when subject to random perturbation. Node degree correlation study supports the earlier results that nodes of low connectivity are correlated, whereas nodes of high connectivity are not directly linked. These results provided some evidence suggesting such correlation relations might be a general feature of the PINs across different species.


BioMed Research International | 2015

Prediction of Cancer Proteins by Integrating Protein Interaction, Domain Frequency, and Domain Interaction Data Using Machine Learning Algorithms

Chien-Hung Huang; Huai-Shun Peng; Ka-Lok Ng

Many proteins are known to be associated with cancer diseases. It is quite often that their precise functional role in disease pathogenesis remains unclear. A strategy to gain a better understanding of the function of these proteins is to make use of a combination of different aspects of proteomics data types. In this study, we extended Araguess method by employing the protein-protein interaction (PPI) data, domain-domain interaction (DDI) data, weighted domain frequency score (DFS), and cancer linker degree (CLD) data to predict cancer proteins. Performances were benchmarked based on three kinds of experiments as follows: (I) using individual algorithm, (II) combining algorithms, and (III) combining the same classification types of algorithms. When compared with Araguess method, our proposed methods, that is, machine learning algorithm and voting with the majority, are significantly superior in all seven performance measures. We demonstrated the accuracy of the proposed method on two independent datasets. The best algorithm can achieve a hit ratio of 89.4% and 72.8% for lung cancer dataset and lung cancer microarray study, respectively. It is anticipated that the current research could help understand disease mechanisms and diagnosis.


BioMed Research International | 2014

Drug Repositioning Discovery for Early- and Late-Stage Non-Small-Cell Lung Cancer

Chien-Hung Huang; Peter Mu-Hsin Chang; Yong-Jie Lin; Cheng-Hsu Wang; Chi-Ying F. Huang; Ka-Lok Ng

Drug repositioning is a popular approach in the pharmaceutical industry for identifying potential new uses for existing drugs and accelerating the development time. Non-small-cell lung cancer (NSCLC) is one of the leading causes of death worldwide. To reduce the biological heterogeneity effects among different individuals, both normal and cancer tissues were taken from the same patient, hence allowing pairwise testing. By comparing early- and late-stage cancer patients, we can identify stage-specific NSCLC genes. Differentially expressed genes are clustered separately to form up- and downregulated communities that are used as queries to perform enrichment analysis. The results suggest that pathways for early- and late-stage cancers are different. Sets of up- and downregulated genes were submitted to the cMap web resource to identify potential drugs. To achieve high confidence drug prediction, multiple microarray experimental results were merged by performing meta-analysis. The results of a few drug findings are supported by MTT assay or clonogenic assay data. In conclusion, we have been able to assess the potential existing drugs to identify novel anticancer drugs, which may be helpful in drug repositioning discovery for NSCLC.


bioinformatics and bioengineering | 2011

Simulation of Bacterial Chemotaxis by the Random Run and Tumble Model

Charles C. N. Wang; Ka-Lok Ng; Yu-Ching Chen; Phillip C.Y. Sheu; Jeffrey J. P. Tsai

In this paper, the movement of bacteria, i.e. E. coli, is simulated based on the run and tumble model. The random walk parameters, such as the speed, tumbling frequency, run duration, and the turn angle between two successive runs were taken from experimental measurements, and use them to simulate the bacteria movement in cases of three different uniform chemical concentration distributions. The motility coefficient is computed to characterize the migration responses. Furthermore, a case of chemical attractant gradient distribution in the environment is designed to validate the run and tumble model. It is found that bacteria move with higher motility coefficient in higher chemical concentrations. Simulation results suggested that bacterial run and tumble model can be used to describe real bacteria movement.


BMC Bioinformatics | 2016

Drug repositioning for non-small cell lung cancer by using machine learning algorithms and topological graph theory

Chien-Hung Huang; Peter Mu-Hsin Chang; Chia-Wei Hsu; Chi-Ying F. Huang; Ka-Lok Ng

BackgroundNon-small cell lung cancer (NSCLC) is one of the leading causes of death globally, and research into NSCLC has been accumulating steadily over several years. Drug repositioning is the current trend in the pharmaceutical industry for identifying potential new uses for existing drugs and accelerating the development process of drugs, as well as reducing side effects.ResultsThis work integrates two approaches - machine learning algorithms and topological parameter-based classification - to develop a novel pipeline of drug repositioning to analyze four lung cancer microarray datasets, enriched biological processes, potential therapeutic drugs and targeted genes for NSCLC treatments. A total of 7 (8) and 11 (12) promising drugs (targeted genes) were discovered for treating early- and late-stage NSCLC, respectively. The effectiveness of these drugs is supported by the literature, experimentally determined in-vitro IC50 and clinical trials. This work provides better drug prediction accuracy than competitive research according to IC50 measurements.ConclusionsWith the novel pipeline of drug repositioning, the discovery of enriched pathways and potential drugs related to NSCLC can provide insight into the key regulators of tumorigenesis and the treatment of NSCLC. Based on the verified effectiveness of the targeted drugs predicted by this pipeline, we suggest that our drug-finding pipeline is effective for repositioning drugs.

Collaboration


Dive into the Ka-Lok Ng's collaboration.

Top Co-Authors

Avatar

Chien-Hung Huang

National Formosa University

View shared research outputs
Top Co-Authors

Avatar
Top Co-Authors

Avatar

Jywe-Fei Fang

Beijing Jiaotong University

View shared research outputs
Top Co-Authors

Avatar

Chi-Ying F. Huang

National Yang-Ming University

View shared research outputs
Top Co-Authors

Avatar

Po-Han Lee

National Taiwan Normal University

View shared research outputs
Top Co-Authors

Avatar

Peter Mu-Hsin Chang

Taipei Veterans General Hospital

View shared research outputs
Top Co-Authors

Avatar

Shan-Chih Lee

Chung Shan Medical University

View shared research outputs
Top Co-Authors

Avatar
Top Co-Authors

Avatar

Jim Jinn Chyuan Sheu

National Sun Yat-sen University

View shared research outputs
Top Co-Authors

Avatar

Rong-Ming Chen

National University of Tainan

View shared research outputs
Researchain Logo
Decentralizing Knowledge