Chun-Yuan Lin | Researchain

Archive Network Publication Hotspot Collaboration

Network

Latest external collaboration on country level. Dive into details by clicking on the dots.

Explore More

Hotspot

Dive into the research topics where Chun-Yuan Lin is active.

Explore More

Publication

Featured researches published by Chun-Yuan Lin.

IEEE Transactions on Nanobioscience | 2007

Feature Selection and Combination Criteria for Improving Accuracy in Protein Structure Prediction

Ken-Li Lin; Chun-Yuan Lin; Chuen-Der Huang; Hsiu-Ming Chang; Chiao-Yun Yang; Chin-Teng Lin; Chuan Yi Tang; D.F. Hsu

The classification of protein structures is essential for their function determination in bioinformatics. At present, a reasonably high rate of prediction accuracy has been achieved in classifying proteins into four classes in the SCOP database according to their primary amino acid sequences. However, for further classification into fine-grained folding categories, especially when the number of possible folding patterns as those defined in the SCOP database is large, it is still quite a challenge. In our previous work, we have proposed a two-level classification strategy called hierarchical learning architecture (HLA) using neural networks and two indirect coding features to differentiate proteins according to their classes and folding patterns, which achieved an accuracy rate of 65.5%. In this paper, we use a combinatorial fusion technique to facilitate feature selection and combination for improving predictive accuracy in protein structure classification. When applying various criteria in combinatorial fusion to the protein fold prediction approach using neural networks with HLA and the radial basis function network (RBFN), the resulting classification has an overall prediction accuracy rate of 87% for four classes and 69.6% for 27 folding categories. These rates are significantly higher than the accuracy rate of 56.5% previously obtained by Ding and Dubchak. Our results demonstrate that data fusion is a viable method for feature selection and combination in the prediction and classification of protein structure.

IEEE Transactions on Computers | 2002

Efficient representation scheme for multidimensional array operations

Chun-Yuan Lin; Jen-Shiuh Liu; Yeh-Ching Chung

Array operations are used in a large number of important scientific codes. To implement these array operations efficiently, many methods have been proposed in the literature, most of which are focused on two-dimensional arrays. When extended to higher dimensional arrays, these methods usually do not perform well. Hence, designing efficient algorithms for multidimensional array operations becomes an important issue. We propose a new scheme, extended Karnaugh map representation (EKMR), for the multidimensional array representation. The main idea of the EKMR scheme is to represent a multidimensional array by a set of two-dimensional arrays. Hence, efficient algorithm design for multidimensional array operations becomes less complicated. To evaluate the proposed scheme, we design efficient algorithms for multidimensional array operations, matrix-matrix addition/subtraction and matrix-matrix multiplications, based on the EKMR and the traditional matrix representation (TMR) schemes. Theoretical and experimental tests for these array operations were conducted. In the experimental test, we compare the performance of intrinsic functions provided by the Fortran 90 compiler with those based on the EKMR scheme. The experimental results show that the algorithms based on the EKMR scheme outperform those based on the TMR scheme and those provided by the Fortran 90 compiler.

IEEE Transactions on Computers | 2003

Efficient data compression methods for multidimensional sparse array operations based on the EKMR scheme

Chun-Yuan Lin; Yeh-Ching Chung; Jen-Shiuh Liu

We have proposed the extended Karnaugh map representation (EKMH) scheme for multidimensional array representation. We propose two data compression schemes, EKMR compressed row/column storage (ECRS/ECCS), for multidimensional sparse arrays based on the EKMR scheme. To evaluate the proposed schemes, we compare them to the CRS/CCS schemes. Both theoretical analysis and experimental tests were conducted. In the theoretical analysis, we analyze the CRS/CCS and the ECRS/ECCS schemes in terms of the time complexity, the space complexity, and the range of their usability for practical applications. In experimental tests, we compare the compressing time of sparse arrays and the execution time of matrix-matrix addition and matrix-matrix multiplication based on the CRS/CCS and the ECRS/ECCS schemes. The theoretical analysis and experimental results show that the ECRS/ECCS schemes are superior to the CRS/CCS schemes for all the evaluated criteria, except the space complexity in some case.

Journal of Chemical Information and Modeling | 2011

Development of novel 3D-QSAR combination approach for screening and optimizing B-Raf inhibitors in silico.

Chun-Yuan Lin; Jiayi Zhou; Hsiao-Chieh Chi; Ting-Shou Chen; Chun-Chung Wang; Hsiang-Wen Tseng; Chuan Yi Tang

B-Raf is a member of the RAF family of serine/threonine kinases: it mediates cell division, differentiation, and apoptosis signals through the RAS-RAF-MAPK pathway. Thus, B-Raf is of keen interest in cancer therapy, such as melanoma. In this study, we propose the first combination approach to integrate the pharmacophore (PhModel), CoMFA, and CoMSIA models for B-Raf, and this approach could be used for screening and optimizing potential B-Raf inhibitors in silico. Ten PhModels were generated based on the HypoGen BEST algorithm with the flexible fit method and diverse inhibitor structures. Each PhModel was designated to the alignment rule and screening interface for CoMFA and CoMSIA models. Therefore, CoMFA and CoMSIA models could align and recognize diverse inhibitor structures. We used two quality validation methods to test the predication accuracy of these combination models. In the previously proposed combination approaches, they have a common factor in that the number of training set inhibitors is greater than that of testing set inhibitors. In our study, the 189 known diverse series B-Raf inhibitors, which are 7-fold the number of training set inhibitors, were used as a testing set in the partial least-squares validation. The best validation results were made by the CoMFA09 and CoMSIA09 models based on the Hypo09 alignment model. The predictive r(2)(pred) values of 0.56 and 0.56 were derived from the CoMFA09 and CoMSIA09 models, respectively. The CoMFA09 and CoMSIA09 models also had a satisfied predication accuracy of 77.78% and 80%, and the goodness of hit test score of 0.675 and 0.699, respectively. These results indicate that our combination approach could effectively identify diverse B-Raf inhibitors and predict the activity.

Computational Biology and Chemistry | 2015

CUDA ClustalW

Che-Lun Hung; Yu-Shiang Lin; Chun-Yuan Lin; Yeh-Ching Chung; Yi-Fang Chung

For biological applications, sequence alignment is an important strategy to analyze DNA and protein sequences. Multiple sequence alignment is an essential methodology to study biological data, such as homology modeling, phylogenetic reconstruction and etc. However, multiple sequence alignment is a NP-hard problem. In the past decades, progressive approach has been proposed to successfully align multiple sequences by adopting iterative pairwise alignments. Due to rapid growth of the next generation sequencing technologies, a large number of sequences can be produced in a short period of time. When the problem instance is large, progressive alignment will be time consuming. Parallel computing is a suitable solution for such applications, and GPU is one of the important architectures for contemporary parallel computing researches. Therefore, we proposed a GPU version of ClustalW v2.0.11, called CUDA ClustalW v1.0, in this work. From the experiment results, it can be seen that the CUDA ClustalW v1.0 can achieve more than 33× speedups for overall execution time by comparing to ClustalW v2.0.11.

IEEE Transactions on Parallel and Distributed Systems | 2003

Efficient data parallel algorithms for multidimensional array operations based on the EKMR scheme for distributed memory multicomputers

Chun-Yuan Lin; Yeh-Ching Chung; Jen-Shiuh Liu

Array operations are useful in a large number of important scientific codes, such as molecular dynamics, finite element methods, climate modeling, atmosphere and ocean sciences, etc. In our previous work, we have proposed a scheme of extended Karnaugh map representation (EKMR) for multidimensional array representation. We have shown that sequential multidimensional array operation algorithms based on the EKMR scheme have better performance than those based on the traditional matrix representation (TMR) scheme. Since parallel multidimensional array operations have been an extensively investigated problem, we present efficient data parallel algorithms for multidimensional array operations based on the EKMR scheme for distributed memory multicomputers. In a data parallel programming paradigm, in general, we distribute array elements to processors based on various distribution schemes, do local computation in each processor, and collect computation results from each processor. Based on the row, column, and 2D mesh distribution schemes, we design data parallel algorithms for matrix-matrix addition and matrix-matrix multiplication array operations in both TMR and EKMR schemes for multidimensional arrays. We also design data parallel algorithms for six Fortran 90 array intrinsic functions: All, Maxval, Merge, Pack, Sum, and Cshift. We compare the time of the data distribution, the local computation, and the result collection phases of these array operations based on the TMR and the EKMR schemes. The experimental results show that algorithms based on the EKMR scheme outperform those based on the TMR scheme for all test cases.

Bioorganic & Medicinal Chemistry Letters | 2011

Pharmacophore modeling and virtual screening to identify potential RET kinase inhibitors

Chung-Wai Shiau; Ting-Shou Chen; Ching-Huai Ko; Chih-Lung Lin; Chun-Yuan Lin; Chrong-Shiong Hwang; Chuan Yi Tang; Wan-Ru Chen; Jui-Wen Huang

Chemical features based 3D pharmacophore model for REarranged during Transfection (RET) tyrosine kinase were developed by using a training set of 26 structurally diverse known RET inhibitors. The best pharmacophore hypothesis, which identified inhibitors with an associated correlation coefficient of 0.90 between their experimental and estimated anti-RET values, contained one hydrogen-bond acceptor, one hydrogen-bond donor, one hydrophobic, and one ring aromatic features. The model was further validated by a testing set, Fischers randomization test, and goodness of hit (GH) test. We applied this pharmacophore model to screen NCI database for potential RET inhibitors. The hits were docked to RET with GOLD and CDOCKER after filtering by Lipinskis rules. Ultimately, 24 molecules were selected as potential RET inhibitors for further investigation.

Nucleic Acids Research | 2006

GeneAlign: a coding exon prediction tool based on phylogenetical comparisons

Shu Ju Hsieh; Chun-Yuan Lin; Ning-Han Liu; Wei Yuan Chow; Chuan Yi Tang

GeneAlign is a coding exon prediction tool for predicting protein coding genes by measuring the homologies between a sequence of a genome and related sequences, which have been annotated, of other genomes. Identifying protein coding genes is one of most important tasks in newly sequenced genomes. With increasing numbers of gene annotations verified by experiments, it is feasible to identify genes in the newly sequenced genomes by comparing to annotated genes of phylogenetically close organisms. GeneAlign applies CORAL, a heuristic linear time alignment tool, to determine if regions flanked by the candidate signals (initiation codon-GT, AG-GT and AG-STOP codon) are similar to annotated coding exons. Employing the conservation of gene structures and sequence homologies between protein coding regions increases the prediction accuracy. GeneAlign was tested on Projector dataset of 491 human–mouse homologous sequence pairs. At the gene level, both the average sensitivity and the average specificity of GeneAlign are 81%, and they are larger than 96% at the exon level. The rates of missing exons and wrong exons are smaller than 1%. GeneAlign is a free tool available at .

BioMed Research International | 2013

GPU-based cloud service for Smith-Waterman algorithm using frequency distance filtration scheme.

Sheng-Ta Lee; Chun-Yuan Lin; Che Lun Hung

As the conventional means of analyzing the similarity between a query sequence and database sequences, the Smith-Waterman algorithm is feasible for a database search owing to its high sensitivity. However, this algorithm is still quite time consuming. CUDA programming can improve computations efficiently by using the computational power of massive computing hardware as graphics processing units (GPUs). This work presents a novel Smith-Waterman algorithm with a frequency-based filtration method on GPUs rather than merely accelerating the comparisons yet expending computational resources to handle such unnecessary comparisons. A user friendly interface is also designed for potential cloud server applications with GPUs. Additionally, two data sets, H1N1 protein sequences (query sequence set) and human protein database (database set), are selected, followed by a comparison of CUDA-SW and CUDA-SW with the filtration method, referred to herein as CUDA-SWf. Experimental results indicate that reducing unnecessary sequence alignments can improve the computational time by up to 41%. Importantly, by using CUDA-SWf as a cloud service, this application can be accessed from any computing environment of a device with an Internet connection without time constraints.

bioinformatics and bioengineering | 2005

Feature selection and combination criteria for improving predictive accuracy in protein structure classification

Chun-Yuan Lin; Ken-Li Lin; Chuen-Der Huang; Hsiu-Ming Chang; Chiao Yun Yang; Chin-Teng Lin; Chuan Yi Tang; D.F. Hsu

The classification of protein structures is essential for their function determination in bioinformatics. The success of the protein structure classification depends on two factors: the computational methods used and the features selected. In this paper, we use a combinatorial fusion analysis technique to facilitate feature selection and combination for improving predictive accuracy in protein structure classification. When applying these criteria to our previous work, the resulting classification has an overall prediction accuracy rate of 87% for four classes and 69.6% for 27 folding categories. These rates are significantly higher than our previous work and demonstrate that combinatorial fusion is a valuable method for protein structure classification.

Explore More