Network


Latest external collaboration on country level. Dive into details by clicking on the dots.

Hotspot


Dive into the research topics where Chumpol Ngamphiw is active.

Publication


Featured researches published by Chumpol Ngamphiw.


Science | 2009

Mapping Human Genetic Diversity in Asia

Mahmood Ameen Abdulla; Ikhlak Ahmed; Anunchai Assawamakin; Jong Bhak; Samir K. Brahmachari; Gayvelline C. Calacal; Amit Chaurasia; Chien-Hsiun Chen; Jieming Chen; Yuan-Tsong Chen; Jiayou Chu; Eva Maria Cutiongco-de la Paz; Maria Corazon A. De Ungria; Frederick C. Delfin; Juli Edo; Suthat Fuchareon; Ho Ghang; Takashi Gojobori; Junsong Han; Sheng Feng Ho; Boon Peng Hoh; Wei Huang; Hidetoshi Inoko; Pankaj Jha; Timothy A. Jinam; Li Jin; Jongsun Jung; Daoroong Kangwanpong; Jatupol Kampuansai; Giulia C. Kennedy

Patterns of Early Migration In order to gain insight into various migrations that must have happened during movement of early humans into Asia and the subsequent populating of the largest continent on Earth, the HUGO Pan-Asian SNP Consortium (p. 1541) analyzed genetic variation in almost 2000 individuals representing 73 Asian and two non-Asian populations. The results suggest that there may have been a single major migration of people into Asia and a subsequent south-to-north migration across the continent. While most populations from the same linguistic group tend to cluster together in terms of relatedness, several do not, clustering instead with their geographic neighbors, suggesting either substantial recent mixing among the populations or language replacement. Furthermore, data from indigenous Taiwanese populations appear to be inconsistent with the idea of a Taiwan homeland for Austronesian populations. Genetic analyses of Asian peoples suggest that the continent was populated through a single migration event. Asia harbors substantial cultural and linguistic diversity, but the geographic structure of genetic variation across the continent remains enigmatic. Here we report a large-scale survey of autosomal variation from a broad geographic sample of Asian human populations. Our results show that genetic ancestry is strongly correlated with linguistic affiliations as well as geography. Most populations show relatedness within ethnic/linguistic groups, despite prevalent gene flow among populations. More than 90% of East Asian (EA) haplotypes could be found in either Southeast Asian (SEA) or Central-South Asian (CSA) populations and show clinal structure with haplotype diversity decreasing from south to north. Furthermore, 50% of EA haplotypes were found in SEA only and 5% were found in CSA only, indicating that SEA was a major geographic source of EA populations.


DNA Research | 2011

Transcriptome Sequencing of Hevea brasiliensis for Development of Microsatellite Markers and Construction of a Genetic Linkage Map

Kanokporn Triwitayakorn; Pornsupa Chatkulkawin; Supanath Kanjanawattanawong; Supajit Sraphet; Thippawan Yoocha; Duangjai Sangsrakru; Juntima Chanprasert; Chumpol Ngamphiw; Nukoon Jomchai; Kanikar Therawattanasuk; Sithichoke Tangphatsornruang

To obtain more information on the Hevea brasiliensis genome, we sequenced the transcriptome from the vegetative shoot apex yielding 2 311 497 reads. Clustering and assembly of the reads produced a total of 113 313 unique sequences, comprising 28 387 isotigs and 84 926 singletons. Also, 17 819 expressed sequence tag (EST)-simple sequence repeats (SSRs) were identified from the data set. To demonstrate the use of this EST resource for marker development, primers were designed for 430 of the EST-SSRs. Three hundred and twenty-three primer pairs were amplifiable in H. brasiliensis clones. Polymorphic information content values of selected 47 SSRs among 20 H. brasiliensis clones ranged from 0.13 to 0.71, with an average of 0.51. A dendrogram of genetic similarities between the 20 H. brasiliensis clones using these 47 EST-SSRs suggested two distinct groups that correlated well with clone pedigree. These novel EST-SSRs together with the published SSRs were used for the construction of an integrated parental linkage map of H. brasiliensis based on 81 lines of an F1 mapping population. The map consisted of 97 loci, consisting of 37 novel EST-SSRs and 60 published SSRs, distributed on 23 linkage groups and covered 842.9 cM with a mean interval of 11.9 cM and ∼4 loci per linkage group. Although the numbers of linkage groups exceed the haploid number (18), but with several common markers between homologous linkage groups with the previous map indicated that the F1 map in this study is appropriate for further study in marker-assisted selection.


PLOS ONE | 2011

Hypomethylation of Intragenic LINE-1 Represses Transcription in Cancer Cells through AGO2

Chatchawit Aporntewan; Chureerat Phokaew; Jittima Piriyapongsa; Chumpol Ngamphiw; Chupong Ittiwut; Sissades Tongsima; Apiwat Mutirangura

In human cancers, the methylation of long interspersed nuclear element -1 (LINE-1 or L1) retrotransposons is reduced. This occurs within the context of genome wide hypomethylation, and although it is common, its role is poorly understood. L1s are widely distributed both inside and outside of genes, intragenic and intergenic, respectively. Interestingly, the insertion of active full-length L1 sequences into host gene introns disrupts gene expression. Here, we evaluated if intragenic L1 hypomethylation influences their host gene expression in cancer. First, we extracted data from L1base (http://l1base.molgen.mpg.de), a database containing putatively active L1 insertions, and compared intragenic and intergenic L1 characters. We found that intragenic L1 sequences have been conserved across evolutionary time with respect to transcriptional activity and CpG dinucleotide sites for mammalian DNA methylation. Then, we compared regulated mRNA levels of cells from two different experiments available from Gene Expression Omnibus (GEO), a database repository of high throughput gene expression data, (http://www.ncbi.nlm.nih.gov/geo) by chi-square. The odds ratio of down-regulated genes between demethylated normal bronchial epithelium and lung cancer was high (p<1E−27; OR = 3.14; 95% CI = 2.54–3.88), suggesting cancer genome wide hypomethylation down-regulating gene expression. Comprehensive analysis between L1 locations and gene expression showed that expression of genes containing L1s had a significantly higher likelihood to be repressed in cancer and hypomethylated normal cells. In contrast, many mRNAs derived from genes containing L1s are elevated in Argonaute 2 (AGO2 or EIF2C2)-depleted cells. Hypomethylated L1s increase L1 mRNA levels. Finally, we found that AGO2 targets intronic L1 pre-mRNA complexes and represses cancer genes. These findings represent one of the mechanisms of cancer genome wide hypomethylation altering gene expression. Hypomethylated intragenic L1s are a nuclear siRNA mediated cis-regulatory element that can repress genes. This epigenetic regulation of retrotransposons likely influences many aspects of genomic biology.


PLOS ONE | 2011

PanSNPdb: The Pan-Asian SNP Genotyping Database

Chumpol Ngamphiw; Anunchai Assawamakin; Shuhua Xu; Philip J. Shaw; Jin Ok Yang; Ho Ghang; Jong Bhak; Edison T. Liu; Sissades Tongsima

The HUGO Pan-Asian SNP consortium conducted the largest survey to date of human genetic diversity among Asians by sampling 1,719 unrelated individuals among 71 populations from China, India, Indonesia, Japan, Malaysia, the Philippines, Singapore, South Korea, Taiwan, and Thailand. We have constructed a database (PanSNPdb), which contains these data and various new analyses of them. PanSNPdb is a research resource in the analysis of the population structure of Asian peoples, including linkage disequilibrium patterns, haplotype distributions, and copy number variations. Furthermore, PanSNPdb provides an interactive comparison with other SNP and CNV databases, including HapMap3, JSNP, dbSNP and DGV and thus provides a comprehensive resource of human genetic diversity. The information is accessible via a widely accepted graphical interface used in many genetic variation databases. Unrestricted access to PanSNPdb and any associated files is available at: http://www4a.biotec.or.th/PASNP.


BMC Genomics | 2012

iLOCi: a SNP interaction prioritization technique for detecting epistasis in genome-wide association studies

Jittima Piriyapongsa; Chumpol Ngamphiw; Apichart Intarapanich; Supasak Kulawonganunchai; Anunchai Assawamakin; Philip J. Shaw; Sissades Tongsima

BackgroundGenome-wide association studies (GWAS) do not provide a full account of the heritability of genetic diseases since gene-gene interactions, also known as epistasis are not considered in single locus GWAS. To address this problem, a considerable number of methods have been developed for identifying disease-associated gene-gene interactions. However, these methods typically fail to identify interacting markers explaining more of the disease heritability over single locus GWAS, since many of the interactions significant for disease are obscured by uninformative marker interactions e.g., linkage disequilibrium (LD).ResultsIn this study, we present a novel SNP interaction prioritization algorithm, named iLOCi (Interacting Loci). This algorithm accounts for marker dependencies separately in case and control groups. Disease-associated interactions are then prioritized according to a novel ranking score calculated from the difference in marker dependencies for every possible pair between case and control groups. The analysis of a typical GWAS dataset can be completed in less than a day on a standard workstation with parallel processing capability. The proposed framework was validated using simulated data and applied to real GWAS datasets using the Wellcome Trust Case Control Consortium (WTCCC) data. The results from simulated data showed the ability of iLOCi to identify various types of gene-gene interactions, especially for high-order interaction. From the WTCCC data, we found that among the top ranked interacting SNP pairs, several mapped to genes previously known to be associated with disease, and interestingly, other previously unreported genes with biologically related roles.ConclusioniLOCi is a powerful tool for uncovering true disease interacting markers and thus can provide a more complete understanding of the genetic basis underlying complex disease. The program is available for download at http://www4a.biotec.or.th/GI/tools/iloci.


PLOS ONE | 2012

microPIR: An Integrated Database of MicroRNA Target Sites within Human Promoter Sequences

Jittima Piriyapongsa; Chumpol Ngamphiw; Sissades Tongsima

Background microRNAs are generally understood to regulate gene expression through binding to target sequences within 3′-UTRs of mRNAs. Therefore, computational prediction of target sites is usually restricted to these gene regions. Recent experimental studies though have suggested that microRNAs may alternatively modulate gene expression by interacting with promoters. A database of potential microRNA target sites in promoters would stimulate research in this field leading to more understanding of complex microRNA regulatory mechanism. Methodology We developed a database hosting predicted microRNA target sites located within human promoter sequences and their associated genomic features, called microPIR (microRNA-Promoter Interaction Resource). microRNA seed sequences were used to identify perfect complementary matching sequences in the human promoters and the potential target sites were predicted using the RNAhybrid program. >15 million target sites were identified which are located within 5000 bp upstream of all human genes, on both sense and antisense strands. The experimentally confirmed argonaute (AGO) binding sites and EST expression data including the sequence conservation across vertebrate species of each predicted target are presented for researchers to appraise the quality of predicted target sites. The microPIR database integrates various annotated genomic sequence databases, e.g. repetitive elements, transcription factor binding sites, CpG islands, and SNPs, offering users the facility to extensively explore relationships among target sites and other genomic features. Furthermore, functional information of target genes including gene ontologies, KEGG pathways, and OMIM associations are provided. The built-in genome browser of microPIR provides a comprehensive view of multidimensional genomic data. Finally, microPIR incorporates a PCR primer design module to facilitate experimental validation. Conclusions The proposed microPIR database is a useful integrated resource of microRNA-promoter target interactions for experimental microRNA researchers and computational biologists to study the microRNA regulation through gene promoter. The database can be freely accessed from: http://www4a.biotec.or.th/micropir.


BMC Bioinformatics | 2009

Iterative pruning PCA improves resolution of highly structured populations.

Apichart Intarapanich; Philip J. Shaw; Anunchai Assawamakin; Pongsakorn Wangkumhang; Chumpol Ngamphiw; Kridsadakorn Chaichoompu; Jittima Piriyapongsa; Sissades Tongsima

BackgroundNon-random patterns of genetic variation exist among individuals in a population owing to a variety of evolutionary factors. Therefore, populations are structured into genetically distinct subpopulations. As genotypic datasets become ever larger, it is increasingly difficult to correctly estimate the number of subpopulations and assign individuals to them. The computationally efficient non-parametric, chiefly Principal Components Analysis (PCA)-based methods are thus becoming increasingly relied upon for population structure analysis. Current PCA-based methods can accurately detect structure; however, the accuracy in resolving subpopulations and assigning individuals to them is wanting. When subpopulations are closely related to one another, they overlap in PCA space and appear as a conglomerate. This problem is exacerbated when some subpopulations in the dataset are genetically far removed from others. We propose a novel PCA-based framework which addresses this shortcoming.ResultsA novel population structure analysis algorithm called iterative pruning PCA (ipPCA) was developed which assigns individuals to subpopulations and infers the total number of subpopulations present. Genotypic data from simulated and real population datasets with different degrees of structure were analyzed. For datasets with simple structures, the subpopulation assignments of individuals made by ipPCA were largely consistent with the STRUCTURE, BAPS and AWclust algorithms. On the other hand, highly structured populations containing many closely related subpopulations could be accurately resolved only by ipPCA, and not by other methods.ConclusionThe algorithm is computationally efficient and not constrained by the dataset complexity. This systematic subpopulation assignment approach removes the need for prior population labels, which could be advantageous when cryptic stratification is encountered in datasets containing individuals otherwise assumed to belong to a homogenous population.


international joint conference on computer science and software engineering | 2011

Efficient large Pearson correlation matrix computing using hybrid MPI/CUDA

Ekasit Kijsipongse; Suriya U-ruekolan; Chumpol Ngamphiw; Sissades Tongsima

The calculation of pairwise correlation coefficient on a dataset, known as the correlation matrix, is often used in data analysis, signal processing, pattern recognition, image processing, and bioinformatics. With the state-of-the-art Graphic Processing Units (GPUs) that consist of massive cores capable to do processing up to several Gflops, the calculation of correlation matrix can be accelerated several times over traditional CPUs. However, due to the rapid growth of the data in the digital era, the correlation matrix calculation becomes computing intensive which needs to be executed on multiple GPUs. As of now, GPUs are common components in data center at many institutions. Their GPU deployment tends towards a GPU cluster which each node is equipped with GPUs. In this paper, we propose a parallel computing based on the hybrid MPI/CUDA programming for fast and efficient Pearson correlation matrix calculation on GPU clusters. At coarse grain parallelism, the correlation matrix is partitioned into tiles which are distributed to execute concurrently on many GPUs using MPI. At fine grain level, the CUDA kernel function on each node performs massively parallel computing on a GPU. To balance load across all GPUs, we adopt the work pool model which there is a master node that manages tasks in the work pool and dynamically assign tasks to worker nodes. The result of the evaluation shows that the proposed work can ensure the load balance across different GPUs and thus gives better execution time than using a simple static data partitioning.


BMC Bioinformatics | 2008

VarDetect: a nucleotide sequence variation exploratory tool

Chumpol Ngamphiw; Supasak Kulawonganunchai; Anunchai Assawamakin; Ekachai Jenwitheesuk; Sissades Tongsima

BackgroundSingle nucleotide polymorphisms (SNPs) are the most commonly studied units of genetic variation. The discovery of such variation may help to identify causative gene mutations in monogenic diseases and SNPs associated with predisposing genes in complex diseases. Accurate detection of SNPs requires software that can correctly interpret chromatogram signals to nucleotides.ResultsWe present VarDetect, a stand-alone nucleotide variation exploratory tool that automatically detects nucleotide variation from fluorescence based chromatogram traces. Accurate SNP base-calling is achieved using pre-calculated peak content ratios, and is enhanced by rules which account for common sequence reading artifacts. The proposed software tool is benchmarked against four other well-known SNP discovery software tools (PolyPhred, novoSNP, Genalys and Mutation Surveyor) using fluorescence based chromatograms from 15 human genes. These chromatograms were obtained from sequencing 16 two-pooled DNA samples; a total of 32 individual DNA samples. In this comparison of automatic SNP detection tools, VarDetect achieved the highest detection efficiency.AvailabilityVarDetect is compatible with most major operating systems such as Microsoft Windows, Linux, and Mac OSX. The current version of VarDetect is freely available at http://www.biotec.or.th/GI/tools/vardetect.


BMC Bioinformatics | 2011

Study of large and highly stratified population datasets by combining iterative pruning principal component analysis and structure.

Tulaya Limpiti; Apichart Intarapanich; Anunchai Assawamakin; Philip J. Shaw; Pongsakorn Wangkumhang; Jittima Piriyapongsa; Chumpol Ngamphiw; Sissades Tongsima

BackgroundThe ever increasing sizes of population genetic datasets pose great challenges for population structure analysis. The Tracy-Widom (TW) statistical test is widely used for detecting structure. However, it has not been adequately investigated whether the TW statistic is susceptible to type I error, especially in large, complex datasets. Non-parametric, Principal Component Analysis (PCA) based methods for resolving structure have been developed which rely on the TW test. Although PCA-based methods can resolve structure, they cannot infer ancestry. Model-based methods are still needed for ancestry analysis, but they are not suitable for large datasets. We propose a new structure analysis framework for large datasets. This includes a new heuristic for detecting structure and incorporation of the structure patterns inferred by a PCA method to complement STRUCTURE analysis.ResultsA new heuristic called EigenDev for detecting population structure is presented. When tested on simulated data, this heuristic is robust to sample size. In contrast, the TW statistic was found to be susceptible to type I error, especially for large population samples. EigenDev is thus better-suited for analysis of large datasets containing many individuals, in which spurious patterns are likely to exist and could be incorrectly interpreted as population stratification. EigenDev was applied to the iterative pruning PCA (ipPCA) method, which resolves the underlying subpopulations. This subpopulation information was used to supervise STRUCTURE analysis to infer patterns of ancestry at an unprecedented level of resolution. To validate the new approach, a bovine and a large human genetic dataset (3945 individuals) were analyzed. We found new ancestry patterns consistent with the subpopulations resolved by ipPCA.ConclusionsThe EigenDev heuristic is robust to sampling and is thus superior for detecting structure in large datasets. The application of EigenDev to the ipPCA algorithm improves the estimation of the number of subpopulations and the individual assignment accuracy, especially for very large and complex datasets. Furthermore, we have demonstrated that the structure resolved by this approach complements parametric analysis, allowing a much more comprehensive account of population structure. The new version of the ipPCA software with EigenDev incorporated can be downloaded from http://www4a.biotec.or.th/GI/tools/ippca.

Collaboration


Dive into the Chumpol Ngamphiw's collaboration.

Top Co-Authors

Avatar

Sissades Tongsima

Thailand National Science and Technology Development Agency

View shared research outputs
Top Co-Authors

Avatar

Jittima Piriyapongsa

Thailand National Science and Technology Development Agency

View shared research outputs
Top Co-Authors

Avatar

Apichart Intarapanich

Thailand National Science and Technology Development Agency

View shared research outputs
Top Co-Authors

Avatar

Philip J. Shaw

Thailand National Science and Technology Development Agency

View shared research outputs
Top Co-Authors

Avatar

Supasak Kulawonganunchai

Thailand National Science and Technology Development Agency

View shared research outputs
Top Co-Authors

Avatar
Top Co-Authors

Avatar
Researchain Logo
Decentralizing Knowledge