Jong Kyoung Kim
Pohang University of Science and Technology
Network
Latest external collaboration on country level. Dive into details by clicking on the dots.
Publication
Featured researches published by Jong Kyoung Kim.
Nature Methods | 2013
Philip Brennecke; Simon Anders; Jong Kyoung Kim; Aleksandra A. Kolodziejczyk; Xiuwei Zhang; Valentina Proserpio; Bianka Baying; Vladimir Benes; Sarah A. Teichmann; John C. Marioni; Marcus G. Heisler
Single-cell RNA-seq can yield valuable insights about the variability within a population of seemingly homogeneous cells. We developed a quantitative statistical method to distinguish true biological variability from the high levels of technical noise in single-cell experiments. Our approach quantifies the statistical significance of observed cell-to-cell variability in expression strength on a gene-by-gene basis. We validate our approach using two independent data sets from Arabidopsis thaliana and Mus musculus.
Molecular Cell | 2015
Aleksandra A. Kolodziejczyk; Jong Kyoung Kim; Valentine Svensson; John C. Marioni; Sarah A. Teichmann
The differences between individual cells can have profound functional consequences, in both unicellular and multicellular organisms. Recently developed single-cell mRNA-sequencing methods enable unbiased, high-throughput, and high-resolution transcriptomic analysis of individual cells. This provides an additional dimension to transcriptomic information relative to traditional methods that profile bulk populations of cells. Already, single-cell RNA-sequencing methods have revealed new biology in terms of the composition of tissues, the dynamics of transcription, and the regulatory relationships between genes. Rapid technological developments at the level of cell capture, phenotyping, molecular biology, and bioinformatics promise an exciting future with numerous biological and medical applications.
Cell Stem Cell | 2015
Aleksandra A. Kolodziejczyk; Jong Kyoung Kim; Jason C.H. Tsang; Tomislav Ilicic; Johan Henriksson; Kedar Nath Natarajan; Alex Tuck; Xuefei Gao; Marc Bühler; Pentao Liu; John C. Marioni; Sarah A. Teichmann
Summary Embryonic stem cell (ESC) culture conditions are important for maintaining long-term self-renewal, and they influence cellular pluripotency state. Here, we report single cell RNA-sequencing of mESCs cultured in three different conditions: serum, 2i, and the alternative ground state a2i. We find that the cellular transcriptomes of cells grown in these conditions are distinct, with 2i being the most similar to blastocyst cells and including a subpopulation resembling the two-cell embryo state. Overall levels of intercellular gene expression heterogeneity are comparable across the three conditions. However, this masks variable expression of pluripotency genes in serum cells and homogeneous expression in 2i and a2i cells. Additionally, genes related to the cell cycle are more variably expressed in the 2i and a2i conditions. Mining of our dataset for correlations in gene expression allowed us to identify additional components of the pluripotency network, including Ptma and Zfp640, illustrating its value as a resource for future discovery.
The Plant Cell | 2008
Dong Wook Lee; Jong Kyoung Kim; Sumin Lee; Seungjin Choi; Sanguk Kim; Inhwan Hwang
The N-terminal transit peptides of nuclear-encoded plastid proteins are necessary and sufficient for their import into plastids, but the information encoded by these transit peptides remains elusive, as they have a high sequence diversity and lack consensus sequences or common sequence motifs. Here, we investigated the sequence information contained in transit peptides. Hierarchical clustering on transit peptides of 208 plastid proteins showed that the transit peptide sequences are grouped to multiple sequence subgroups. We selected representative proteins from seven of these multiple subgroups and confirmed that their transit peptide sequences are highly dissimilar. Protein import experiments revealed that each protein contained transit peptide–specific sequence motifs critical for protein import into chloroplasts. Bioinformatics analysis identified sequence motifs that were conserved among members of the identified subgroups. The sequence motifs identified by the two independent approaches were nearly identical or significantly overlapped. Furthermore, the accuracy of predicting a chloroplast protein was greatly increased by grouping the transit peptides into multiple sequence subgroups. Based on these data, we propose that the transit peptides are composed of multiple sequence subgroups that contain distinctive sequence motifs for chloroplast targeting.
Nature Communications | 2015
Jong Kyoung Kim; Aleksandra A. Kolodziejczyk; Tomislav Ilicic; Sarah A. Teichmann; John C. Marioni
Single-cell RNA-sequencing (scRNA-seq) facilitates identification of new cell types and gene regulatory networks as well as dissection of the kinetics of gene expression and patterns of allele-specific expression. However, to facilitate such analyses, separating biological variability from the high level of technical noise that affects scRNA-seq protocols is vital. Here we describe and validate a generative statistical model that accurately quantifies technical noise with the help of external RNA spike-ins. Applying our approach to investigate stochastic allele-specific expression in individual cells, we demonstrate that a large fraction of stochastic allele-specific expression can be explained by technical noise, especially for lowly and moderately expressed genes: we predict that only 17.8% of stochastic allele-specific expression patterns are attributable to biological noise with the remainder due to technical noise.
Genome Biology | 2013
Jong Kyoung Kim; John C. Marioni
BackgroundGenetically identical populations of cells grown in the same environmental condition show substantial variability in gene expression profiles. Although single-cell RNA-seq provides an opportunity to explore this phenomenon, statistical methods need to be developed to interpret the variability of gene expression counts.ResultsWe develop a statistical framework for studying the kinetics of stochastic gene expression from single-cell RNA-seq data. By applying our model to a single-cell RNA-seq dataset generated by profiling mouse embryonic stem cells, we find that the inferred kinetic parameters are consistent with RNA polymerase II binding and chromatin modifications. Our results suggest that histone modifications affect transcriptional bursting by modulating both burst size and frequency. Furthermore, we show that our model can be used to identify genes with slow promoter kinetics, which are important for probabilistic differentiation of embryonic stem cells.ConclusionsWe conclude that the proposed statistical model provides a flexible and efficient way to investigate the kinetics of transcription.
Cell Reports | 2014
Julian R. Peat; Wendy Dean; Stephen J. Clark; Felix Krueger; Sébastien A. Smallwood; Gabriella Ficz; Jong Kyoung Kim; John C. Marioni; Timothy A. Hore; Wolf Reik
Summary Fertilization triggers global erasure of paternal 5-methylcytosine as part of epigenetic reprogramming during the transition from gametic specialization to totipotency. This involves oxidation by TET3, but our understanding of its targets and the wider context of demethylation is limited to a small fraction of the genome. We employed an optimized bisulfite strategy to generate genome-wide methylation profiles of control and TET3-deficient zygotes, using SNPs to access paternal alleles. This revealed that in addition to pervasive removal from intergenic sequences and most retrotransposons, gene bodies constitute a major target of zygotic demethylation. Methylation loss is associated with zygotic genome activation and at gene bodies is also linked to increased transcriptional noise in early development. Our data map the primary contribution of oxidative demethylation to a subset of gene bodies and intergenic sequences and implicate redundant pathways at many loci. Unexpectedly, we demonstrate that TET3 activity also protects certain CpG islands against methylation buildup.
Pattern Recognition Letters | 2006
Jong Kyoung Kim; Gajendra P. S. Raghava; Sung Yang Bang; Seungjin Choi
Predicting the destination of a protein in a cell is important for annotating the function of the protein. Recent advances have allowed us to develop more accurate methods for predicting the subcellular localization of proteins. One of the most important factors for improving the accuracy of these methods is related to the introduction of new useful features for protein sequences. In this paper we present a new method for extracting appropriate features from the sequence data by computing pairwise sequence alignment scores. As a classifier, support vector machine (SVM) is used. The overall prediction accuracy evaluated by the jackknife validation technique reached 94.70% for the eukaryotic non-plant data set and 92.10% for the eukaryotic plant data set, which is the highest prediction accuracy among the methods reported so far with such data sets. Our experimental results confirm that our feature extraction method based on pairwise sequence alignment is useful for this classification problem.
PLOS ONE | 2009
Young Min Oh; Jong Kyoung Kim; Yongwook Choi; Seungjin Choi; Joo-Yeon Yoo
The comprehensive identification of functional transcription factor binding sites (TFBSs) is an important step in understanding complex transcriptional regulatory networks. This study presents a motif-based comparative approach, STAT-Finder, for identifying functional DNA binding sites of STAT3 transcription factor. STAT-Finder combines STAT-Scanner, which was designed to predict functional STAT TFBSs with improved sensitivity, and a motif-based alignment to minimize false positive prediction rates. Using two reference sets containing promoter sequences of known STAT3 target genes, STAT-Finder identified functional STAT3 TFBSs with enhanced prediction efficiency and sensitivity relative to other conventional TFBS prediction tools. In addition, STAT-Finder identified novel STAT3 target genes among a group of genes that are over-expressed in human cancer cells. The binding of STAT3 to the predicted TFBSs was also experimentally confirmed through chromatin immunoprecipitation. Our proposed method provides a systematic approach to the prediction of functional TFBSs that can be applied to other TFs.
Nucleic Acids Research | 2012
Young Min Oh; Jong Kyoung Kim; Seungjin Choi; Joo-Yeon Yoo
Accurate prediction of transcription factor binding sites (TFBSs) is a prerequisite for identifying cis-regulatory modules that underlie transcriptional regulatory circuits encoded in the genome. Here, we present a computational framework for detecting TFBSs, when multiple position weight matrices (PWMs) for a transcription factor are available. Grouping multiple PWMs of a transcription factor (TF) based on their sequence similarity improves the specificity of TFBS prediction, which was evaluated using multiple genome-wide ChIP-Seq data sets from 26 TFs. The Z-scores of the area under a receiver operating characteristic curve (AUC) values of 368 TFs were calculated and used to statistically identify co-occurring regulatory motifs in the TF bound ChIP loci. Motifs that are co-occurring along with the empirical bindings of E2F, JUN or MYC have been evaluated, in the basal or stimulated condition. Results prove our method can be useful to systematically identify the co-occurring motifs of the TF for the given conditions.