Dongsup Kim
KAIST
Network
Latest external collaboration on country level. Dive into details by clicking on the dots.
Publication
Featured researches published by Dongsup Kim.
Nature Biotechnology | 2010
Dong Uk Kim; Jacqueline Hayles; Dongsup Kim; Valerie Wood; Han Oh Park; Misun Won; Hyang Sook Yoo; Trevor Duhig; Miyoung Nam; Georgia Palmer; Sangjo Han; Linda Jeffery; Seung Tae Baek; Hyemi Lee; Young Sam Shim; Min-Ho Lee; Lila Kim; Kyung Sun Heo; Eun Joo Noh; Ah Reum Lee; Young Joo Jang; Kyung Sook Chung; Shin Jung Choi; Jo Young Park; Young Woo Park; Hwan Mook Kim; Song Kyu Park; Hae Joon Park; Eun Jung Kang; Hyong Bai Kim
We report the construction and analysis of 4,836 heterozygous diploid deletion mutants covering 98.4% of the fission yeast genome providing a tool for studying eukaryotic biology. Comprehensive gene dispensability comparisons with budding yeast—the only other eukaryote for which a comprehensive knockout library exists—revealed that 83% of single-copy orthologs in the two yeasts had conserved dispensability. Gene dispensability differed for certain pathways between the two yeasts, including mitochondrial translation and cell cycle checkpoint control. We show that fission yeast has more essential genes than budding yeast and that essential genes are more likely than nonessential genes to be present in a single copy, to be broadly conserved and to contain introns. Growth fitness analyses determined sets of haploinsufficient and haploproficient genes for fission yeast, and comparisons with budding yeast identified specific ribosomal proteins and RNA polymerase subunits, which may act more generally to regulate eukaryotic cell growth.
Journal of Bioinformatics and Computational Biology | 2003
Jinbo Xu; Ming Li; Dongsup Kim; Ying Xu
This paper presents a novel linear programming approach to do protein 3-dimensional (3D) structure prediction via threading. Based on the contact map graph of the protein 3D structure template, the protein threading problem is formulated as a large scale integer programming (IP) problem. The IP formulation is then relaxed to a linear programming (LP) problem, and then solved by the canonical branch-and-bound method. The final solution is globally optimal with respect to energy functions. In particular, our energy function includes pairwise interaction preferences and allowing variable gaps which are two key factors in making the protein threading problem NP-hard. A surprising result is that, most of the time, the relaxed linear programs generate integral solutions directly. Our algorithm has been implemented as a software package RAPTOR-RApid Protein Threading by Operation Research technique. Large scale benchmark test for fold recognition shows that RAPTOR significantly outperforms other programs at the fold similarity level. The CAFASP3 evaluation, a blind and public test by the protein structure prediction community, ranks RAPTOR as top 1, among individual prediction servers, in terms of the recognition capability and alignment accuracy for Fold Recognition (FR) family targets. RAPTOR also performs very well in recognizing the hard Homology Modeling (HM) targets. RAPTOR was implemented at the University of Waterloo and it can be accessed at http://www.cs.uwaterloo.ca/~j3xu/RAPTOR_form.htm.
Neuroscience Research | 2010
Toshio Munesue; Shigeru Yokoyama; Kazuhiko Nakamura; Ayyappan Anitha; Kazuo Yamada; Kenshi Hayashi; Tomoya Asaka; Hong-Xiang Liu; Duo Jin; Keita Koizumi; Mohammad Saharul Islam; Jian Jun Huang; Wen Jie Ma; Uh Hyun Kim; Sun Jun Kim; Keunwan Park; Dongsup Kim; Mitsuru Kikuchi; Yasuki Ono; Hideo Nakatani; Shiro Suda; Taishi Miyachi; Hirokazu Hirai; Alla B. Salmina; Yu A. Pichugina; Andrei A. Soumarokov; Nori Takei; Norio Mori; Masatsugu Tsujii; Toshiro Sugiyama
The neurobiological basis of autism spectrum disorder (ASD) remains poorly understood. Given the role of CD38 in social recognition through oxytocin (OT) release, we hypothesized that CD38 may play a role in the etiology of ASD. Here, we first examined the immunohistochemical expression of CD38 in the hypothalamus of post-mortem brains of non-ASD subjects and found that CD38 was colocalized with OT in secretory neurons. In studies of the association between CD38 and autism, we analyzed 10 single nucleotide polymorphisms (SNPs) and mutations of CD38 by re-sequencing DNAs mainly from a case-control study in Japan, and Caucasian cases mainly recruited to the Autism Genetic Resource Exchange (AGRE). The SNPs of CD38, rs6449197 (p<0.040) and rs3796863 (p<0.005) showed significant associations with a subset of ASD (IQ>70; designated as high-functioning autism (HFA)) in the U.S. 104 AGRE family trios, but not with Japanese 188 HFA subjects. A mutation that caused tryptophan to replace arginine at amino acid residue 140 (R140W; (rs1800561, 4693C>T)) was found in 0.6-4.6% of the Japanese population and was associated with ASD in the smaller case-control study. The SNP was clustered in pedigrees in which the fathers and brothers of T-allele-carrier probands had ASD or ASD traits. In this cohort OT plasma levels were lower in subjects with the T allele than in those without. One proband with the T allele who was taking nasal OT spray showed relief of symptoms. The two variant CD38 poloymorphysms tested may be of interest with regard of the pathophysiology of ASD.
Nucleic Acids Research | 2009
Kyu-il Cho; Dongsup Kim; Doheon Lee
Identifying features that effectively represent the energetic contribution of an individual interface residue to the interactions between proteins remains problematic. Here, we present several new features and show that they are more effective than conventional features. By combining the proposed features with conventional features, we develop a predictive model for interaction hot spots. Initially, 54 multifaceted features, composed of different levels of information including structure, sequence and molecular interaction information, are quantified. Then, to identify the best subset of features for predicting hot spots, feature selection is performed using a decision tree. Based on the selected features, a predictive model for hot spots is created using support vector machine (SVM) and tested on an independent test set. Our model shows better overall predictive accuracy than previous methods such as the alanine scanning methods Robetta and FOLDEF, and the knowledge-based method KFC. Subsequent analysis yields several findings about hot spots. As expected, hot spots have a larger relative surface area burial and are more hydrophobic than other residues. Unexpectedly, however, residue conservation displays a rather complicated tendency depending on the types of protein complexes, indicating that this feature is not good for identifying hot spots. Of the selected features, the weighted atomic packing density, relative surface area burial and weighted hydrophobicity are the top 3, with the weighted atomic packing density proving to be the most effective feature for predicting hot spots. Notably, we find that hot spots are closely related to π–related interactions, especially π · · · π interactions.
Proceedings of the National Academy of Sciences of the United States of America | 2012
Sang Chul Lee; Keunwan Park; Jieun Han; Joong-jae Lee; Hyun Jung Kim; Seungpyo Hong; Woosung Heu; Yu Jung Kim; Jae-Seok Ha; Seung-Goo Lee; Hae-Kap Cheong; Young Ho Jeon; Dongsup Kim; Hak-Sung Kim
Repeat proteins have recently been of great interest as potential alternatives to immunoglobulin antibodies due to their unique structural and biophysical features. We here present the development of a binding scaffold based on variable lymphocyte receptors, which are nonimmunoglobulin antibodies composed of Leucine-rich repeat modules in jawless vertebrates, by module engineering. A template scaffold was first constructed by joining consensus repeat modules between the N- and C-capping motifs of variable lymphocyte receptors. The N-terminal domain of the template scaffold was redesigned based on the internalin-B cap by analyzing the modular similarity between the respective repeat units using a computational approach. The newly designed scaffold, termed “Repebody,” showed a high level of soluble expression in bacteria, displaying high thermodynamic and pH stabilities. Ease of molecular engineering was shown by designing repebodies specific for myeloid differentiation protein-2 and hen egg lysozyme, respectively, by a rational approach. The crystal structures of designed repebodies were determined to elucidate the structural features and interaction interfaces. We demonstrate general applicability of the scaffold by selecting repebodies with different binding affinities for interleukin-6 using phage display.
Hepatology Research | 2011
Nury Kim; Hyemin Kim; Inkyung Jung; Yeji Kim; Dongsup Kim; Yong-Mahn Han
Aim: Human embryonic stem cells (hESCs) are able to self‐renew and differentiate into a variety of cell types. Although miRNAs have emerged as key regulators in the cellular process, a few studies have been reported about behaviors of miRNAs during differentiation of hESCs into a specialized cell type. Here, we demonstrate that different kinds of miRNAs may function in a lineage‐specific manner during the differentiation of human embryonic stem cells (hESCs).
BMC Bioinformatics | 2006
Sangjo Han; Dongsup Kim
BackgroundPrimer design is a critical step in all types of RT-PCR methods to ensure specificity and efficiency of a target amplicon. However, most traditional primer design programs suggest primers on a single template of limited genetic complexity. To provide researchers with a sufficient number of pre-designed specific RT-PCR primer pairs for whole genes in Arabidopsis, we aimed to construct a genome-wide primer-pair database.DescriptionWe considered the homogeneous physical and chemical properties of each primer (homogeneity) of a gene, non-specific binding against all other known genes (specificity), and other possible amplicons from its corresponding genomic DNA or similar cDNAs (additional information). Then, we evaluated the reliability of our database with selected primer pairs from 15 genes using conventional and real time RT-PCR.ConclusionApproximately 97% of 28,952 genes investigated were finally registered in AtRTPrimer. Unlike other freely available primer databases for Arabidopsis thaliana, AtRTPrimer provides a large number of reliable primer pairs for each gene so that researchers can perform various types of RT-PCR experiments for their specific needs. Furthermore, by experimentally evaluating our database, we made sure that our database provides good starting primer pairs for Arabidopsis researchers to perform various types of RT-PCR experiments.
Journal of Biological Chemistry | 2012
Seung-Kyoon Kim; Inkyung Jung; Hosuk Lee; Keunsoo Kang; Mirang Kim; Kwiwan Jeong; Chang Seob Kwon; Yong-Mahn Han; Yong Sung Kim; Dongsup Kim; Daeyoup Lee
Background: It remains unknown how Dot1 or the Dot1 complex specifically targets the transcribed regions. Results: A functional interaction between hDOT1L and RNAPII targets hDOT1L and subsequent H3K79 methylations to active genes. Conclusion: hDOT1L interacts with phosphorylated CTD of RNAPII. Significance: This represents novel mechanistic insight into the understanding of targeting and propagation of hDOT1L along gene transcription. Histone-modifying enzymes play a pivotal role in gene expression and repression. In human, DOT1L (Dot1-like) is the only known histone H3 lysine 79 methyltransferase. hDOT1L is associated with transcriptional activation, but the general mechanism connecting hDOT1L to active transcription remains largely unknown. Here, we report that hDOT1L interacts with the phosphorylated C-terminal domain of actively transcribing RNA polymerase II (RNAPII) through a region conserved uniquely in multicellular DOT1 proteins. Genome-wide profiling analyses indicate that the occupancy of hDOT1L largely overlaps with that of RNAPII at actively transcribed genes, especially surrounding transcriptional start sites, in embryonic carcinoma NCCIT cells. We also find that C-terminal domain binding or H3K79 methylations by hDOT1L is important for the expression of target genes such as NANOG and OCT4 and a marker for pluripotency in NCCIT cells. Our results indicate that a functional interaction between hDOT1L and RNAPII targets hDOT1L and subsequent H3K79 methylations to actively transcribed genes.
Proteins | 2008
Byung-chul Lee; Keunwan Park; Dongsup Kim
It is a common belief that some residues of a protein are more important than others. In some cases, point mutations of some residues make butterfly effect on the protein structure and function, but in other cases they do not. In addition, the residues important for the protein function tend to be not only conserved but also coevolved with other interacting residues in a protein. Motivated by these observations, the authors propose that there is a network composed of the residues, the residue–residue coevolution network (RRCN), where nodes are residues and links are set when the coevolutionary interaction strengths between residues are sufficiently large. The authors build the RRCN for the 44 diverse protein families. The interaction strengths are calculated by using McBASC algorithm. After constructing the RRCN, the authors identify residues that have high degree of connectivity (hub nodes), and residues that play a central role in network flow of information (CI nodes). The authors show that these residues are likely to be functionally important residues. Moreover, the CI nodes appear to be more relevant to the function than the hub nodes. Unlike other similar methods, the method described in this study is solely based on sequences. Therefore, the method can be applied to the function annotation of a wider range of proteins. Proteins 2008.
Bioinformatics | 2005
Sangjo Han; Byung-chul Lee; Seung Taek Yu; Chan-seok Jeong; Soyoung Lee; Dongsup Kim
MOTIVATION Currently, the most accurate fold-recognition method is to perform profile-profile alignments and estimate the statistical significances of those alignments by calculating Z-score or E-value. Although this scheme is reliable in recognizing relatively close homologs related at the family level, it has difficulty in finding the remote homologs that are related at the superfamily or fold level. RESULTS In this paper, we present an alternative method to estimate the significance of the alignments. The alignment between a query protein and a template of length n in the fold library is transformed into a feature vector of length n + 1, which is then evaluated by support vector machine (SVM). The output from SVM is converted to a posterior probability that a query sequence is related to a template, given SVM output. Results show that a new method shows significantly better performance than PSI-BLAST and profile-profile alignment with Z-score scheme. While PSI-BLAST and Z-score scheme detect 16 and 20% of superfamily-related proteins, respectively, at 90% specificity, a new method detects 46% of these proteins, resulting in more than 2-fold increase in sensitivity. More significantly, at the fold level, a new method can detect 14% of remotely related proteins at 90% specificity, a remarkable result considering the fact that the other methods can detect almost none at the same level of specificity.