Insuk Lee | Researchain

Archive Network Publication Hotspot Collaboration

Network

Latest external collaboration on country level. Dive into details by clicking on the dots.

Explore More

Hotspot

Dive into the research topics where Insuk Lee is active.

Explore More

Publication

Featured researches published by Insuk Lee.

PLOS Genetics | 2010

Characterising and Predicting Haploinsufficiency in the Human Genome

Ni Huang; Insuk Lee; Edward M. Marcotte

Haploinsufficiency, wherein a single functional copy of a gene is insufficient to maintain normal function, is a major cause of dominant disease. Human disease studies have identified several hundred haploinsufficient (HI) genes. We have compiled a map of 1,079 haplosufficient (HS) genes by systematic identification of genes unambiguously and repeatedly compromised by copy number variation among 8,458 apparently healthy individuals and contrasted the genomic, evolutionary, functional, and network properties between these HS genes and known HI genes. We found that HI genes are typically longer and have more conserved coding sequences and promoters than HS genes. HI genes exhibit higher levels of expression during early development and greater tissue specificity. Moreover, within a probabilistic human functional interaction network HI genes have more interaction partners and greater network proximity to other known HI genes. We built a predictive model on the basis of these differences and annotated 12,443 genes with their predicted probability of being haploinsufficient. We validated these predictions of haploinsufficiency by demonstrating that genes with a high predicted probability of exhibiting haploinsufficiency are enriched among genes implicated in human dominant diseases and among genes causing abnormal phenotypes in heterozygous knockout mice. We have transformed these gene-based haploinsufficiency predictions into haploinsufficiency scores for genic deletions, which we demonstrate to better discriminate between pathogenic and benign deletions than consideration of the deletion size or numbers of genes deleted. These robust predictions of haploinsufficiency support clinical interpretation of novel loss-of-function variants and prioritization of variants and genes for follow-up studies.

Nature Biotechnology | 2010

Rational association of genes with traits using a genome-scale gene network for Arabidopsis thaliana

Insuk Lee; Bindu Ambaru; Pranjali Thakkar; Edward M. Marcotte; Seung Y. Rhee

We introduce a rational approach for associating genes with plant traits by combined use of a genome-scale functional network and targeted reverse genetic screening. We present a probabilistic network (AraNet) of functional associations among 19,647 (73%) genes of the reference flowering plant Arabidopsis thaliana. AraNet associations are predictive for diverse biological pathways, and outperform predictions derived only from literature-based protein interactions, achieving 21% precision for 55% of genes. AraNet prioritizes genes for limited-scale functional screening, resulting in a hit-rate tenfold greater than screens of random insertional mutants, when applied to early seedling development as a test case. By interrogating network neighborhoods, we identify AT1G80710 (now DROUGHT SENSITIVE 1; DRS1) and AT3G05090 (now LATERAL ROOT STIMULATOR 1; LRS1) as regulators of drought sensitivity and lateral root development, respectively. AraNet (http://www.functionalnet.org/aranet/) provides a resource for plant gene function identification and genetic dissection of plant traits.

Nature Genetics | 2008

A single gene network accurately predicts phenotypic effects of gene perturbation in Caenorhabditis elegans

Insuk Lee; Ben Lehner; Catriona Crombie; Wendy S.W. Wong; Andrew G. Fraser; Edward M. Marcotte

The fundamental aim of genetics is to understand how an organisms phenotype is determined by its genotype, and implicit in this is predicting how changes in DNA sequence alter phenotypes. A single network covering all the genes of an organism might guide such predictions down to the level of individual cells and tissues. To validate this approach, we computationally generated a network covering most C. elegans genes and tested its predictive capacity. Connectivity within this network predicts essentiality, identifying this relationship as an evolutionarily conserved biological principle. Critically, the network makes tissue-specific predictions—we accurately identify genes for most systematically assayed loss-of-function phenotypes, which span diverse cellular and developmental processes. Using the network, we identify 16 genes whose inactivation suppresses defects in the retinoblastoma tumor suppressor pathway, and we successfully predict that the dystrophin complex modulates EGF signaling. We conclude that an analogous network for human genes might be similarly predictive and thus facilitate identification of disease genes and rational therapeutic targets.

PLOS ONE | 2007

An Improved, Bias-Reduced Probabilistic Functional Gene Network of Baker's Yeast, Saccharomyces cerevisiae

Insuk Lee; Zhihua Li; Edward M. Marcotte

Background Probabilistic functional gene networks are powerful theoretical frameworks for integrating heterogeneous functional genomics and proteomics data into objective models of cellular systems. Such networks provide syntheses of millions of discrete experimental observations, spanning DNA microarray experiments, physical protein interactions, genetic interactions, and comparative genomics; the resulting networks can then be easily applied to generate testable hypotheses regarding specific gene functions and associations. Methodology/Principal Findings We report a significantly improved version (v. 2) of a probabilistic functional gene network [1] of the bakers yeast, Saccharomyces cerevisiae. We describe our optimization methods and illustrate their effects in three major areas: the reduction of functional bias in network training reference sets, the application of a probabilistic model for calculating confidences in pair-wise protein physical or genetic interactions, and the introduction of simple thresholds that eliminate many false positive mRNA co-expression relationships. Using the network, we predict and experimentally verify the function of the yeast RNA binding protein Puf6 in 60S ribosomal subunit biogenesis. Conclusions/Significance YeastNet v. 2, constructed using these optimizations together with additional data, shows significant reduction in bias and improvements in precision and recall, in total covering 102,803 linkages among 5,483 yeast proteins (95% of the validated proteome). YeastNet is available from http://www.yeastnet.org.

PLOS Genetics | 2011

Towards Establishment of a Rice Stress Response Interactome

Young-Su Seo; Mawsheng Chern; Laura E. Bartley; Muho Han; Ki-Hong Jung; Insuk Lee; Harkamal Walia; Todd Richter; Xia Xu; Peijian Cao; Wei Bai; Rajeshwari Ramanan; Fawn Amonpant; Loganathan Arul; Patrick E. Canlas; Randy Ruan; Chang-Jin Park; Xuewei Chen; Sohyun Hwang; Jong-Seong Jeon; Pamela C. Ronald

Rice (Oryza sativa) is a staple food for more than half the world and a model for studies of monocotyledonous species, which include cereal crops and candidate bioenergy grasses. A major limitation of crop production is imposed by a suite of abiotic and biotic stresses resulting in 30%–60% yield losses globally each year. To elucidate stress response signaling networks, we constructed an interactome of 100 proteins by yeast two-hybrid (Y2H) assays around key regulators of the rice biotic and abiotic stress responses. We validated the interactome using protein–protein interaction (PPI) assays, co-expression of transcripts, and phenotypic analyses. Using this interactome-guided prediction and phenotype validation, we identified ten novel regulators of stress tolerance, including two from protein classes not previously known to function in stress responses. Several lines of evidence support cross-talk between biotic and abiotic stress responses. The combination of focused interactome and systems analyses described here represents significant progress toward elucidating the molecular basis of traits of agronomic importance.

Proceedings of the National Academy of Sciences of the United States of America | 2011

Genetic dissection of the biotic stress response using a genome-scale gene network for rice

Insuk Lee; Young Su Seo; Dusica Coltrane; Sohyun Hwang; Taeyun Oh; Edward M. Marcotte; Pamela C. Ronald

Rice is a staple food for one-half the worlds population and a model for other monocotyledonous species. Thus, efficient approaches for identifying key genes controlling simple or complex traits in rice have important biological, agricultural, and economic consequences. Here, we report on the construction of RiceNet, an experimentally tested genome-scale gene network for a monocotyledonous species. Many different datasets, derived from five different organisms including plants, animals, yeast, and humans, were evaluated, and 24 of the most useful were integrated into a statistical framework that allowed for the prediction of functional linkages between pairs of genes. Genes could be linked to traits by using guilt-by-association, predicting gene attributes on the basis of network neighbors. We applied RiceNet to an important agronomic trait, the biotic stress response. Using network guilt-by-association followed by focused protein–protein interaction assays, we identified and validated, in planta, two positive regulators, LOC_Os01g70580 (now Regulator of XA21; ROX1) and LOC_Os02g21510 (ROX2), and one negative regulator, LOC_Os06g12530 (ROX3). These proteins control resistance mediated by rice XA21, a pattern recognition receptor. We also showed that RiceNet can accurately predict gene function in another major monocotyledonous crop species, maize. RiceNet thus enables the identification of genes regulating important crop traits, facilitating engineering of pathways critical to crop productivity.

PLOS Biology | 2009

Rational extension of the ribosome biogenesis pathway using network-guided genetics.

Zhihua Li; Insuk Lee; Emily Moradi; Nai Jung Hung; Arlen W. Johnson; Edward M. Marcotte

Gene networks are an efficient route for associating candidate genes with biological processes. Here, networks are used to discover more than 15 new genes for ribosomal subunit maturation, rRNA processing, and ribosomal export from the nucleus.

Molecular and Cellular Biology | 2000

Saccharomyces cerevisiae RAI1 (YGL246c) Is Homologous to Human DOM3Z and Encodes a Protein That Binds the Nuclear Exoribonuclease Rat1p

Yang Xue; Xinxue Bai; Insuk Lee; George Kallstrom; Jennifer Hei-Ngam Ho; Justin T. Brown; Audrey Stevens; Arlen W. Johnson

ABSTRACT The RAT1 gene of Saccharomyces cerevisiaeencodes a 5′→3′ exoribonuclease which plays an essential role in yeast RNA degradation and/or processing in the nucleus. We have cloned a previously uncharacterized gene (YGL246c) that we refer to asRAI1 (Rat1p interacting protein 1). RAI1 is homologous to Caenorhabditis elegans DOM-3 and humanDOM3Z. Deletion of RAI1 confers a growth defect which can be complemented by an additional copy of RAT1 on a centromeric vector or by directing Xrn1p, the cytoplasmic homolog of Rat1p, to the nucleus through the addition of a nuclear targeting sequence. Deletion of RAI1 is synthetically lethal with therat1-1ts mutation and shows genetic interaction with a deletion of SKI2 but not XRN1. Polysome analysis of an rai1 deletion mutant indicated a defect in 60S biogenesis which was nearly fully reversed by high-copyRAT1. Northern blot analysis of rRNAs revealed thatrai1 is required for normal 5.8S processing. In the absence of RAI1, 5.8SL was the predominant form of 5.8S and there was an accumulation of 3′-extended forms but not 5′-extended species of 5.8S. In addition, a 27S pre-rRNA species accumulated in therai1 mutant. Thus, deletion of RAI1 affects both 5′ and 3′ processing reactions of 5.8S rRNA. Consistent with the in vivo data suggesting that RAI1 enhances RAT1function, purified Rai1p stabilized the in vitro exoribonuclease activity of Rat1p.

Scientific Reports | 2016

Systematic comparison of variant calling pipelines using gold standard personal exome variants

Sohyun Hwang; Eiru Kim; Insuk Lee; Edward M. Marcotte

The success of clinical genomics using next generation sequencing (NGS) requires the accurate and consistent identification of personal genome variants. Assorted variant calling methods have been developed, which show low concordance between their calls. Hence, a systematic comparison of the variant callers could give important guidance to NGS-based clinical genomics. Recently, a set of high-confident variant calls for one individual (NA12878) has been published by the Genome in a Bottle (GIAB) consortium, enabling performance benchmarking of different variant calling pipelines. Based on the gold standard reference variant calls from GIAB, we compared the performance of thirteen variant calling pipelines, testing combinations of three read aligners—BWA-MEM, Bowtie2, and Novoalign—and four variant callers—Genome Analysis Tool Kit HaplotypeCaller (GATK-HC), Samtools mpileup, Freebayes and Ion Proton Variant Caller (TVC), for twelve data sets for the NA12878 genome sequenced by different platforms including Illumina2000, Illumina2500, and Ion Proton, with various exome capture systems and exome coverage. We observed different biases toward specific types of SNP genotyping errors by the different variant callers. The results of our study provide useful guidelines for reliable variant identification from deep sequencing of personal genomes.

Genome Research | 2010

Predicting genetic modifier loci using functional gene networks

Insuk Lee; Ben Lehner; Tanya Vavouri; Junha Shin; Andrew G. Fraser; Edward M. Marcotte

Most phenotypes are genetically complex, with contributions from mutations in many different genes. Mutations in more than one gene can combine synergistically to cause phenotypic change, and systematic studies in model organisms show that these genetic interactions are pervasive. However, in human association studies such nonadditive genetic interactions are very difficult to identify because of a lack of statistical power--simply put, the number of potential interactions is too vast. One approach to resolve this is to predict candidate modifier interactions between loci, and then to specifically test these for associations with the phenotype. Here, we describe a general method for predicting genetic interactions based on the use of integrated functional gene networks. We show that in both Saccharomyces cerevisiae and Caenorhabditis elegans a single high-coverage, high-quality functional network can successfully predict genetic modifiers for the majority of genes. For C. elegans we also describe the construction of a new, improved, and expanded functional network, WormNet 2. Using this network we demonstrate how it is possible to rapidly expand the number of modifier loci known for a gene, predicting and validating new genetic interactions for each of three signal transduction genes. We propose that this approach, termed network-guided modifier screening, provides a general strategy for predicting genetic interactions. This work thus suggests that a high-quality integrated human gene network will provide a powerful resource for modifier locus discovery in many different diseases.

Explore More