Network


Latest external collaboration on country level. Dive into details by clicking on the dots.

Hotspot


Dive into the research topics where Goo Jun is active.

Publication


Featured researches published by Goo Jun.


Science | 2012

Evolution and Functional Impact of Rare Coding Variation from Deep Sequencing of Human Exomes

Jacob A. Tennessen; Abigail W. Bigham; Timothy D. O'Connor; Wenqing Fu; Eimear E. Kenny; Simon Gravel; Sean McGee; Ron Do; Xiaoming Liu; Goo Jun; Hyun Min Kang; Daniel M. Jordan; Suzanne M. Leal; Stacey Gabriel; Mark J. Rieder; Gonçalo R. Abecasis; David Altshuler; Deborah A. Nickerson; Eric Boerwinkle; Shamil R. Sunyaev; Carlos Bustamante; Michael J. Bamshad; Joshua M. Akey

A Deep Look Into Our Genes Recent debates have focused on the degree of genetic variation and its impact upon health at the genomic level in humans (see the Perspective by Casals and Bertranpetit). Tennessen et al. (p. 64, published online 17 May), looking at all of the protein-coding genes in the human genome, and Nelson et al. (p. 100, published online 17 May), looking at genes that encode drug targets, address this question through deep sequencing efforts on samples from multiple individuals. The findings suggest that most human variation is rare, not shared between populations, and that rare variants are likely to play a role in human health. Most functionally consequential variants in protein-coding genes are rare and, thus, difficult to find. As a first step toward understanding how rare variants contribute to risk for complex diseases, we sequenced 15,585 human protein-coding genes to an average median depth of 111× in 2440 individuals of European (n = 1351) and African (n = 1088) ancestry. We identified over 500,000 single-nucleotide variants (SNVs), the majority of which were rare (86% with a minor allele frequency less than 0.5%), previously unknown (82%), and population-specific (82%). On average, 2.3% of the 13,595 SNVs each person carried were predicted to affect protein function of ~313 genes per genome, and ~95.7% of SNVs predicted to be functionally important were rare. This excess of rare functional variants is due to the combined effects of explosive, recent accelerated population growth and weak purifying selection. Furthermore, we show that large sample sizes will be required to associate rare variants with complex traits.


Nature | 2012

Analysis of 6,515 exomes reveals the recent origin of most human protein-coding variants

Wenqing Fu; Timothy D. O'Connor; Goo Jun; Hyun Min Kang; Gonçalo R. Abecasis; Suzanne M. Leal; Stacey Gabriel; David Altshuler; Jay Shendure; Deborah A. Nickerson; Michael J. Bamshad; Joshua M. Akey

Establishing the age of each mutation segregating in contemporary human populations is important to fully understand our evolutionary history and will help to facilitate the development of new approaches for disease-gene discovery. Large-scale surveys of human genetic variation have reported signatures of recent explosive population growth, notable for an excess of rare genetic variants, suggesting that many mutations arose recently. To more quantitatively assess the distribution of mutation ages, we resequenced 15,336 genes in 6,515 individuals of European American and African American ancestry and inferred the age of 1,146,401 autosomal single nucleotide variants (SNVs). We estimate that approximately 73% of all protein-coding SNVs and approximately 86% of SNVs predicted to be deleterious arose in the past 5,000–10,000 years. The average age of deleterious SNVs varied significantly across molecular pathways, and disease genes contained a significantly higher proportion of recently arisen deleterious SNVs than other genes. Furthermore, European Americans had an excess of deleterious variants in essential and Mendelian disease genes compared to African Americans, consistent with weaker purifying selection due to the Out-of-Africa dispersal. Our results better delimit the historical details of human protein-coding variation, show the profound effect of recent human history on the burden of deleterious SNVs segregating in contemporary populations, and provide important practical information that can be used to prioritize variants in disease-gene discovery.


Nature | 2015

An integrated map of structural variation in 2,504 human genomes

Peter H. Sudmant; Tobias Rausch; Eugene J. Gardner; Robert E. Handsaker; Alexej Abyzov; John Huddleston; Zhang Y; Kai Ye; Goo Jun; Markus His Yang Fritz; Miriam K. Konkel; Ankit Malhotra; Adrian M. Stütz; Xinghua Shi; Francesco Paolo Casale; Jieming Chen; Fereydoun Hormozdiari; Gargi Dayama; Ken Chen; Maika Malig; Mark Chaisson; Klaudia Walter; Sascha Meiers; Seva Kashin; Erik Garrison; Adam Auton; Hugo Y. K. Lam; Xinmeng Jasmine Mu; Can Alkan; Danny Antaki

Structural variants are implicated in numerous diseases and make up the majority of varying nucleotides among human genomes. Here we describe an integrated set of eight structural variant classes comprising both balanced and unbalanced variants, which we constructed using short-read DNA sequencing data and statistically phased onto haplotype blocks in 26 human populations. Analysing this set, we identify numerous gene-intersecting structural variants exhibiting population stratification and describe naturally occurring homozygous gene knockouts that suggest the dispensability of a variety of human genes. We demonstrate that structural variants are enriched on haplotypes identified by genome-wide association studies and exhibit enrichment for expression quantitative trait loci. Additionally, we uncover appreciable levels of structural variant complexity at different scales, including genic loci subject to clusters of repeated rearrangement and complex structural variants with multiple breakpoints likely to have formed through individual mutational events. Our catalogue will enhance future studies into structural variant demography, functional impact and disease association.


The New England Journal of Medicine | 2014

Loss-of-Function Mutations in APOC3, Triglycerides, and Coronary Disease

Jacy R. Crosby; Gina M. Peloso; Paul L. Auer; David R. Crosslin; Nathan O. Stitziel; Leslie A. Lange; Yingchang Lu; Zheng-zheng Tang; He Zhang; George Hindy; Nicholas G. D. Masca; Kathleen Stirrups; Stavroula Kanoni; Ron Do; Goo Jun; Youna Hu; Hyun Min Kang; Chenyi Xue; Anuj Goel; Martin Farrall; Stefano Duga; Pier Angelica Merlini; Rosanna Asselta; Domenico Girelli; Nicola Martinelli; Wu Yin; Dermot F. Reilly; Elizabeth K. Speliotes; Caroline S. Fox; Kristian Hveem

BACKGROUND Plasma triglyceride levels are heritable and are correlated with the risk of coronary heart disease. Sequencing of the protein-coding regions of the human genome (the exome) has the potential to identify rare mutations that have a large effect on phenotype. METHODS We sequenced the protein-coding regions of 18,666 genes in each of 3734 participants of European or African ancestry in the Exome Sequencing Project. We conducted tests to determine whether rare mutations in coding sequence, individually or in aggregate within a gene, were associated with plasma triglyceride levels. For mutations associated with triglyceride levels, we subsequently evaluated their association with the risk of coronary heart disease in 110,970 persons. RESULTS An aggregate of rare mutations in the gene encoding apolipoprotein C3 (APOC3) was associated with lower plasma triglyceride levels. Among the four mutations that drove this result, three were loss-of-function mutations: a nonsense mutation (R19X) and two splice-site mutations (IVS2+1G→A and IVS3+1G→T). The fourth was a missense mutation (A43T). Approximately 1 in 150 persons in the study was a heterozygous carrier of at least one of these four mutations. Triglyceride levels in the carriers were 39% lower than levels in noncarriers (P<1×10(-20)), and circulating levels of APOC3 in carriers were 46% lower than levels in noncarriers (P=8×10(-10)). The risk of coronary heart disease among 498 carriers of any rare APOC3 mutation was 40% lower than the risk among 110,472 noncarriers (odds ratio, 0.60; 95% confidence interval, 0.47 to 0.75; P=4×10(-6)). CONCLUSIONS Rare mutations that disrupt APOC3 function were associated with lower levels of plasma triglycerides and APOC3. Carriers of these mutations were found to have a reduced risk of coronary heart disease. (Funded by the National Heart, Lung, and Blood Institute and others.).


Nature Genetics | 2013

Identification of a rare coding variant in complement 3 associated with age-related macular degeneration

Xiaowei Zhan; David E. Larson; Chaolong Wang; Daniel C. Koboldt; Yuri V. Sergeev; Robert S. Fulton; Lucinda Fulton; Catrina C. Fronick; Kari Branham; Jennifer L. Bragg-Gresham; Goo Jun; Youna Hu; Hyun Min Kang; Dajiang J. Liu; Mohammad Othman; Matthew Brooks; Rinki Ratnapriya; Alexis Boleda; Felix Grassmann; Claudia N. von Strachwitz; Lana M. Olson; Gabriëlle H.S. Buitendijk; Albert Hofman; Cornelia M. van Duijn; Valentina Cipriani; Anthony T. Moore; Humma Shahid; Yingda Jiang; Yvette P. Conley; Denise J. Morgan

Macular degeneration is a common cause of blindness in the elderly. To identify rare coding variants associated with a large increase in risk of age-related macular degeneration (AMD), we sequenced 2,335 cases and 789 controls in 10 candidate loci (57 genes). To increase power, we augmented our control set with ancestry-matched exome-sequenced controls. An analysis of coding variation in 2,268 AMD cases and 2,268 ancestry-matched controls identified 2 large-effect rare variants: previously described p.Arg1210Cys encoded in the CFH gene (case frequency (fcase) = 0.51%; control frequency (fcontrol) = 0.02%; odds ratio (OR) = 23.11) and newly identified p.Lys155Gln encoded in the C3 gene (fcase = 1.06%; fcontrol = 0.39%; OR = 2.68). The variants suggest decreased inhibition of C3 by complement factor H, resulting in increased activation of the alternative complement pathway, as a key component of disease biology.


BMC Proceedings | 2014

Data for Genetic Analysis Workshop 18: human whole genome sequence, blood pressure, and simulated phenotypes in extended pedigrees

Laura Almasy; Thomas D. Dyer; Juan Manuel Peralta; Goo Jun; Andrew R. Wood; Christian Fuchsberger; Marcio Almeida; Jack W. Kent; Sharon P. Fowler; Thomas W. Blackwell; Sobha Puppala; Satish Kumar; Joanne E. Curran; Donna M. Lehman; Gonçalo R. Abecasis; Ravindranath Duggirala; John Blangero

Genetic Analysis Workshop 18 (GAW18) focused on identification of genes and functional variants that influence complex phenotypes in human sequence data. Data for the workshop were donated by the T2D-GENES Consortium and included whole genome sequences for odd-numbered autosomes in 464 key individuals selected from 20 Mexican American families, a dense set of single-nucleotide polymorphisms in 959 individuals in these families, and longitudinal data on systolic and diastolic blood pressure measured at 1-4 examinations over a period of 20 years. Simulated phenotypes were generated based on the real sequence data and pedigree structures. In the design of the simulation model, gene expression measures from the San Antonio Family Heart Study (not distributed as part of the GAW18 data) were used to identify genes whose mRNA levels were correlated with blood pressure. Observed variants within these genes were designated as functional in the GAW18 simulation if they were nonsynonymous and predicted to have deleterious effects on protein function or if they were noncoding and associated with mRNA levels. Two simulated longitudinal phenotypes were modeled to have the same trait distributions as the real systolic and diastolic blood pressure data, with effects of age, sex, and medication use, including a genotype-medication interaction. For each phenotype, more than 1000 sequence variants in more than 200 genes present on the odd-numbered autosomes individually explained less than 0.01-2.78% of phenotypic variance. Cumulatively, variants in the most influential gene explained 7.79% of trait variance. An additional simulated phenotype, Q1, was designed to be correlated among family members but to not be associated with any sequence variants. Two hundred replicates of the phenotypes were simulated, with each including data for 849 individuals.


PLOS Genetics | 2015

Identification and Functional Characterization of G6PC2 Coding Variants Influencing Glycemic Traits Define an Effector Transcript at the G6PC2-ABCB11 Locus

Anubha Mahajan; Xueling Sim; Hui Jin Ng; Alisa K. Manning; Manuel A. Rivas; Heather M Highland; Adam E. Locke; Niels Grarup; Hae Kyung Im; Pablo Cingolani; Jason Flannick; Pierre Fontanillas; Christian Fuchsberger; Kyle J. Gaulton; Tanya M. Teslovich; N. William Rayner; Neil R. Robertson; Nicola L. Beer; Jana K. Rundle; Jette Bork-Jensen; Claes Ladenvall; Christine Blancher; David Buck; Gemma Buck; Noël P. Burtt; Stacey Gabriel; Anette P. Gjesing; Christopher J. Groves; Mette Hollensted; Jeroen R. Huyghe

Genome wide association studies (GWAS) for fasting glucose (FG) and insulin (FI) have identified common variant signals which explain 4.8% and 1.2% of trait variance, respectively. It is hypothesized that low-frequency and rare variants could contribute substantially to unexplained genetic variance. To test this, we analyzed exome-array data from up to 33,231 non-diabetic individuals of European ancestry. We found exome-wide significant (P<5×10-7) evidence for two loci not previously highlighted by common variant GWAS: GLP1R (p.Ala316Thr, minor allele frequency (MAF)=1.5%) influencing FG levels, and URB2 (p.Glu594Val, MAF = 0.1%) influencing FI levels. Coding variant associations can highlight potential effector genes at (non-coding) GWAS signals. At the G6PC2/ABCB11 locus, we identified multiple coding variants in G6PC2 (p.Val219Leu, p.His177Tyr, and p.Tyr207Ser) influencing FG levels, conditionally independent of each other and the non-coding GWAS signal. In vitro assays demonstrate that these associated coding alleles result in reduced protein abundance via proteasomal degradation, establishing G6PC2 as an effector gene at this locus. Reconciliation of single-variant associations and functional effects was only possible when haplotype phase was considered. In contrast to earlier reports suggesting that, paradoxically, glucose-raising alleles at this locus are protective against type 2 diabetes (T2D), the p.Val219Leu G6PC2 variant displayed a modest but directionally consistent association with T2D risk. Coding variant associations for glycemic traits in GWAS signals highlight PCSK1, RREB1, and ZHX3 as likely effector transcripts. These coding variant association signals do not have a major impact on the trait variance explained, but they do provide valuable biological insights.


eLife | 2015

Non-crossover gene conversions show strong GC bias and unexpected clustering in humans

Amy Williams; Giulio Genovese; Thomas D. Dyer; Nicolas Altemose; Katherine Truax; Goo Jun; Nick Patterson; Simon Myers; Joanne E. Curran; Ravindranath Duggirala; John Blangero; David Reich; Molly Przeworski

Although the past decade has seen tremendous progress in our understanding of fine-scale recombination, little is known about non-crossover (NCO) gene conversion. We report the first genome-wide study of NCO events in humans. Using SNP array data from 98 meioses, we identified 103 sites affected by NCO, of which 50/52 were confirmed in sequence data. Overlap with double strand break (DSB) hotspots indicates that most of the events are likely of meiotic origin. We estimate that a site is involved in a NCO at a rate of 5.9 × 10−6/bp/generation, consistent with sperm-typing studies, and infer that tract lengths span at least an order of magnitude. Observed NCO events show strong allelic bias at heterozygous AT/GC SNPs, with 68% (58–78%) transmitting GC alleles (p = 5 × 10−4). Strikingly, in 4 of 15 regions with resequencing data, multiple disjoint NCO tracts cluster in close proximity (∼20–30 kb), a phenomenon not previously seen in mammals. DOI: http://dx.doi.org/10.7554/eLife.04637.001


IEEE Transactions on Geoscience and Remote Sensing | 2011

Spatially Adaptive Classification of Land Cover With Remote Sensing Data

Goo Jun; Joydeep Ghosh

This paper proposes a novel framework called Gaussian process maximum likelihood for spatially adaptive classification of hyperspectral data. In hyperspectral images, spectral responses of land covers vary over space, and conventional classification algorithms that result in spatially invariant solutions are fundamentally limited. In the proposed framework, each band of a given class is modeled by a Gaussian random process indexed by spatial coordinates. These models are then used to characterize each land cover class at a given location by a multivariate Gaussian distribution with parameters adapted for that location. Experimental results show that the proposed method effectively captures the spatial variations of hyperspectral data, significantly outperforming a variety of other classification algorithms on three different hyperspectral data sets.


workshop on applications of computer vision | 2008

Tracking and Segmentation of Highway Vehicles in Cluttered and Crowded Scenes

Goo Jun; Jake K. Aggarwal; Muhittin Gökmen

Monitoring highway traffic is an important application of computer vision research. In this paper, we analyze congested highway situations where it is difficult to track individual vehicles in heavy traffic because vehicles either occlude each other or are connected together by shadow. Moreover, scenes from traffic monitoring videos are usually noisy due to weather conditions and/or video compression. We present a method that can separate occluded vehicles by tracking movements of feature points and assigning over-segmented image fragments to the motion vector that best represents the fragments movement. Experiments were conducted on traffic videos taken from highways in Turkey, and the proposed method can successfully separate vehicles in overpopulated and cluttered scenes.

Collaboration


Dive into the Goo Jun's collaboration.

Top Co-Authors

Avatar
Top Co-Authors

Avatar
Top Co-Authors

Avatar

Joydeep Ghosh

University of Texas at Austin

View shared research outputs
Top Co-Authors

Avatar
Top Co-Authors

Avatar
Top Co-Authors

Avatar

Jay Shendure

University of Washington

View shared research outputs
Top Co-Authors

Avatar
Top Co-Authors

Avatar
Top Co-Authors

Avatar
Top Co-Authors

Avatar

Marcio Almeida

Texas Biomedical Research Institute

View shared research outputs
Researchain Logo
Decentralizing Knowledge