Network


Latest external collaboration on country level. Dive into details by clicking on the dots.

Hotspot


Dive into the research topics where Heejong Sung is active.

Publication


Featured researches published by Heejong Sung.


BMC Medical Genomics | 2010

A combined genome-wide linkage and association approach to find susceptibility loci for platelet function phenotypes in European American and African American families with coronary artery disease

Rasika A. Mathias; Yoonhee Kim; Heejong Sung; Lisa R. Yanek; Vj Mantese; J Enrique Hererra-Galeano; Ingo Ruczinski; Alexander F. Wilson; Nauder Faraday; Lewis C. Becker; Diane M. Becker

BackgroundThe inability of aspirin (ASA) to adequately suppress platelet aggregation is associated with future risk of coronary artery disease (CAD). Heritability studies of agonist-induced platelet function phenotypes suggest that genetic variation may be responsible for ASA responsiveness. In this study, we leverage independent information from genome-wide linkage and association data to determine loci controlling platelet phenotypes before and after treatment with ASA.MethodsClinical data on 37 agonist-induced platelet function phenotypes were evaluated before and after a 2-week trial of ASA (81 mg/day) in 1231 European American and 846 African American healthy subjects with a family history of premature CAD. Principal component analysis was performed to minimize the number of independent factors underlying the covariance of these various phenotypes. Multi-point sib-pair based linkage analysis was performed using a microsatellite marker set, and single-SNP association tests were performed using markers from the Illumina 1 M genotyping chip from deCODE Genetics, Inc. All analyses were performed separately within each ethnic group.ResultsSeveral genomic regions appear to be linked to ASA response factors: a 10 cM region in African Americans on chromosome 5q11.2 had several STRs with suggestive (p-value < 7 × 10-4) and significant (p-value < 2 × 10-5) linkage to post aspirin platelet response to ADP, and ten additional factors had suggestive evidence for linkage (p-value < 7 × 10-4) to thirteen genomic regions. All but one of these factors were aspirin response variables. While the strength of genome-wide SNP association signals for factors showing evidence for linkage is limited, especially at the strict thresholds of genome-wide criteria (N = 9 SNPs for 11 factors), more signals were considered significant when the association signal was weighted by evidence for linkage (N = 30 SNPs).ConclusionsOur study supports the hypothesis that platelet phenotypes in response to ASA likely have genetic control and the combined approach of linkage and association offers an alternative approach to prioritizing regions of interest for subsequent follow-up.


BMC Proceedings | 2009

Evaluation of random forests performance for genome-wide association studies in the presence of interaction effects

Yoonhee Kim; Robert Wojciechowski; Heejong Sung; Rasika A. Mathias; Alison P. Klein; Rhoshel Lenroot; James D. Malley; Joan E. Bailey-Wilson

Random forests (RF) is one of a broad class of machine learning methods that are able to deal with large-scale data without model specification, which makes it an attractive method for genome-wide association studies (GWAS). The performance of RF and other association methods in the presence of interactions was evaluated using the simulated data from Genetic Analysis Workshop 16 Problem 3, with knowledge of the major causative markers, risk factors, and their interactions in the simulated traits. There was good power to detect the environmental risk factors using RF, trend tests, or regression analyses but the power to detect the effects of the causal markers was poor for all methods. The causal marker that had an interactive effect with smoking did show moderate evidence of association in the RF and regression analyses, suggesting that RF may perform well at detecting such interactions in larger, more highly powered datasets.


PLOS Genetics | 2014

Genetic modifiers of neurofibromatosis type 1-associated café-au-lait macule count identified using multi-platform analysis.

Alexander Pemov; Heejong Sung; Paula L. Hyland; Jennifer L. Sloan; Sarah L. Ruppert; Andrea Baldwin; Joseph Boland; Sara Bass; Hyo Jung Lee; Xijun Zhang; Nisc Comparative Sequencing Program; James C. Mullikin; Brigitte C. Widemann; Alexander F. Wilson; Douglas R. Stewart

Neurofibromatosis type 1 (NF1) is an autosomal dominant, monogenic disorder of dysregulated neurocutaneous tissue growth. Pleiotropy, variable expressivity and few NF1 genotype-phenotype correlates limit clinical prognostication in NF1. Phenotype complexity in NF1 is hypothesized to derive in part from genetic modifiers unlinked to the NF1 locus. In this study, we hypothesized that normal variation in germline gene expression confers risk for certain phenotypes in NF1. In a set of 79 individuals with NF1, we examined the association between gene expression in lymphoblastoid cell lines with NF1-associated phenotypes and sequenced select genes with significant phenotype/expression correlations. In a discovery cohort of 89 self-reported European-Americans with NF1 we examined the association between germline sequence variants of these genes with café-au-lait macule (CALM) count, a tractable, tumor-like phenotype in NF1. Two correlated, common SNPs (rs4660761 and rs7161) between DPH2 and ATP6V0B were significantly associated with the CALM count. Analysis with tiled regression also identified SNP rs4660761 as significantly associated with CALM count. SNP rs1800934 and 12 rare variants in the mismatch repair gene MSH6 were also associated with CALM count. Both SNPs rs7161 and rs4660761 (DPH2 and ATP6V0B) were highly significant in a mega-analysis in a combined cohort of 180 self-reported European-Americans; SNP rs1800934 (MSH6) was near-significant in a meta-analysis assuming dominant effect of the minor allele. SNP rs4660761 is predicted to regulate ATP6V0B, a gene associated with melanosome biology. Individuals with homozygous mutations in MSH6 can develop an NF1-like phenotype, including multiple CALMs. Through a multi-platform approach, we identified variants that influence NF1 CALM count.


Human Heredity | 2012

Intra-Familial Tests of Association between Familial Idiopathic Scoliosis and Linked Regions on 9q31.3-q34.3 and 16p12.3-q22.2

Nancy H. Miller; Cristina M. Justice; Beth Marosy; Kandice Swindle; Yoonhee Kim; Marie-Hélène Roy-Gagnon; Heejong Sung; Dana Behneman; Kimberly F. Doheny; Elizabeth W. Pugh; Alexander F. Wilson

Objective: Custom genotyping of markers in families with familial idiopathic scoliosis were used to fine-map candidate regions on chromosomes 9 and 16 in order to identify candidate genes that contribute to this disorder and prioritize them for next-generation sequence analysis. Methods: Candidate regions on 9q and 16p–16q, previously identified as linked to familial idiopathic scoliosis in a study of 202 families, were genotyped with a high-density map of single nucleotide polymorphisms. Tests of linkage for fine-mapping and intra-familial tests of association, including tiled regression, were performed on scoliosis as both a qualitative and quantitative trait. Results and Conclusions: Nominally significant linkage results were found for markers in both candidate regions. Results from intra-familial tests of association and tiled regression corroborated the linkage findings and identified possible candidate genes suitable for follow-up with next-generation sequencing in these same families. Candidate genes that met our prioritization criteria included FAM129B and CERCAM on chromosome 9 and SYT1, GNAO1, and CDH3 on chromosome 16.


BMC Proceedings | 2011

Comparison of results from tests of association in unrelated individuals with uncollapsed and collapsed sequence variants using tiled regression

Heejong Sung; Yoonhee Kim; Juanliang Cai; Cheryl D. Cropp; Claire L. Simpson; Qing Li; Brian C Perry; Alexa J.M. Sorant; Joan E. Bailey-Wilson; Alexander F. Wilson

Tiled regression is an approach designed to determine the set of independent genetic variants that contribute to the variation of a quantitative trait in the presence of many highly correlated variants. In this study, we evaluate the statistical properties of the tiled regression method using the Genetic Analysis Workshop 17 data in unrelated individuals for traits Q1, Q2, and Q4. To increase the power to detect rare variants, we use two methods to collapse rare variants and compare the results with those from the uncollapsed data. In addition, we compare the tiled regression method to traditional tests of association with and without collapsed rare variants. The results show that collapsing rare variants generally improves the power to detect associations regardless of method, although only variants with the largest allelic effects could be detected. However, for traditional simple linear regression, the average estimated type I error is dependent on the trait and varies by about three orders of magnitude. The estimated type I error rate is stable for tiled regression across traits.


BMC Proceedings | 2011

Old lessons learned anew: family-based methods for detecting genes responsible for quantitative and qualitative traits in the Genetic Analysis Workshop 17 mini-exome sequence data

Claire L. Simpson; Cristina M. Justice; Mera Krishnan; Robert Wojciechowski; Heejong Sung; Jerry Cai; Tiffany Green; Deyana D. Lewis; Dana Behneman; Alexander F. Wilson; Joan E. Bailey-Wilson

Family-based study designs are again becoming popular as new next-generation sequencing technologies make whole-exome and whole-genome sequencing projects economically and temporally feasible. Here we evaluate the statistical properties of linkage analyses and family-based tests of association for the Genetic Analysis Workshop 17 mini-exome sequence data. Based on our results, the linkage methods using relative pairs or nuclear families had low power, with the best results coming from variance components linkage analysis in nuclear families and Elston-Stewart model-based linkage analysis in extended pedigrees. For family-based tests of association, both ASSOC and ROMP performed well for genes with large effects, but ROMP had the advantage of not requiring parental genotypes in the analysis. For the linkage analyses we conclude that genome-wide significance levels appear to control type I error well but that “suggestive” significance levels do not. Methods that make use of the extended pedigrees are well powered to detect major loci segregating in the families even when there is substantial genetic heterogeneity and the trait is mainly polygenic. However, large numbers of such pedigrees will be necessary to detect all major loci. The family-based tests of association found the same major loci as the linkage analyses and detected low-frequency loci with moderate effect sizes, but control of type I error was not as stringent.


BMC Proceedings | 2011

Performance of random forests and logic regression methods using mini-exome sequence data

Yoonhee Kim; Qing Li; Cheryl D. Cropp; Heejong Sung; Juanliang Cai; Claire L. Simpson; Brian C Perry; Abhijit Dasgupta; James D. Malley; Alexander F. Wilson; Joan E. Bailey-Wilson

Machine learning approaches are an attractive option for analyzing large-scale data to detect genetic variants that contribute to variation of a quantitative trait, without requiring specific distributional assumptions. We evaluate two machine learning methods, random forests and logic regression, and compare them to standard simple univariate linear regression, using the Genetic Analysis Workshop 17 mini-exome data. We also apply these methods after collapsing multiple rare variants within genes and within gene pathways. Linear regression and the random forest method performed better when rare variants were collapsed based on genes or gene pathways than when each variant was analyzed separately. Logic regression performed better when rare variants were collapsed based on genes rather than on pathways.


Genetic Epidemiology | 2018

Genetic associations with childhood brain growth, defined in two longitudinal cohorts.

Eszter Szekely; Tae Hwi Schwantes-An; Cristina M. Justice; Jeremy A. Sabourin; Philip R. Jansen; Ryan L. Muetzel; Wendy Sharp; Henning Tiemeier; Heejong Sung; Tonya White; Alexander F. Wilson; Philip Shaw

Genome‐wide association studies (GWASs) are unraveling the genetics of adult brain neuroanatomy as measured by cross‐sectional anatomic magnetic resonance imaging (aMRI). However, the genetic mechanisms that shape childhood brain development are, as yet, largely unexplored. In this study we identify common genetic variants associated with childhood brain development as defined by longitudinal aMRI. Genome‐wide single nucleotide polymorphism (SNP) data were determined in two cohorts: one enriched for attention‐deficit/hyperactivity disorder (ADHD) (LONG cohort: 458 participants; 119 with ADHD) and the other from a population‐based cohort (Generation R: 257 participants). The growth of the brains major regions (cerebral cortex, white matter, basal ganglia, and cerebellum) and one region of interest (the right lateral prefrontal cortex) were defined on all individuals from two aMRIs, and a GWAS and a pathway analysis were performed. In addition, association between polygenic risk for ADHD and brain growth was determined for the LONG cohort. For white matter growth, GWAS meta‐analysis identified a genome‐wide significant intergenic SNP (rs12386571, P = 9.09 × 10−9), near AKR1B10. This gene is part of the aldo‐keto reductase superfamily and shows neural expression. No enrichment of neural pathways was detected and polygenic risk for ADHD was not associated with the brain growth phenotypes in the LONG cohort that was enriched for the diagnosis of ADHD. The study illustrates the use of a novel brain growth phenotype defined in vivo for further study.


BMC Proceedings | 2016

Type I error rates of rare single nucleotide variants are inflated in tests of association with non-normally distributed traits using simple linear regression methods.

Tae-Hwi Schwantes-An; Heejong Sung; Jeremy A. Sabourin; Cristina M. Justice; Alexa J.M. Sorant; Alexander F. Wilson

In this study, the effects of (a) the minor allele frequency of the single nucleotide variant (SNV), (b) the degree of departure from normality of the trait, and (c) the position of the SNVs on type I error rates were investigated in the Genetic Analysis Workshop (GAW) 19 whole exome sequence data. To test the distribution of the type I error rate, 5 simulated traits were considered: standard normal and gamma distributed traits; 2 transformed versions of the gamma trait (log10 and rank-based inverse normal transformations); and trait Q1 provided by GAW 19. Each trait was tested with 313,340 SNVs. Tests of association were performed with simple linear regression and average type I error rates were determined for minor allele frequency classes. Rare SNVs (minor allele frequency < 0.05) showed inflated type I error rates for non–normally distributed traits that increased as the minor allele frequency decreased. The inflation of average type I error rates increased as the significance threshold decreased. Normally distributed traits did not show inflated type I error rates with respect to the minor allele frequency for rare SNVs. There was no consistent effect of transformation on the uniformity of the distribution of the location of SNVs with a type I error.


bioinformatics and biomedicine | 2015

Tiled regression reduces type I error rates in tests of association of rare single nucleotide variants with non-normally distributed traits, compared with simple linear regression

Heejong Sung; Alexa J.M. Sorant; Jeremy A. Sabourin; Tae-Hwi Schwantes-An; Cristina M. Justice; Joan E. Bailey-Wilson; Alexander F. Wilson

The effects of the minor allele frequency of single nucleotide variants and the degree of departure from normality of a quantitative trait on type I error rates were evaluated using Genetic Analysis Workshop 17 mini-exome sequence data. Four simulated traits were generated: standard normal and gamma distributed traits and two transformations of the gamma distributed trait by log10 and rank-based inverse normal functions. Tiled regression was compared with simple linear regression. Average type I error rates were obtained for minor allele frequency classes. The distribution of the type I error rate for tiled regression analysis followed a pattern similar to that of simple linear regression analysis, but with much lower type I error.

Collaboration


Dive into the Heejong Sung's collaboration.

Top Co-Authors

Avatar

Alexander F. Wilson

National Institutes of Health

View shared research outputs
Top Co-Authors

Avatar

Joan E. Bailey-Wilson

National Institutes of Health

View shared research outputs
Top Co-Authors

Avatar

Cristina M. Justice

National Institutes of Health

View shared research outputs
Top Co-Authors

Avatar

Jeremy A. Sabourin

National Institutes of Health

View shared research outputs
Top Co-Authors

Avatar

Yoonhee Kim

National Institutes of Health

View shared research outputs
Top Co-Authors

Avatar

Alexa J.M. Sorant

National Institutes of Health

View shared research outputs
Top Co-Authors

Avatar

Cheryl D. Cropp

National Institutes of Health

View shared research outputs
Top Co-Authors

Avatar

Claire L. Simpson

National Institutes of Health

View shared research outputs
Top Co-Authors

Avatar

Brian C Perry

National Institutes of Health

View shared research outputs
Top Co-Authors

Avatar

Dana Behneman

National Institutes of Health

View shared research outputs
Researchain Logo
Decentralizing Knowledge