Jinyoung Byun
Dartmouth College
Network
Latest external collaboration on country level. Dive into details by clicking on the dots.
Publication
Featured researches published by Jinyoung Byun.
Cancer Epidemiology, Biomarkers & Prevention | 2017
Christopher I. Amos; Joe Dennis; Zhaoming Wang; Jinyoung Byun; Fredrick R. Schumacher; Simon A. Gayther; Graham Casey; David J. Hunter; Thomas A. Sellers; Stephen B. Gruber; Alison M. Dunning; Kyriaki Michailidou; Laura Fachal; Kimberly F. Doheny; Amanda B. Spurdle; Yafang Li; Xiangjun Xiao; Jane Romm; Elizabeth W. Pugh; Gerhard A. Coetzee; Dennis J. Hazelett; Stig E. Bojesen; Charlisse F. Caga-anan; Christopher A. Haiman; Ahsan Kamal; Craig Luccarini; Daniel C. Tessier; Daniel Vincent; Francois Bacot; David Van Den Berg
Background: Common cancers develop through a multistep process often including inherited susceptibility. Collaboration among multiple institutions, and funding from multiple sources, has allowed the development of an inexpensive genotyping microarray, the OncoArray. The array includes a genome-wide backbone, comprising 230,000 SNPs tagging most common genetic variants, together with dense mapping of known susceptibility regions, rare variants from sequencing experiments, pharmacogenetic markers, and cancer-related traits. Methods: The OncoArray can be genotyped using a novel technology developed by Illumina to facilitate efficient genotyping. The consortium developed standard approaches for selecting SNPs for study, for quality control of markers, and for ancestry analysis. The array was genotyped at selected sites and with prespecified replicate samples to permit evaluation of genotyping accuracy among centers and by ethnic background. Results: The OncoArray consortium genotyped 447,705 samples. A total of 494,763 SNPs passed quality control steps with a sample success rate of 97% of the samples. Participating sites performed ancestry analysis using a common set of markers and a scoring algorithm based on principal components analysis. Conclusions: Results from these analyses will enable researchers to identify new susceptibility loci, perform fine-mapping of new or known loci associated with either single or multiple cancers, assess the degree of overlap in cancer causation and pleiotropic effects of loci that have been identified for disease-specific risk, and jointly model genetic, environmental, and lifestyle-related exposures. Impact: Ongoing analyses will shed light on etiology and risk assessment for many types of cancer. Cancer Epidemiol Biomarkers Prev; 26(1); 126–35. ©2016 AACR.
Genes and Immunity | 2015
Frederick W. Miller; Wei Vivien Chen; Terrance P. O'Hanlon; Robert G. Cooper; Jiri Vencovsky; Lisa G. Rider; Katalin Dankó; Lucy R. Wedderburn; Ingrid E. Lundberg; Lauren M. Pachman; Ann M. Reed; Steven R. Ytterberg; Leonid Padyukov; Albert Selva-O'Callaghan; Timothy R. D. J. Radstake; David A. Isenberg; Hector Chinoy; William Ollier; Paul Scheet; Bo Peng; Annette Lee; Jinyoung Byun; Janine A. Lamb; Peter K. Gregersen; Christopher I. Amos
Autoimmune muscle diseases (myositis) comprise a group of complex phenotypes influenced by genetic and environmental factors. To identify genetic risk factors in patients of European ancestry, we conducted a genome-wide association study (GWAS) of the major myositis phenotypes in a total of 1710 cases, which included 705 adult dermatomyositis, 473 juvenile dermatomyositis, 532 polymyositis and 202 adult dermatomyositis, juvenile dermatomyositis or polymyositis patients with anti-histidyl-tRNA synthetase (anti-Jo-1) autoantibodies, and compared them with 4724 controls. Single-nucleotide polymorphisms showing strong associations (P<5 × 10−8) in GWAS were identified in the major histocompatibility complex (MHC) region for all myositis phenotypes together, as well as for the four clinical and autoantibody phenotypes studied separately. Imputation and regression analyses found that alleles comprising the human leukocyte antigen (HLA) 8.1 ancestral haplotype (AH8.1) defined essentially all the genetic risk in the phenotypes studied. Although the HLA DRB1*03:01 allele showed slightly stronger associations with adult and juvenile dermatomyositis, and HLA B*08:01 with polymyositis and anti-Jo-1 autoantibody-positive myositis, multiple alleles of AH8.1 were required for the full risk effects. Our findings establish that alleles of the AH8.1 comprise the primary genetic risk factors associated with the major myositis phenotypes in geographically diverse Caucasian populations.
American Journal of Human Genetics | 2015
Dong Hai Xiong; Yian Wang; Elena Kupert; Claire L. Simpson; Susan M. Pinney; Colette Gaba; Diptasri Mandal; Ann G. Schwartz; Ping Yang; Mariza de Andrade; Claudio W. Pikielny; Jinyoung Byun; Yafang Li; Dwight Stambolian; Margaret R. Spitz; Yanhong Liu; Christopher I. Amos; Joan E. Bailey-Wilson; Marshall W. Anderson; Ming You
PARK2, a gene associated with Parkinson disease, is a tumor suppressor in human malignancies. Here, we show that c.823C>T (p.Arg275Trp), a germline mutation in PARK2, is present in a family with eight cases of lung cancer. The resulting amino acid change, p.Arg275Trp, is located in the highly conserved RING finger 1 domain of PARK2, which encodes an E3 ubiquitin ligase. Upon further analysis, the c.823C>T mutation was detected in three additional families affected by lung cancer. The effect size for PARK2 c.823C>T (odds ratio = 5.24) in white individuals was larger than those reported for variants from lung cancer genome-wide association studies. These data implicate this PARK2 germline mutation as a genetic susceptibility factor for lung cancer. Our results provide a rationale for further investigations of this specific mutation and gene for evaluation of the possibility of developing targeted therapies against lung cancer in individuals with PARK2 variants by compensating for the loss-of-function effect caused by the associated variation.
Arthritis & Rheumatism | 2017
Peter A. Merkel; Gang Xie; Paul A. Monach; Xuemei Ji; Dominic J. Ciavatta; Jinyoung Byun; Benjamin D. Pinder; Ai Zhao; Jinyi Zhang; Yohannes Tadesse; David C. Qian; Matthew T. Weirauch; Rajan P. Nair; A. Tsoi; Christian Pagnoux; Simon Carette; Sharon A. Chung; David Cuthbertson; John C. Davis; Paul F. Dellaripa; Lindsy Forbess; Ora Gewurz-Singer; Gary S. Hoffman; Nader Khalidi; Curry L. Koening; Carol A. Langford; Alfred Mahr; Carol A. McAlear; Larry W. Moreland; E. Philip Seo
To identify risk alleles relevant to the causal and biologic mechanisms of antineutrophil cytoplasmic antibody (ANCA)–associated vasculitis (AAV).
Cancer Epidemiology, Biomarkers & Prevention | 2016
David C. Qian; Younghun Han; Jinyoung Byun; Hae Ri Shin; Rayjean J. Hung; John R. McLaughlin; Maria Teresa Landi; Daniela Seminara; Christopher I. Amos
Background: Although genome-wide association studies (GWAS) have identified many genetic variants that are strongly associated with lung cancer, these variants have low penetrance and serve as poor predictors of lung cancer in individuals. We sought to increase the predictive value of germline variants by considering their cumulative effects in the context of biologic pathways. Methods: For individuals in the Environment and Genetics in Lung Cancer Etiology study (1,815 cases/1,971 controls), we computed pathway-level susceptibility effects as the sum of relevant SNP variant alleles weighted by their log-additive effects from a separate lung cancer GWAS meta-analysis (7,766 cases/37,482 controls). Logistic regression models based on age, sex, smoking, genetic variants, and principal components of pathway effects and pathway–smoking interactions were trained and optimized in cross-validation and further tested on an independent dataset (556 cases/830 controls). We assessed prediction performance using area under the receiver operating characteristic curve (AUC). Results: Compared with typical binomial prediction models that have epidemiologic predictors (AUC = 0.607) in addition to top GWAS variants (AUC = 0.617), our pathway-based smoking-interactive multinomial model significantly improved prediction performance in external validation (AUC = 0.656, P < 0.0001). Conclusions: Our biologically informed approach demonstrated a larger increase in AUC over nongenetic counterpart models relative to previous approaches that incorporate variants. Impact: This model is the first of its kind to evaluate lung cancer prediction using subtype-stratified genetic effects organized into pathways and interacted with smoking. We propose pathway–exposure interactions as a potentially powerful new contributor to risk inference. Cancer Epidemiol Biomarkers Prev; 25(8); 1208–15. ©2016 AACR.
BMC Genomics | 2014
Ivan P. Gorlov; Ji Yeon Yang; Jinyoung Byun; Christopher J. Logothetis; Olga Y. Gorlova; Kim Anh Do; Christopher I. Amos
BackgroundWhole-genome profiling of gene expression is a powerful tool for identifying cancer-associated genes. Genes differentially expressed between normal and tumorous tissues are usually considered to be cancer associated. We recently demonstrated that the analysis of interindividual variation in gene expression can be useful for identifying cancer associated genes. The goal of this study was to identify the best microarray data–derived predictor of known cancer associated genes.ResultsWe found that the traditional approach of identifying cancer genes—identifying differentially expressed genes—is not very efficient. The analysis of interindividual variation of gene expression in tumor samples identifies cancer-associated genes more effectively. The results were consistent across 4 major types of cancer: breast, colorectal, lung, and prostate. We used recently reported cancer-associated genes (2011–2012) for validation and found that novel cancer-associated genes can be best identified by elevated variance of the gene expression in tumor samples.ConclusionsThe observation that the high interindividual variation of gene expression in tumor tissues is the best predictor of cancer-associated genes is likely a result of tumor heterogeneity on gene level. Computer simulation demonstrates that in the case of heterogeneity, an assessment of variance in tumors provides a better identification of cancer genes than does the comparison of the expression in normal and tumor tissues. Our results thus challenge the current paradigm that comparing the mean expression between normal and tumorous tissues is the best approach to identifying cancer-associated genes; we found that the high interindividual variation in expression is a better approach, and that using variation would improve our chances of identifying cancer-associated genes.
Human Molecular Genetics | 2015
David C. Qian; Jinyoung Byun; Younghun Han; Casey S. Greene; John K. Field; Rayjean J. Hung; Yonathan Brhane; John R. McLaughlin; Gordon Fehringer; Maria Teresa Landi; Albert Rosenberger; Heike Bickeböller; Jyoti Malhotra; Angela Risch; Joachim Heinrich; David J. Hunter; Brian E. Henderson; Christopher A. Haiman; Fredrick R. Schumacher; Rosalind Eeles; Douglas F. Easton; Daniela Seminara; Christopher I. Amos
Results from genome-wide association studies (GWAS) have indicated that strong single-gene effects are the exception, not the rule, for most diseases. We assessed the joint effects of germline genetic variations through a pathway-based approach that considers the tissue-specific contexts of GWAS findings. From GWAS meta-analyses of lung cancer (12 160 cases/16 838 controls), breast cancer (15 748 cases/18 084 controls) and prostate cancer (14 160 cases/12 724 controls) in individuals of European ancestry, we determined the tissue-specific interaction networks of proteins expressed from genes that are likely to be affected by disease-associated variants. Reactome pathways exhibiting enrichment of proteins from each network were compared across the cancers. Our results show that pathways associated with all three cancers tend to be broad cellular processes required for growth and survival. Significant examples include the nerve growth factor (P = 7.86 × 10(-33)), epidermal growth factor (P = 1.18 × 10(-31)) and fibroblast growth factor (P = 2.47 × 10(-31)) signaling pathways. However, within these shared pathways, the genes that influence risk largely differ by cancer. Pathways found to be unique for a single cancer focus on more specific cellular functions, such as interleukin signaling in lung cancer (P = 1.69 × 10(-15)), apoptosis initiation by Bad in breast cancer (P = 3.14 × 10(-9)) and cellular responses to hypoxia in prostate cancer (P = 2.14 × 10(-9)). We present the largest comparative cross-cancer pathway analysis of GWAS to date. Our approach can also be applied to the study of inherited mechanisms underlying risk across multiple diseases in general.
Statistics in Medicine | 2013
Jinyoung Byun; Dejian Lai; Sheng Luo; Jan Risser; Betty Tung; Robert J. Hardy
The most common data structures in the biomedical studies have been matched or unmatched designs. Data structures resulting from a hybrid of the two may create challenges for statistical inferences. The question may arise whether to use parametric or nonparametric methods on the hybrid data structure. The Early Treatment for Retinopathy of Prematurity study was a multicenter clinical trial sponsored by the National Eye Institute. The design produced data requiring a statistical method of a hybrid nature. An infant in this multicenter randomized clinical trial had high-risk prethreshold retinopathy of prematurity that was eligible for treatment in one or both eyes at entry into the trial. During follow-up, recognition visual acuity was accessed for both eyes. Data from both eyes (matched) and from only one eye (unmatched) were eligible to be used in the trial. The new hybrid nonparametric method is a meta-analysis based on combining the Hodges-Lehmann estimates of treatment effects from the Wilcoxon signed rank and rank sum tests. To compare the new method, we used the classic meta-analysis with the t-test method to combine estimates of treatment effects from the paired and two sample t-tests. We used simulations to calculate the empirical size and power of the test statistics, as well as the bias, mean square and confidence interval width of the corresponding estimators. The proposed method provides an effective tool to evaluate data from clinical trials and similar comparative studies.
BMC Genomics | 2017
Jinyoung Byun; Younghun Han; Ivan P. Gorlov; Jonathan A. Busam; Michael F. Seldin; Christopher I. Amos
BackgroundAccurate inference of genetic ancestry is of fundamental interest to many biomedical, forensic, and anthropological research areas. Genetic ancestry memberships may relate to genetic disease risks. In a genome association study, failing to account for differences in genetic ancestry between cases and controls may also lead to false-positive results. Although a number of strategies for inferring and taking into account the confounding effects of genetic ancestry are available, applying them to large studies (tens thousands samples) is challenging. The goal of this study is to develop an approach for inferring genetic ancestry of samples with unknown ancestry among closely related populations and to provide accurate estimates of ancestry for application to large-scale studies.MethodsIn this study we developed a novel distance-based approach, Ancestry Inference using Principal component analysis and Spatial analysis (AIPS) that incorporates an Inverse Distance Weighted (IDW) interpolation method from spatial analysis to assign individuals to population memberships.ResultsWe demonstrate the benefits of AIPS in analyzing population substructure, specifically related to the four most commonly used tools EIGENSTRAT, STRUCTURE, fastSTRUCTURE, and ADMIXTURE using genotype data from various intra-European panels and European-Americans. While the aforementioned commonly used tools performed poorly in inferring ancestry from a large number of subpopulations, AIPS accurately distinguished variations between and within subpopulations.ConclusionsOur results show that AIPS can be applied to large-scale data sets to discriminate the modest variability among intra-continental populations as well as for characterizing inter-continental variation. The method we developed will protect against spurious associations when mapping the genetic basis of a disease. Our approach is more accurate and computationally efficient method for inferring genetic ancestry in the large-scale genetic studies.
Nature Communications | 2018
Aida Ferreiro-Iglesias; Corina Lesseur; James D. McKay; Rayjean J. Hung; Younghun Han; Xuchen Zong; David C. Christiani; Mattias Johansson; Xiangjun Xiao; Yafang Li; David C. Qian; Xuemei Ji; Geoffrey Liu; Neil E. Caporaso; Ghislaine Scelo; David Zaridze; Anush Mukeriya; Milica Kontic; Simona Ognjanovic; Jolanta Lissowska; Małgorzata Szołkowska; Beata Swiatkowska; Vladimir Janout; Ivana Holcatova; Ciprian Bolca; Milan Savic; Miodrag Ognjanovic; Stig E. Bojesen; Xifeng Wu; Demetrios Albanes
The basis for associations between lung cancer and major histocompatibility complex genes is not completely understood. Here the authors further consider genetic variation within the MHC region in lung cancer patients and identify independent associations within HLA genes that explain MHC lung cancer associations in Europeans and Asian populations.AbstractLung cancer has several genetic associations identified within the major histocompatibility complex (MHC); although the basis for these associations remains elusive. Here, we analyze MHC genetic variation among 26,044 lung cancer patients and 20,836 controls densely genotyped across the MHC, using the Illumina Illumina OncoArray or Illumina 660W SNP microarray. We impute sequence variation in classical HLA genes, fine-map MHC associations for lung cancer risk with major histologies and compare results between ethnicities. Independent and novel associations within HLA genes are identified in Europeans including amino acids in the HLA-B*0801 peptide binding groove and an independent HLA-DQB1*06 loci group. In Asians, associations are driven by two independent HLA allele sets that both increase risk in HLA-DQB1*0401 and HLA-DRB1*0701; the latter better represented by the amino acid Ala-104. These results implicate several HLA–tumor peptide interactions as the major MHC factor modulating lung cancer susceptibility.