
Nature Genetics | 2013

Systematic identification of trans eQTLs as putative drivers of known disease associations

Harm-Jan Westra; Marjolein J. Peters; Tonu Esko; Hanieh Yaghootkar; Johannes Kettunen; Mark W. Christiansen; Benjamin P. Fairfax; Katharina Schramm; Joseph E. Powell; Alexandra Zhernakova; Daria V. Zhernakova; Jan H. Veldink; Leonard H. van den Berg; Juha Karjalainen; Sebo Withoff; André G. Uitterlinden; Albert Hofman; Fernando Rivadeneira; Peter A. C. 't Hoen; Eva Reinmaa; Krista Fischer; Mari Nelis; Lili Milani; David Melzer; Luigi Ferrucci; Andrew Singleton; Dena Hernandez; Michael A. Nalls; Georg Homuth; Matthias Nauck

Identifying the downstream effects of disease-associated SNPs is challenging. To help overcome this problem, we performed expression quantitative trait locus (eQTL) meta-analysis in non-transformed peripheral blood samples from 5,311 individuals with replication in 2,775 individuals. We identified and replicated trans eQTLs for 233 SNPs (reflecting 103 independent loci) that were previously associated with complex traits at genome-wide significance. Some of these SNPs affect multiple genes in trans that are known to be altered in individuals with disease: rs4917014, previously associated with systemic lupus erythematosus (SLE), altered gene expression of C1QB and five type I interferon response genes, both hallmarks of SLE. DeepSAGE RNA sequencing showed that rs4917014 strongly alters the 3′ UTR levels of IKZF1 in cis, and chromatin immunoprecipitation and sequencing analysis of the trans-regulated genes implicated IKZF1 as the causal gene. Variants associated with cholesterol metabolism and type 1 diabetes showed similar phenomena, indicating that large-scale eQTL mapping provides insight into the downstream effects of many trait-associated variants.

Nature | 2014

Genetics of rheumatoid arthritis contributes to biology and drug discovery

Yukinori Okada; Di Wu; Gosia Trynka; Towfique Raj; Chikashi Terao; Katsunori Ikari; Yuta Kochi; Koichiro Ohmura; Akari Suzuki; Shinji Yoshida; Robert R. Graham; Arun Manoharan; Ward Ortmann; Tushar Bhangale; Joshua C. Denny; Robert J. Carroll; Anne E. Eyler; Jeffrey D. Greenberg; Joel M. Kremer; Dimitrios A. Pappas; Lei Jiang; Jian Yin; Lingying Ye; Ding Feng Su; Jian Yang; Gang Xie; E. Keystone; Harm-Jan Westra; Tonu Esko; Andres Metspalu

A major challenge in human genetics is to devise a systematic strategy to integrate disease-associated variants with diverse genomic and biological data sets to provide insight into disease pathogenesis and guide drug discovery for complex traits such as rheumatoid arthritis (RA). Here we performed a genome-wide association study meta-analysis in a total of >100,000 subjects of European and Asian ancestries (29,880 RA cases and 73,758 controls), by evaluating ∼10 million single-nucleotide polymorphisms. We discovered 42 novel RA risk loci at a genome-wide level of significance, bringing the total to 101 (refs 2, 3, 4). We devised an in silico pipeline using established bioinformatics methods based on functional annotation, cis-acting expression quantitative trait loci and pathway analyses—as well as novel methods based on genetic overlap with human primary immunodeficiency, haematological cancer somatic mutations and knockout mouse phenotypes—to identify 98 biological candidate genes at these 101 risk loci. We demonstrate that these genes are the targets of approved therapies for RA, and further suggest that drugs approved for other indications may be repurposed for the treatment of RA. Together, this comprehensive genetic study sheds light on fundamental genes, pathways and cell types that contribute to RA pathogenesis, and provides empirical evidence that the genetics of RA can provide important information for drug discovery.

Nature Communications | 2015

Novel loci affecting iron homeostasis and their effects in individuals at risk for hemochromatosis (vol 5, 4926, 2014)

Beben Benyamin; Tonu Esko; Janina S. Ried; Aparna Radhakrishnan; Sita H. Vermeulen; Michela Traglia; Martin Goegele; Denise Anderson; Linda Broer; Clara Podmore; Jian'an Luan; Zoltán Kutalik; Serena Sanna; Peter van der Meer; Toshiko Tanaka; Fudi Wang; Harm-Jan Westra; Lude Franke; Evelin Mihailov; Lili Milani; Jonas Haelldin; Juliane Winkelmann; Thomas Meitinger; Joachim Thiery; Annette Peters; Melanie Waldenberger; Augusto Rendon; Jennifer Jolley; Jennifer Sambrook; Lambertus A. Kiemeney

Corrigendum: Novel loci affecting iron homeostasis and their effects in individuals at risk for hemochromatosis

Nature Genetics | 2013

Identification of seven loci affecting mean telomere length and their association with disease

Veryan Codd; Christopher P. Nelson; Eva Albrecht; Massimo Mangino; Joris Deelen; Jessica L. Buxton; Jouke-Jan Hottenga; Krista Fischer; Tonu Esko; Ida Surakka; Linda Broer; Dale R. Nyholt; Irene Mateo Leach; Perttu Salo; Sara Hägg; Mary Matthews; Jutta Palmen; Giuseppe Danilo Norata; Paul F. O'Reilly; Danish Saleheen; Najaf Amin; Anthony J. Balmforth; Marian Beekman; Rudolf A. de Boer; Stefan Böhringer; Peter S. Braund; Paul R. Burton; Anton J. M. de Craen; Yanbin Dong; Konstantinos Douroudis

Interindividual variation in mean leukocyte telomere length (LTL) is associated with cancer and several age-associated diseases. We report here a genome-wide meta-analysis of 37,684 individuals with replication of selected variants in an additional 10,739 individuals. We identified seven loci, including five new loci, associated with mean LTL (P < 5 × 10−8). Five of the loci contain candidate genes (TERC, TERT, NAF1, OBFC1 and RTEL1) that are known to be involved in telomere biology. Lead SNPs at two loci (TERC and TERT) associate with several cancers and other diseases, including idiopathic pulmonary fibrosis. Moreover, a genetic risk score analysis combining lead variants at all 7 loci in 22,233 coronary artery disease cases and 64,762 controls showed an association of the alleles associated with shorter LTL with increased risk of coronary artery disease (21% (95% confidence interval, 5–35%) per standard deviation in LTL, P = 0.014). Our findings support a causal role of telomere-length variation in some age-related diseases.

Nature | 2010

A new highly penetrant form of obesity due to deletions on chromosome 16p11.2

Robin G. Walters; Sébastien Jacquemont; Armand Valsesia; A.J. de Smith; Danielle Martinet; Johanna C. Andersson; Mario Falchi; Fangfang Chen; Joris Andrieux; Stéphane Lobbens; Bruno Delobel; Fanny Stutzmann; J. S. El-Sayed Moustafa; Jean-Claude Chèvre; Cécile Lecoeur; Vincent Vatin; Sonia Bouquillon; Jessica L. Buxton; Odile Boute; M. Holder-Espinasse; Jean-Marie Cuisset; M.-P. Lemaitre; A.-E. Ambresin; A. Brioschi; M. Gaillard; V. Giusti; Florence Fellmann; Alessandra Ferrarini; Nouchine Hadjikhani; Dominique Campion

Obesity has become a major worldwide challenge to public health, owing to an interaction between the Western ‘obesogenic’ environment and a strong genetic contribution. Recent extensive genome-wide association studies (GWASs) have identified numerous single nucleotide polymorphisms associated with obesity, but these loci together account for only a small fraction of the known heritable component. Thus, the ‘common disease, common variant’ hypothesis is increasingly coming under challenge. Here we report a highly penetrant form of obesity, initially observed in 31 subjects who were heterozygous for deletions of at least 593 kilobases at 16p11.2 and whose ascertainment included cognitive deficits. Nineteen similar deletions were identified from GWAS data in 16,053 individuals from eight European cohorts. These deletions were absent from healthy non-obese controls and accounted for 0.7% of our morbid obesity cases (body mass index (BMI) ≥ 40 kg m−2 or BMI standard deviation score ≥ 4; P = 6.4 × 10−8, odds ratio 43.0), demonstrating the potential importance in common disease of rare variants with strong effects. This highlights a promising strategy for identifying missing heritability in obesity and other complex traits: cohorts with extreme phenotypes are likely to be enriched for rare variants, thereby improving power for their discovery. Subsequent analysis of the loci so identified may well reveal additional rare variants that further contribute to the missing heritability, as recently reported for SIM1 (ref. 3). The most productive approach may therefore be to combine the ‘power of the extreme’ in small, well-phenotyped cohorts, with targeted follow-up in case-control and population cohorts.

Nature Genetics | 2010

Common variants in KCNN3 are associated with lone atrial fibrillation

Patrick T. Ellinor; Kathryn L. Lunetta; Nicole L. Glazer; Arne Pfeufer; Alvaro Alonso; Mina K. Chung; Moritz F. Sinner; Paul I. W. de Bakker; Martina Mueller; Steven A. Lubitz; Ervin R. Fox; Dawood Darbar; Nicholas L. Smith; Jonathan D. Smith; Renate B. Schnabel; Elsayed Z. Soliman; Kenneth Rice; David R. Van Wagoner; Britt-M. Beckmann; Charlotte van Noord; Ke Wang; Georg Ehret; Jerome I. Rotter; Stanley L. Hazen; Gerhard Steinbeck; Albert V. Smith; Lenore J. Launer; Tamara B. Harris; Seiko Makino; Mari Nelis

Atrial fibrillation (AF) is the most common sustained arrhythmia. Previous studies have identified several genetic loci associated with typical AF. We sought to identify common genetic variants underlying lone AF. This condition affects a subset of individuals without overt heart disease and with an increased heritability of AF. We report a meta-analysis of genome-wide association studies conducted using 1,335 individuals with lone AF (cases) and 12,844 unaffected individuals (referents). Cases were obtained from the German AF Network, Heart and Vascular Health Study, the Atherosclerosis Risk in Communities Study, the Cleveland Clinic and Massachusetts General Hospital. We identified an association on chromosome 1q21 to lone AF (rs13376333, adjusted odds ratio = 1.56; P = 6.3 × 10−12), and we replicated this association in two independent cohorts with lone AF (overall combined odds ratio = 1.52, 95% CI 1.40–1.64; P = 1.83 × 10−21). rs13376333 is intronic to KCNN3, which encodes a potassium channel protein involved in atrial repolarization.

PLOS ONE | 2009

Genetic Structure of Europeans: A View from the North–East

Mari Nelis; Tonu Esko; Reedik Mägi; Fritz Zimprich; Alexander Zimprich; Draga Toncheva; Sena Karachanak; T. Piskackova; I. Balascak; Leena Peltonen; Eveliina Jakkula; Karola Rehnström; Mark Lathrop; Simon Heath; Pilar Galan; Stefan Schreiber; Thomas Meitinger; Arne Pfeufer; H-Erich Wichmann; Béla Melegh; Noémi Polgár; Daniela Toniolo; Paolo Gasparini; Pio D'Adamo; Janis Klovins; Liene Nikitina-Zake; Vaidutis Kučinskas; Jūratė Kasnauskienė; Jan Lubinski; Tadeusz Dębniak

Using principal component (PC) analysis, we studied the genetic constitution of 3,112 individuals from Europe as portrayed by more than 270,000 single nucleotide polymorphisms (SNPs) genotyped with the Illumina Infinium platform. In cohorts where the sample size was >100, one hundred randomly chosen samples were used for analysis to minimize the sample size effect, resulting in a total of 1,564 samples. This analysis revealed that the genetic structure of the European population correlates closely with geography. The first two PCs highlight the genetic diversity corresponding to the northwest to southeast gradient and position the populations according to their approximate geographic origin. The resulting genetic map forms a triangular structure with a) Finland, b) the Baltic region, Poland and Western Russia, and c) Italy as its vertexes, and with d) Central and Western Europe in its centre. Inter- and intra-population genetic differences were quantified by the inflation factor lambda (λ) (ranging from 1.00 to 4.21), fixation index (Fst) (ranging from 0.000 to 0.023), and by the number of markers exhibiting significant allele frequency differences in pair-wise population comparisons. The estimated lambda was used to assess the real diminishing impact on association statistics when two distinct populations are merged directly in an analysis. When the PC analysis was confined to the 1,019 Estonian individuals (0.1% of the Estonian population), a fine structure emerged that correlated with the geography of individual counties. With at least two cohorts available from several countries, genetic substructures were investigated in Czech, Finnish, German, Estonian and Italian populations. Together with previously published data, our results allow the creation of a comprehensive European genetic map that will greatly facilitate inter-population genetic studies including genome-wide association studies (GWAS).

Nature Genetics | 2015

Genetic variance estimation with imputed variants finds negligible missing heritability for human height and body mass index

Jian Yang; Andrew Bakshi; Zhihong Zhu; Gibran Hemani; Anna A. E. Vinkhuyzen; Sang Hong Lee; Matthew R. Robinson; John Perry; Ilja M. Nolte; Jana V. van Vliet-Ostaptchouk; Harold Snieder; Tonu Esko; Lili Milani; Reedik Mägi; Andres Metspalu; Anders Hamsten; Patrik K. E. Magnusson; Nancy L. Pedersen; Erik Ingelsson; Nicole Soranzo; Matthew C. Keller; Naomi R. Wray; Michael E. Goddard; Peter M. Visscher

We propose a method (GREML-LDMS) to estimate heritability for human complex traits in unrelated individuals using whole-genome sequencing data. We demonstrate using simulations based on whole-genome sequencing data that ∼97% and ∼68% of variation at common and rare variants, respectively, can be captured by imputation. Using the GREML-LDMS method, we estimate from 44,126 unrelated individuals that all ∼17 million imputed variants explain 56% (standard error (s.e.) = 2.3%) of variance for height and 27% (s.e. = 2.5%) of variance for body mass index (BMI), and we find evidence that height- and BMI-associated variants have been under natural selection. Considering the imperfect tagging of imputation and potential overestimation of heritability from previous family-based studies, heritability is likely to be 60–70% for height and 30–40% for BMI. Therefore, the missing heritability is small for both traits. For further discovery of genes associated with complex traits, a study design with SNP arrays followed by imputation is more cost-effective than whole-genome sequencing at current prices.

Molecular Psychiatry | 2012

Meta-analysis of genome-wide association studies for personality

M.H.M. de Moor; Paul T. Costa; Antonio Terracciano; Robert F. Krueger; E.J.C. de Geus; T Toshiko; Brenda W. J. H. Penninx; Tonu Esko; P. A. F. Madden; Jaime Derringer; Najaf Amin; Gonneke Willemsen; J.J. Hottenga; Marijn A. Distel; Manuela Uda; Serena Sanna; Philip Spinhoven; C. A. Hartman; Patrick F. Sullivan; Anu Realo; Jüri Allik; A. C. Heath; Michele L. Pergadia; Arpana Agrawal; Peng Lin; Richard A. Grucza; Teresa Nutile; Marina Ciullo; Dan Rujescu; Ina Giegling

Personality can be thought of as a set of characteristics that influence people's thoughts, feelings and behavior across a variety of settings. Variation in personality is predictive of many outcomes in life, including mental health. Here we report on a meta-analysis of genome-wide association (GWA) data for personality in 10 discovery samples (17,375 adults) and five in silico replication samples (3,294 adults). All participants were of European ancestry. Personality scores for Neuroticism, Extraversion, Openness to Experience, Agreeableness and Conscientiousness were based on the NEO Five-Factor Inventory. Genotype data of ∼2.4M single-nucleotide polymorphisms (SNPs; directly typed and imputed using HapMap data) were available. In the discovery samples, classical association analyses were performed under an additive model followed by meta-analysis using the weighted inverse variance method. Results showed genome-wide significance for Openness to Experience near the RASA1 gene on 5q14.3 (rs1477268 and rs2032794, P=2.8 × 10−8 and 3.1 × 10−8) and for Conscientiousness in the brain-expressed KATNAL2 gene on 18q21.1 (rs2576037, P=4.9 × 10−8). We further conducted a gene-based test that confirmed the association of KATNAL2 to Conscientiousness. In silico replication did not, however, show significant associations of the top SNPs with Openness and Conscientiousness, although the direction of effect of the KATNAL2 SNP on Conscientiousness was consistent in all replication samples. Larger scale GWA studies and alternative approaches are required for confirmation of KATNAL2 as a novel gene affecting Conscientiousness.
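
The weighted inverse variance method named in the abstract above can be illustrated with a minimal sketch. It assumes the standard fixed-effect formulation, in which each cohort's per-SNP effect estimate is weighted by the reciprocal of its squared standard error. It is not the authors' pipeline; the function name and the example numbers are hypothetical.

```python
import math


def inverse_variance_meta(betas, ses):
    """Fixed-effect inverse-variance weighted meta-analysis for one SNP.

    betas: per-cohort effect estimates (hypothetical inputs)
    ses:   per-cohort standard errors, all > 0
    Returns (pooled_beta, pooled_se, z, two_sided_p).
    """
    if not betas or len(betas) != len(ses):
        raise ValueError("betas and ses must be non-empty and of equal length")
    if any(se <= 0 for se in ses):
        raise ValueError("standard errors must be positive")

    # Each cohort is weighted by the inverse of its sampling variance.
    weights = [1.0 / (se * se) for se in ses]
    total_weight = sum(weights)

    pooled_beta = sum(w * b for w, b in zip(weights, betas)) / total_weight
    pooled_se = math.sqrt(1.0 / total_weight)

    # Two-sided p-value under a standard normal null.
    z = pooled_beta / pooled_se
    p = math.erfc(abs(z) / math.sqrt(2.0))
    return pooled_beta, pooled_se, z, p


if __name__ == "__main__":
    # Three made-up cohorts with different precision.
    beta, se, z, p = inverse_variance_meta([0.10, 0.30, 0.20], [0.10, 0.10, 0.05])
    print(f"beta={beta:.4f} se={se:.4f} z={z:.3f} p={p:.3g}")
```

Cohorts with smaller standard errors pull the pooled estimate toward their own value, and the pooled standard error is never larger than that of the most precise cohort.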

PLOS Genetics | 2013

Human Disease-Associated Genetic Variation Impacts Large Intergenic Non-Coding RNA Expression

Vinod Kumar; Harm-Jan Westra; Juha Karjalainen; Daria V. Zhernakova; Tonu Esko; Barbara Hrdlickova; Rodrigo Coutinho de Almeida; Alexandra Zhernakova; Eva Reinmaa; Urmo Võsa; Marten H. Hofker; Rudolf S. N. Fehrmann; Jingyuan Fu; Sebo Withoff; Andres Metspalu; Lude Franke; Cisca Wijmenga

Recently it has become clear that only a small percentage (7%) of disease-associated single nucleotide polymorphisms (SNPs) are located in protein-coding regions, while the remaining 93% are located in gene regulatory regions or in intergenic regions. Thus, the understanding of how genetic variations control the expression of non-coding RNAs (in a tissue-dependent manner) has far-reaching implications. We tested the association of SNPs with expression levels (eQTLs) of large intergenic non-coding RNAs (lincRNAs), using genome-wide gene expression and genotype data from five different tissues. We identified 112 cis-regulated lincRNAs, of which 45% could be replicated in an independent dataset. We observed that 75% of the SNPs affecting lincRNA expression (lincRNA cis-eQTLs) were specific to lincRNA alone and did not affect the expression of neighboring protein-coding genes. We show that this specific genotype-lincRNA expression correlation is tissue-dependent and that many of these lincRNA cis-eQTL SNPs are also associated with complex traits and diseases.
