Birte Kehr
deCODE genetics
Network
Latest external collaboration on country level. Dive into details by clicking on the dots.
Publication
Featured researches published by Birte Kehr.
Nature Communications | 2015
Simon N. Stacey; Hannes Helgason; Sigurjon A. Gudjonsson; Gudmar Thorleifsson; Florian Zink; Asgeir Sigurdsson; Birte Kehr; Julius Gudmundsson; Patrick Sulem; Bardur Sigurgeirsson; Kristrun R. Benediktsdottir; Kristin Thorisdottir; Rafn Ragnarsson; Victoria Fuentelsaz; Cristina Corredera; Yolanda Gilaberte; Matilde Grasa; Dolores Planelles; Onofre Sanmartín; Peter Rudnai; Eugene Gurzau; Kvetoslava Koppova; Bjørn A. Nexø; Anne Tjønneland; Kim Overvad; Jon G. Jonasson; Laufey Tryggvadottir; Hrefna Johannsdottir; Anna M. Kristinsdottir; Hreinn Stefansson
In an ongoing screen for DNA sequence variants that confer risk of cutaneous basal cell carcinoma (BCC), we conduct a genome-wide association study (GWAS) of 24,988,228 SNPs and small indels detected through whole-genome sequencing of 2,636 Icelanders and imputed into 4,572 BCC patients and 266,358 controls. Here we show the discovery of four new BCC susceptibility loci: 2p24 MYCN (rs57244888[C], OR=0.76, P=4.7 × 10−12), 2q33 CASP8-ALS2CR12 (rs13014235[C], OR=1.15, P=1.5 × 10−9), 8q21 ZFHX4 (rs28727938[G], OR=0.70, P=3.5 × 10−12) and 10p14 GATA3 (rs73635312[A], OR=0.74, P=2.4 × 10−16). Fine mapping reveals that two variants correlated with rs73635312[A] occur in conserved binding sites for the GATA3 transcription factor. In addition, expression microarrays and RNA-seq show that rs13014235[C] and a related SNP rs700635[C] are associated with expression of CASP8 splice variants in which sequences from intron 8 are retained.
Nature | 2017
Hákon Jónsson; Patrick Sulem; Birte Kehr; Snaedis Kristmundsdottir; Florian Zink; Eirikur Hjartarson; Marteinn T. Hardarson; Kristjan E. Hjorleifsson; Hannes P. Eggertsson; Sigurjon A. Gudjonsson; Lucas D. Ward; Gudny A. Arnadottir; Einar A. Helgason; Hannes Helgason; Arnaldur Gylfason; Adalbjorg Jonasdottir; Aslaug Jonasdottir; Thorunn Rafnar; Mike Frigge; Simon N. Stacey; Olafur T. Magnusson; Unnur Thorsteinsdottir; Gisli Masson; Augustine Kong; Bjarni V. Halldórsson; Agnar Helgason; Daniel F. Gudbjartsson; Kari Stefansson
The characterization of mutational processes that generate sequence diversity in the human genome is of paramount importance both to medical genetics and to evolutionary studies. To understand how the age and sex of transmitting parents affect de novo mutations, here we sequence 1,548 Icelanders, their parents, and, for a subset of 225, at least one child, to 35× genome-wide coverage. We find 108,778 de novo mutations, both single nucleotide polymorphisms and indels, and determine the parent of origin of 42,961. The number of de novo mutations from mothers increases by 0.37 per year of age (95% CI 0.32–0.43), a quarter of the 1.51 per year from fathers (95% CI 1.45–1.57). The number of clustered mutations increases faster with the mother’s age than with the father’s, and the genomic span of maternal de novo mutation clusters is greater than that of paternal ones. The types of de novo mutation from mothers change substantially with age, with a 0.26% (95% CI 0.19–0.33%) decrease in cytosine–phosphate–guanine to thymine–phosphate–guanine (CpG>TpG) de novo mutations and a 0.33% (95% CI 0.28–0.38%) increase in C>G de novo mutations per year, respectively. Remarkably, these age-related changes are not distributed uniformly across the genome. A striking example is a 20 megabase region on chromosome 8p, with a maternal C>G mutation rate that is up to 50-fold greater than the rest of the genome. The age-related accumulation of maternal non-crossover gene conversions also mostly occurs within these regions. Increased sequence diversity and linkage disequilibrium of C>G variants within regions affected by excess maternal mutations indicate that the underlying mutational process has persisted in humans for thousands of years. Moreover, the regional excess of C>G variation in humans is largely shared by chimpanzees, less by gorillas, and is almost absent from orangutans. This demonstrates that sequence diversity in humans results from evolving interactions between age, sex, mutation type, and genomic location.
Scientific Data | 2017
Hakon Jonsson; Patrick Sulem; Birte Kehr; Snaedis Kristmundsdottir; Florian Zink; Eirikur Hjartarson; Marteinn T. Hardarson; Kristjan E. Hjorleifsson; Hannes P. Eggertsson; Sigurjon A. Gudjonsson; Lucas D. Ward; Gudny A. Arnadottir; Einar A. Helgason; Hannes Helgason; Arnaldur Gylfason; Adalbjorg Jonasdottir; Aslaug Jonasdottir; Thorunn Rafnar; Søren Besenbacher; Michael L. Frigge; Simon N. Stacey; Olafur T. Magnusson; Unnur Thorsteinsdottir; Gisli Masson; Augustine Kong; Bjarni V. Halldórsson; Agnar Helgason; Daniel F. Gudbjartsson; Kari Stefansson
Understanding of sequence diversity is the cornerstone of analysis of genetic disorders, population genetics, and evolutionary biology. Here, we present an update of our sequencing set to 15,220 Icelanders who we sequenced to an average genome-wide coverage of 34X. We identified 39,020,168 autosomal variants passing GATK filters: 31,079,378 SNPs and 7,940,790 indels. Calling de novo mutations (DNMs) is a formidable challenge given the high false positive rate in sequencing datasets relative to the mutation rate. Here we addressed this issue by using segregation of alleles in three-generation families. Using this transmission assay, we controlled the false positive rate and identified 108,778 high quality DNMs. Furthermore, we used our extended family structure and read pair tracing of DNMs to a panel of phased SNPs, to determine the parent of origin of 42,961 DNMs.
Nature Communications | 2017
Sigurdis Haraldsdottir; Thorunn Rafnar; Wendy L. Frankel; Sylvia Einarsdottir; Asgeir Sigurdsson; Heather Hampel; Petur Snaebjornsson; Gisli Masson; Daniel Weng; Reynir Arngrimsson; Birte Kehr; Ahmet Yilmaz; Stefan Haraldsson; Patrick Sulem; Tryggvi Stefansson; Peter G. Shields; Fridbjorn Sigurdsson; Tanios Bekaii-Saab; Pall Helgi Möller; Kristin Alexiusdottir; Megan P. Hitchins; Colin C. Pritchard; Albert de la Chapelle; Jon G. Jonasson; Richard M. Goldberg; Kari Stefansson
Lynch syndrome, caused by germline mutations in the mismatch repair genes, is associated with increased cancer risk. Here using a large whole-genome sequencing data bank, cancer registry and colorectal tumour bank we determine the prevalence of Lynch syndrome, associated cancer risks and pathogenicity of several variants in the Icelandic population. We use colorectal cancer samples from 1,182 patients diagnosed between 2000–2009. One-hundred and thirty-two (11.2%) tumours are mismatch repair deficient per immunohistochemistry. Twenty-one (1.8%) have Lynch syndrome while 106 (9.0%) have somatic hypermethylation or mutations in the mismatch repair genes. The population prevalence of Lynch syndrome is 0.442%. We discover a translocation disrupting MLH1 and three mutations in MSH6 and PMS2 that increase endometrial, colorectal, brain and ovarian cancer risk. We find thirteen mismatch repair variants of uncertain significance that are not associated with cancer risk. We find that founder mutations in MSH6 and PMS2 prevail in Iceland unlike most other populations.
Nature Genetics | 2016
Bjarni V. Halldórsson; Marteinn T. Hardarson; Birte Kehr; Unnur Styrkarsdottir; Arnaldur Gylfason; Gudmar Thorleifsson; Florian Zink; Adalbjorg Jonasdottir; Aslaug Jonasdottir; Patrick Sulem; Gisli Masson; Unnur Thorsteinsdottir; Agnar Helgason; Augustine Kong; Daniel F. Gudbjartsson; Kari Stefansson
Meiotic recombination involves a combination of gene conversion and crossover events that, along with mutations, produce germline genetic diversity. Here we report the discovery of 3,176 SNP and 61 indel gene conversions. Our estimate of the non-crossover (NCO) gene conversion rate (G) is 7.0 for SNPs and 5.8 for indels per megabase per generation, and the GC bias is 67.6%. For indels, we demonstrate a 65.6% preference for the shorter allele. NCO gene conversions from mothers are longer than those from fathers, and G is 2.17 times greater in mothers. Notably, G increases with the age of mothers, but not the age of fathers. A disproportionate number of NCO gene conversions in older mothers occur outside double-strand break (DSB) regions and in regions with relatively low GC content. This points to age-related changes in the mechanisms of meiotic gene conversion in oocytes.
Nature Genetics | 2017
Hannes P. Eggertsson; Hakon Jonsson; Snaedis Kristmundsdottir; Eirikur Hjartarson; Birte Kehr; Gisli Masson; Florian Zink; Kristjan E. Hjorleifsson; Aslaug Jonasdottir; Adalbjorg Jonasdottir; Ingileif Jonsdottir; Daniel F. Gudbjartsson; Páll Melsted; Kari Stefansson; Bjarni V. Halldórsson
A fundamental requirement for genetic studies is an accurate determination of sequence variation. While human genome sequence diversity is increasingly well characterized, there is a need for efficient ways to use this knowledge in sequence analysis. Here we present Graphtyper, a publicly available novel algorithm and software for discovering and genotyping sequence variants. Graphtyper realigns short-read sequence data to a pangenome, a variation-aware graph structure that encodes sequence variation within a population by representing possible haplotypes as graph paths. Our results show that Graphtyper is fast, highly scalable, and provides sensitive and accurate genotype calls. Graphtyper genotyped 89.4 million sequence variants in the whole genomes of 28,075 Icelanders using less than 100,000 CPU days, including detailed genotyping of six human leukocyte antigen (HLA) genes. We show that Graphtyper is a valuable tool in characterizing sequence variation in both small and population-scale sequencing studies.
Human Molecular Genetics | 2017
Eythor Bjornsson; Hannes Helgason; Gisli H. Halldorsson; Anna Helgadottir; Arnaldur Gylfason; Birte Kehr; Adalbjorg Jonasdottir; Aslaug Jonasdottir; Asgeir Sigurdsson; Asmundur Oddsson; Gudmar Thorleifsson; Olafur T. Magnusson; Solveig Gretarsdottir; Florian Zink; Ragnar P. Kristjansson; Margret Asgeirsdottir; Dorine W. Swinkels; Lambertus A. Kiemeney; Gudmundur I. Eyjolfsson; Olof Sigurdardottir; Gisli Masson; Isleifur Olafsson; Gudmundur Thorgeirsson; Hilma Holm; Unnur Thorsteinsdottir; Daniel F. Gudbjartsson; Patrick Sulem; Kari Stefansson
Common sequence variants at the haptoglobin gene (HP) have been associated with blood lipid levels. Through whole-genome sequencing of 8,453 Icelanders, we discovered a splice donor founder mutation in HP (NM_001126102.1:c.190 + 1G > C, minor allele frequency = 0.56%). This mutation occurs on the HP1 allele of the common copy number variant in HP and leads to a loss of function of HP1. It associates with lower levels of haptoglobin (P = 2.1 × 10-54), higher levels of non-high density lipoprotein cholesterol (β = 0.26 mmol/l, P = 2.6 × 10-9) and greater risk of coronary artery disease (odds ratio = 1.30, 95% confidence interval: 1.10-1.54, P = 0.0024). Through haplotype analysis and with RNA sequencing, we provide evidence of a causal relationship between one of the two haptoglobin isoforms, namely Hp1, and lower levels of non-HDL cholesterol. Furthermore, we show that the HP1 allele associates with various other quantitative biological traits.
Human Molecular Genetics | 2016
Simon N. Stacey; Birte Kehr; Julius Gudmundsson; Florian Zink; Aslaug Jonasdottir; Sigurjon A. Gudjonsson; Asgeir Sigurdsson; Bjarni V. Halldórsson; Bjarni A. Agnarsson; Kristrun R. Benediktsdottir; Katja K. Aben; Sita H. Vermeulen; Ruben G. Cremers; Angeles Panadero; Brian T. Helfand; Phillip R. Cooper; Jenny Donovan; Freddie C. Hamdy; Viorel Jinga; Ichiro Okamoto; Jon G. Jonasson; Laufey Tryggvadottir; Hrefna Johannsdottir; Anna M. Kristinsdottir; Gisli Masson; Olafur T. Magnusson; Paul D. Iordache; Agnar Helgason; Hannes Helgason; Patrick Sulem
Transcriptional and splicing anomalies have been observed in intron 8 of the CASP8 gene (encoding procaspase-8) in association with cutaneous basal-cell carcinoma (BCC) and linked to a germline SNP rs700635. Here, we show that the rs700635[C] allele, which is associated with increased risk of BCC and breast cancer, is protective against prostate cancer [odds ratio (OR) = 0.91, P = 1.0 × 10−6]. rs700635[C] is also associated with failures to correctly splice out CASP8 intron 8 in breast and prostate tumours and in corresponding normal tissues. Investigation of rs700635[C] carriers revealed that they have a human-specific short interspersed element-variable number of tandem repeat-Alu (SINE-VNTR-Alu), subfamily-E retrotransposon (SVA-E) inserted into CASP8 intron 8. The SVA-E shows evidence of prior activity, because it has transduced some CASP8 sequences during subsequent retrotransposition events. Whole-genome sequence (WGS) data were used to tag the SVA-E with a surrogate SNP rs1035142[T] (r2 = 0.999), which showed associations with both the splicing anomalies (P = 6.5 × 10−32) and with protection against prostate cancer (OR = 0.91, P = 3.8 × 10−7).
Bioinformatics | 2016
Snaedis Kristmundsdottir; Brynja D. Sigurpálsdóttir; Birte Kehr; Bjarni V. Halldórsson
Motivation: Microsatellites, also known as short tandem repeats (STRs), are tracts of repetitive DNA sequences containing motifs ranging from two to six bases. Microsatellites are one of the most abundant type of variation in the human genome, after single nucleotide polymorphisms (SNPs) and Indels. Microsatellite analysis has a wide range of applications, including medical genetics, forensics and construction of genetic genealogy. However, microsatellite variations are rarely considered in whole‐genome sequencing studies, in large due to a lack of tools capable of analyzing them. Results: Here we present a microsatellite genotyper, optimized for Illumina WGS data, which is both faster and more accurate than other methods previously presented. There are two main ingredients to our improvements. First we reduce the amount of sequencing data necessary for creating microsatellite profiles by using previously aligned sequencing data. Second, we use population information to train microsatellite and individual specific error profiles. By comparing our genotyping results to genotypes generated by capillary electrophoresis we show that our error rates are 50% lower than those of lobSTR, another program specifically developed to determine microsatellite genotypes. Availability and Implementation: Source code is available on Github: https://github.com/DecodeGenetics/popSTR Contact: [email protected] or [email protected]
Nature Genetics | 2017
Birte Kehr; Anna Helgadottir; Páll Melsted; Hákon Jónsson; Hannes Helgason; Adalbjorg Jonasdottir; Aslaug Jonasdottir; Asgeir Sigurdsson; Arnaldur Gylfason; Gisli H. Halldorsson; Snaedis Kristmundsdottir; Gudmundur Thorgeirsson; Isleifur Olafsson; Hilma Holm; Unnur Thorsteinsdottir; Patrick Sulem; Agnar Helgason; Daniel F. Gudbjartsson; Bjarni V. Halldórsson; Kari Stefansson
Genomes usually contain some non-repetitive sequences that are missing from the reference genome and occur only in a population subset. Such non-repetitive, non-reference (NRNR) sequences have remained largely unexplored in terms of their characterization and downstream analyses. Here we describe 3,791 breakpoint-resolved NRNR sequence variants called using PopIns from whole-genome sequence data of 15,219 Icelanders. We found that over 95% of the 244 NRNR sequences that are 200 bp or longer are present in chimpanzees, indicating that they are ancestral. Furthermore, 149 variant loci are in linkage disequilibrium (r2 > 0.8) with a genome-wide association study (GWAS) catalog marker, suggesting disease relevance. Additionally, we report an association (P = 3.8 × 10−8, odds ratio (OR) = 0.92) with myocardial infarction (23,360 cases, 300,771 controls) for a 766-bp NRNR sequence variant. Our results underline the importance of including variation of all complexity levels when searching for variants that associate with disease.