David van Enckevort
University Medical Center Groningen
Network
Latest external collaboration on country level. Dive into details by clicking on the dots.
Publication
Featured researches published by David van Enckevort.
Nature Genetics | 2014
Laurent C. Francioli; Androniki Menelaou; Sara L. Pulit; Freerk van Dijk; Pier Francesco Palamara; Clara C. Elbers; Pieter B. T. Neerincx; Kai Ye; Victor Guryev; Wigard P. Kloosterman; Patrick Deelen; Abdel Abdellaoui; Elisabeth M. van Leeuwen; Mannis van Oven; Martijn Vermaat; Mingkun Li; Jeroen F. J. Laros; Lennart C. Karssen; Alexandros Kanterakis; Najaf Amin; Jouke-Jan Hottenga; Eric-Wubbo Lameijer; Mathijs Kattenberg; Martijn Dijkstra; Heorhiy Byelas; Jessica van Setten; Barbera D. C. van Schaik; Jan Bot; Isaac J. Nijman; Ivo Renkens
Whole-genome sequencing enables complete characterization of genetic variation, but geographic clustering of rare alleles demands many diverse populations be studied. Here we describe the Genome of the Netherlands (GoNL) Project, in which we sequenced the whole genomes of 250 Dutch parent-offspring families and constructed a haplotype map of 20.4 million single-nucleotide variants and 1.2 million insertions and deletions. The intermediate coverage (∼13×) and trio design enabled extensive characterization of structural variation, including midsize events (30–500 bp) previously poorly catalogued and de novo mutations. We demonstrate that the quality of the haplotypes boosts imputation accuracy in independent samples, especially for lower frequency alleles. Population genetic analyses demonstrate fine-scale structure across the country and support multiple ancient migrations, consistent with historical changes in sea level and flooding. The GoNL Project illustrates how single-population whole-genome sequencing can provide detailed characterization of genetic variation and may guide the design of future population studies.
European Journal of Human Genetics | 2014
Dorret I. Boomsma; Cisca Wijmenga; Eline Slagboom; Morris A. Swertz; Lennart C. Karssen; Abdel Abdellaoui; Kai Ye; Victor Guryev; Martijn Vermaat; Freerk van Dijk; Laurent C. Francioli; Jouke-Jan Hottenga; Jeroen F. J. Laros; Qibin Li; Yingrui Li; Hongzhi Cao; Ruoyan Chen; Yuanping Du; Ning Li; Sujie Cao; Jessica van Setten; Androniki Menelaou; Sara L. Pulit; Jayne Y. Hehir-Kwa; Marian Beekman; Clara C. Elbers; Heorhiy Byelas; Anton J. M. de Craen; Patrick Deelen; Martijn Dijkstra
Within the Netherlands a national network of biobanks has been established (Biobanking and Biomolecular Research Infrastructure-Netherlands (BBMRI-NL)) as a national node of the European BBMRI. One of the aims of BBMRI-NL is to enrich biobanks with different types of molecular and phenotype data. Here, we describe the Genome of the Netherlands (GoNL), one of the projects within BBMRI-NL. GoNL is a whole-genome-sequencing project in a representative sample consisting of 250 trio-families from all provinces in the Netherlands, which aims to characterize DNA sequence variation in the Dutch population. The parent–offspring trios include adult individuals ranging in age from 19 to 87 years (mean=53 years; SD=16 years) from birth cohorts 1910–1994. Sequencing was done on blood-derived DNA from uncultured cells and accomplished coverage was 14–15x. The family-based design represents a unique resource to assess the frequency of regional variants, accurately reconstruct haplotypes by family-based phasing, characterize short indels and complex structural variants, and establish the rate of de novo mutational events. GoNL will also serve as a reference panel for imputation in the available genome-wide association studies in Dutch and other cohorts to refine association signals and uncover population-specific variants. GoNL will create a catalog of human genetic variation in this sample that is uniquely characterized with respect to micro-geographic location and a wide range of phenotypes. The resource will be made available to the research and medical community to guide the interpretation of sequencing projects. The present paper summarizes the global characteristics of the project.
European Journal of Human Genetics | 2014
Patrick Deelen; Androniki Menelaou; Elisabeth M. van Leeuwen; Alexandros Kanterakis; Freerk van Dijk; Carolina Medina-Gomez; Laurent C. Francioli; J ouke; Jan Hottenga; Lennart C. Karssen; Karol Estrada; Eskil Kreiner-Møller; Fernando Rivadeneira; Jessica van Setten; Javier Gutierrez-Achury; Lude Franke; David van Enckevort; Martijn Dijkstra; Heorhiy Byelas; Paul I. W. de Bakker; Cisca Wijmenga; Morris A. Swertz
Although genome-wide association studies (GWAS) have identified many common variants associated with complex traits, low-frequency and rare variants have not been interrogated in a comprehensive manner. Imputation from dense reference panels, such as the 1000 Genomes Project (1000G), enables testing of ungenotyped variants for association. Here we present the results of imputation using a large, new population-specific panel: the Genome of The Netherlands (GoNL). We benchmarked the performance of the 1000G and GoNL reference sets by comparing imputation genotypes with ‘true’ genotypes typed on ImmunoChip in three European populations (Dutch, British, and Italian). GoNL showed significant improvement in the imputation quality for rare variants (MAF 0.05–0.5%) compared with 1000G. In Dutch samples, the mean observed Pearson correlation, r2, increased from 0.61 to 0.71. We also saw improved imputation accuracy for other European populations (in the British samples, r2 improved from 0.58 to 0.65, and in the Italians from 0.43 to 0.47). A combined reference set comprising 1000G and GoNL improved the imputation of rare variants even further. The Italian samples benefitted the most from this combined reference (the mean r2 increased from 0.47 to 0.50). We conclude that the creation of a large population-specific reference is advantageous for imputing rare variants and that a combined reference panel across multiple populations yields the best imputation results.
Nature Communications | 2016
Jayne Y. Hehir-Kwa; Tobias Marschall; Wigard P. Kloosterman; Laurent C. Francioli; Jasmijn A. Baaijens; Louis J. Dijkstra; Abdel Abdellaoui; Vyacheslav Koval; Djie Tjwan Thung; René Wardenaar; Ivo Renkens; Bradley P. Coe; Patrick Deelen; Joep de Ligt; Eric-Wubbo Lameijer; Freerk van Dijk; Fereydoun Hormozdiari; Jasper Bovenberg; Anton J. M. de Craen; Marian Beekman; Albert Hofman; Gonneke Willemsen; Bruce H. R. Wolffenbuttel; Mathieu Platteel; Yuanping Du; Ruoyan Chen; Hongzhi Cao; Rui Cao; Yushen Sun; Jeremy Sujie Cao
Structural variation (SV) represents a major source of differences between individual human genomes and has been linked to disease phenotypes. However, the majority of studies provide neither a global view of the full spectrum of these variants nor integrate them into reference panels of genetic variation. Here, we analyse whole genome sequencing data of 769 individuals from 250 Dutch families, and provide a haplotype-resolved map of 1.9 million genome variants across 9 different variant classes, including novel forms of complex indels, and retrotransposition-mediated insertions of mobile elements and processed RNAs. A large proportion are previously under reported variants sized between 21 and 100 bp. We detect 4 megabases of novel sequence, encoding 11 new transcripts. Finally, we show 191 known, trait-associated SNPs to be in strong linkage disequilibrium with SVs and demonstrate that our panel facilitates accurate imputation of SVs in unrelated individuals.
Nature Communications | 2015
Elisabeth M. van Leeuwen; Lennart C. Karssen; Joris Deelen; Aaron Isaacs; Carolina Medina-Gomez; Hamdi Mbarek; Alexandros Kanterakis; Stella Trompet; Iris Postmus; Niek Verweij; David van Enckevort; Jennifer E. Huffman; Charles C. White; Mary F. Feitosa; Traci M. Bartz; Ani Manichaikul; Peter K. Joshi; Gina M. Peloso; Patrick Deelen; Freerk van Dijk; Gonneke Willemsen; Eco J. de Geus; Yuri Milaneschi; Laurent C. Francioli; Androniki Menelaou; Sara L. Pulit; Fernando Rivadeneira; Albert Hofman; Ben A. Oostra; Oscar H. Franco
Variants associated with blood lipid levels may be population-specific. To identify low-frequency variants associated with this phenotype, population-specific reference panels may be used. Here we impute nine large Dutch biobanks (~35,000 samples) with the population-specific reference panel created by the Genome of the Netherlands Project and perform association testing with blood lipid levels. We report the discovery of five novel associations at four loci (P value <6.61 × 10−4), including a rare missense variant in ABCA6 (rs77542162, p.Cys1359Arg, frequency 0.034), which is predicted to be deleterious. The frequency of this ABCA6 variant is 3.65-fold increased in the Dutch and its effect (βLDL-C=0.135, βTC=0.140) is estimated to be very similar to those observed for single variants in well-known lipid genes, such as LDLR.
Bioinformatics | 2016
Chao Pang; David van Enckevort; Mark de Haan; Fleur Kelpin; Jonathan Jetten; Dennis Hendriksen; Tommy de Boer; Bart Charbon; Erwin Winder; K. Joeri van der Velde; Dany Doiron; Isabel Fortier; Hans L. Hillege; Morris A. Swertz
Motivation: While the size and number of biobanks, patient registries and other data collections are increasing, biomedical researchers still often need to pool data for statistical power, a task that requires time-intensive retrospective integration. Results: To address this challenge, we developed MOLGENIS/connect, a semi-automatic system to find, match and pool data from different sources. The system shortlists relevant source attributes from thousands of candidates using ontology-based query expansion to overcome variations in terminology. Then it generates algorithms that transform source attributes to a common target DataSchema. These include unit conversion, categorical value matching and complex conversion patterns (e.g. calculation of BMI). In comparison to human-experts, MOLGENIS/connect was able to auto-generate 27% of the algorithms perfectly, with an additional 46% needing only minor editing, representing a reduction in the human effort and expertise needed to pool data. Availability and Implementation: Source code, binaries and documentation are available as open-source under LGPLv3 from http://github.com/molgenis/molgenis and www.molgenis.org/connect. Contact: [email protected] Supplementary information: Supplementary data are available at Bioinformatics online.
European Journal of Human Genetics | 2017
Laurent C. Francioli; Mircea Cretu-Stancu; Kiran Garimella; Menachem Fromer; Wigard P. Kloosterman; Cisca Wijmenga; Principal Investigator; Morris A. Swertz; Cornelia M. van Duijn; Dorret I. Boomsma; PEline Slagboom; Gert-Jan B. van Ommen; Paul I. W. de Bakker; Freerk van Dijk; Androniki Menelaou; Pieter B. T. Neerincx; Sara L. Pulit; Patrick Deelen; Clara C. Elbers; Pier Francesco Palamara; Itsik Pe'er; Abdel Abdellaoui; Mannis van Oven; Martijn Vermaat; Mingkun Li; Jeroen F. J. Laros; Mark Stoneking; Peter de Knijff; Manfred Kayser; Jan H. Veldink
Germline mutation detection from human DNA sequence data is challenging due to the rarity of such events relative to the intrinsic error rates of sequencing technologies and the uneven coverage across the genome. We developed PhaseByTransmission (PBT) to identify de novo single nucleotide variants and short insertions and deletions (indels) from sequence data collected in parent-offspring trios. We compute the joint probability of the data given the genotype likelihoods in the individual family members, the known familial relationships and a prior probability for the mutation rate. Candidate de novo mutations (DNMs) are reported along with their posterior probability, providing a systematic way to prioritize them for validation. Our tool is integrated in the Genome Analysis Toolkit and can be used together with the ReadBackedPhasing module to infer the parental origin of DNMs based on phase-informative reads. Using simulated data, we show that PBT outperforms existing tools, especially in low coverage data and on the X chromosome. We further show that PBT displays high validation rates on empirical parent-offspring sequencing data for whole-exome data from 104 trios and X-chromosome data from 249 parent-offspring families. Finally, we demonstrate an association between father’s age at conception and the number of DNMs in female offspring’s X chromosome, consistent with previous literature reports.
Smart Health | 2015
Heimo Müller; Robert Reihs; Kurt Zatloukal; Fleur Jeanquartier; Roxana Merino-Martinez; David van Enckevort; Morris A. Swertz; Andreas Holzinger
Biobanks are essential for the realization of P4-medicine, hence indispensable for smart health. One of the grand challenges in biobank research is to close the research cycle in such a way that all the data generated by one research study can be consistently associated to the original samples, therefore data and knowledge can be reused in other studies. A catalogue must provide the information hub connecting all relevant information sources. The key knowledge embedded in a biobank catalogue is the availability and quality of proper samples to perform a research project. Depending on the study type, the samples can reflect a healthy reference population, a cross sectional representation of a certain group of people (healthy or with various diseases) or a certain disease type or stage. To overview and compare collections from different catalogues, we introduce visual analytics techniques, especially glyph based visualization techniques, which were successfully applied for knowledge discovery of single biobank catalogues. In this paper, we describe the state-of-the art in the integration of biobank catalogues addressing the challenge of combining heterogeneous data sources in a unified and meaningful way, consequently enabling the discovery and visualization of data from different sources. Finally we present open questions both in data integration and visualization of unified catalogues and propose future research in data integration with a linked data approach and the fusion of multi level glyph and network visualization.
International Journal of Environmental Research and Public Health | 2018
Yllka Kodra; Jérôme Weinbach; Manuel Posada-de-la-Paz; Alessio Coi; S. Lemonnier; David van Enckevort; Marco Roos; Annika Jacobsen; Ronald Cornet; S. Ahmed; Virginie Bros-Facer; Veronica Popa; Marieke Van Meel; Daniel Renault; Rainald von Gizycki; Michele Santoro; Paul Landais; Paola Torreri; Claudio Carta; Deborah Mascalzoni; Sabina Gainotti; Estrella Lopez; Anna Ambrosini; Heimo Müller; Robert Reis; Fabrizio Bianchi; Yaffa Rubinstein; Hanns Lochmüller; Domenica Taruscio
Rare diseases (RD) patient registries are powerful instruments that help develop clinical research, facilitate the planning of appropriate clinical trials, improve patient care, and support healthcare management. They constitute a key information system that supports the activities of European Reference Networks (ERNs) on rare diseases. A rapid proliferation of RD registries has occurred during the last years and there is a need to develop guidance for the minimum requirements, recommendations and standards necessary to maintain a high-quality registry. In response to these heterogeneities, in the framework of RD-Connect, a European platform connecting databases, registries, biobanks and clinical bioinformatics for rare disease research, we report on a list of recommendations, developed by a group of experts, including members of patient organizations, to be used as a framework for improving the quality of RD registries. This list includes aspects of governance, Findable, Accessible, Interoperable and Reusable (FAIR) data and information, infrastructure, documentation, training, and quality audit. The list is intended to be used by established as well as new RD registries. Further work includes the development of a toolkit to enable continuous assessment and improvement of their organizational and data quality.
Human Mutation | 2018
Gillian S. Townend; Friederike Ehrhart; Henk J. van Kranen; Mark D. Wilkinson; Annika Jacobsen; Marco Roos; Egon Willighagen; David van Enckevort; Chris T. Evelo; Leopold M. G. Curfs
Rett syndrome (RTT) is a monogenic rare disorder that causes severe neurological problems. In most cases, it results from a loss‐of‐function mutation in the gene encoding methyl‐CPG‐binding protein 2 (MECP2). Currently, about 900 unique MECP2 variations (benign and pathogenic) have been identified and it is suspected that the different mutations contribute to different levels of disease severity. For researchers and clinicians, it is important that genotype–phenotype information is available to identify disease‐causing mutations for diagnosis, to aid in clinical management of the disorder, and to provide counseling for parents. In this study, 13 genotype–phenotype databases were surveyed for their general functionality and availability of RTT‐specific MECP2 variation data. For each database, we investigated findability and interoperability alongside practical user functionality, and type and amount of genetic and phenotype data. The main conclusions are that, as well as being challenging to find these databases and specific MECP2 variants held within, interoperability is as yet poorly developed and requires effort to search across databases. Nevertheless, we found several thousand online database entries for MECP2 variations and their associated phenotypes, diagnosis, or predicted variant effects, which is a good starting point for researchers and clinicians who want to provide, annotate, and use the data.