Network


Latest external collaboration on country level. Dive into details by clicking on the dots.

Hotspot


Dive into the research topics where Lukas Forer is active.

Publication


Featured researches published by Lukas Forer.


Nature Genetics | 2016

Next-generation genotype imputation service and methods

Sayantan Das; Lukas Forer; Sebastian Schönherr; Carlo Sidore; Adam E. Locke; Alan Kwong; Scott I. Vrieze; Emily Y. Chew; Shawn Levy; Matt McGue; David Schlessinger; Dwight Stambolian; Po-Ru Loh; William G. Iacono; Anand Swaroop; Laura J. Scott; Francesco Cucca; Florian Kronenberg; Michael Boehnke; Gonçalo R. Abecasis; Christian Fuchsberger

Genotype imputation is a key component of genetic association studies, where it increases power, facilitates meta-analysis, and aids interpretation of signals. Genotype imputation is computationally demanding and, with current tools, typically requires access to a high-performance computing cluster and to a reference panel of sequenced genomes. Here we describe improvements to imputation machinery that reduce computational requirements by more than an order of magnitude with no loss of accuracy in comparison to standard imputation tools. We also describe a new web-based service for imputation that facilitates access to new reference panels and greatly improves user experience and productivity.


Nucleic Acids Research | 2016

HaploGrep 2: mitochondrial haplogroup classification in the era of high-throughput sequencing

Hansi Weissensteiner; Dominic Pacher; Anita Kloss-Brandstätter; Lukas Forer; Günther Specht; Hans-Jürgen Bandelt; Florian Kronenberg; Antonio Salas; Sebastian Schönherr

Mitochondrial DNA (mtDNA) profiles can be classified into phylogenetic clusters (haplogroups), which is of great relevance for evolutionary, forensic and medical genetics. With the extensive growth of the underlying phylogenetic tree summarizing the published mtDNA sequences, the manual process of haplogroup classification would be too time-consuming. The previously published classification tool HaploGrep provided an automatic way to address this issue. Here, we present the completely updated version HaploGrep 2 offering several advanced features, including a generic rule-based system for immediate quality control (QC). This allows detecting artificial recombinants and missing variants as well as annotating rare and phantom mutations. Furthermore, the handling of high-throughput data in form of VCF files is now directly supported. For data output, several graphical reports are generated in real time, such as a multiple sequence alignment format, a VCF format and extended haplogroup QC reports, all viewable directly within the application. In addition, HaploGrep 2 generates a publication-ready phylogenetic tree of all input samples encoded relative to the revised Cambridge Reference Sequence. Finally, new distance measures and optimizations of the algorithm increase accuracy and speed-up the application. HaploGrep 2 can be accessed freely and without any registration at http://haplogrep.uibk.ac.at.


Nature Genetics | 2016

Reference-based phasing using the Haplotype Reference Consortium panel

Po-Ru Loh; Petr Danecek; Pier Francesco Palamara; Christian Fuchsberger; Yakir A. Reshef; Hilary Finucane; Sebastian Schoenherr; Lukas Forer; Shane McCarthy; Gonçalo R. Abecasis; Richard Durbin; Alkes L. Price

Haplotype phasing is a fundamental problem in medical and population genetics. Phasing is generally performed via statistical phasing in a genotyped cohort, an approach that can yield high accuracy in very large cohorts but attains lower accuracy in smaller cohorts. Here we instead explore the paradigm of reference-based phasing. We introduce a new phasing algorithm, Eagle2, that attains high accuracy across a broad range of cohort sizes by efficiently leveraging information from large external reference panels (such as the Haplotype Reference Consortium; HRC) using a new data structure based on the positional Burrows-Wheeler transform. We demonstrate that Eagle2 attains a ∼20× speedup and ∼10% increase in accuracy compared to reference-based phasing using SHAPEIT2. On European-ancestry samples, Eagle2 with the HRC panel achieves >2× the accuracy of 1000 Genomes–based phasing. Eagle2 is open source and freely available for HRC-based phasing via the Sanger Imputation Service and the Michigan Imputation Server.


Nephrology Dialysis Transplantation | 2015

Disease burden and risk profile in referred patients with moderate chronic kidney disease: composition of the German Chronic Kidney Disease (GCKD) cohort

Stephanie Titze; Matthias Schmid; Anna Köttgen; Martin Busch; Jürgen Floege; Christoph Wanner; Florian Kronenberg; Kai-Uwe Eckardt; Hans-Ulrich Prokosch; Barbara Bärthlein; Andreas Beck; Thomas Ganslandt; Olaf Gefeller; Jan Köster; Martina Malzer; Georg Schlieper; Frank Eitner; Sabine Meisen; Katharina Kehl; Elfriede Arweiler; Elke Schaeffner; Seema Baid-Agrawal; Ralf Schindler; Silvia Hübner; Thomas Dienemann; Karl F. Hilgers; Ulla T. Schultheiß; Gerd Walz; Jan T. Kielstein; Johan M. Lorenzen

BACKGROUND A main challenge for targeting chronic kidney disease (CKD) is the heterogeneity of its causes, co-morbidities and outcomes. Patients under nephrological care represent an important reference population, but knowledge about their characteristics is limited. METHODS We enrolled 5217 carefully phenotyped patients with moderate CKD [estimated glomerular filtration rate (eGFR) 30-60 mL/min per 1.73 m(2) or overt proteinuria at higher eGFR] under routine care of nephrologists into the German Chronic Kidney Disease (GCKD) study, thereby establishing the currently worldwide largest CKD cohort. RESULTS The cohort has 60% men, a mean age (±SD) of 60 ± 12 years, a mean eGFR of 47 ± 17 mL/min per 1.73 m(2) and a median (IQR) urinary albumin/creatinine ratio of 51 (9-392) mg/g. Assessment of causes of CKD revealed a high degree of uncertainty, with the leading cause unknown in 20% and frequent suspicion of multifactorial pathogenesis. Thirty-five per cent of patients had diabetes, but only 15% were considered to have diabetic nephropathy. Cardiovascular disease prevalence was high (32%, excluding hypertension); prevalent risk factors included smoking (59% current or former smokers) and obesity (43% with BMI >30). Despite widespread use of anti-hypertensive medication, only 52% of the cohort had an office blood pressure <140/90 mmHg. Family histories for cardiovascular events (39%) and renal disease (28%) suggest familial aggregation. CONCLUSIONS Patients with moderate CKD under specialist care have a high disease burden. Improved diagnostic accuracy, rigorous management of risk factors and unravelling of the genetic predisposition may represent strategies for improving prognosis.


BMC Bioinformatics | 2012

Cloudgene: A graphical execution platform for MapReduce programs on private and public clouds

Sebastian Schönherr; Lukas Forer; Hansi Weißensteiner; Florian Kronenberg; Günther Specht; Anita Kloss-Brandstätter

BackgroundThe MapReduce framework enables a scalable processing and analyzing of large datasets by distributing the computational load on connected computer nodes, referred to as a cluster. In Bioinformatics, MapReduce has already been adopted to various case scenarios such as mapping next generation sequencing data to a reference genome, finding SNPs from short read data or matching strings in genotype files. Nevertheless, tasks like installing and maintaining MapReduce on a cluster system, importing data into its distributed file system or executing MapReduce programs require advanced knowledge in computer science and could thus prevent scientists from usage of currently available and useful software solutions.ResultsHere we present Cloudgene, a freely available platform to improve the usability of MapReduce programs in Bioinformatics by providing a graphical user interface for the execution, the import and export of data and the reproducibility of workflows on in-house (private clouds) and rented clusters (public clouds). The aim of Cloudgene is to build a standardized graphical execution environment for currently available and future MapReduce programs, which can all be integrated by using its plug-in interface. Since Cloudgene can be executed on private clusters, sensitive datasets can be kept in house at all time and data transfer times are therefore minimized.ConclusionsOur results show that MapReduce programs can be integrated into Cloudgene with little effort and without adding any computational overhead to existing programs. This platform gives developers the opportunity to focus on the actual implementation task and provides scientists a platform with the aim to hide the complexity of MapReduce. In addition to MapReduce programs, Cloudgene can also be used to launch predefined systems (e.g. Cloud BioLinux, RStudio) in public clouds. Currently, five different bioinformatic programs using MapReduce and two systems are integrated and have been successfully deployed. Cloudgene is freely available athttp://cloudgene.uibk.ac.at.


BMC Evolutionary Biology | 2014

Large-scale mitochondrial DNA analysis in Southeast Asia reveals evolutionary effects of cultural isolation in the multi-ethnic population of Myanmar

Monika Summerer; Jürgen Horst; Gertraud Erhart; Hansi Weißensteiner; Sebastian Schönherr; Dominic Pacher; Lukas Forer; David Horst; Angelika Manhart; Basil Horst; Torpong Sanguansermsri; Anita Kloss-Brandstätter

BackgroundMyanmar is the largest country in mainland Southeast Asia with a population of 55 million people subdivided into more than 100 ethnic groups. Ruled by changing kingdoms and dynasties and lying on the trade route between India and China, Myanmar was influenced by numerous cultures. Since its independence from British occupation, tensions between the ruling Bamar and ethnic minorities increased.ResultsOur aim was to search for genetic footprints of Myanmar’s geographic, historic and sociocultural characteristics and to contribute to the picture of human colonization by describing and dating of new mitochondrial DNA (mtDNA) haplogroups. Therefore, we sequenced the mtDNA control region of 327 unrelated donors and the complete mitochondrial genome of 44 selected individuals according to highest quality standards.ConclusionPhylogenetic analyses of the entire mtDNA genomes uncovered eight new haplogroups and three unclassified basal M-lineages. The multi-ethnic population and the complex history of Myanmar were reflected in its mtDNA heterogeneity. Population genetic analyses of Burmese control region sequences combined with population data from neighboring countries revealed that the Myanmar haplogroup distribution showed a typical Southeast Asian pattern, but also Northeast Asian and Indian influences. The population structure of the extraordinarily diverse Bamar differed from that of the Karen people who displayed signs of genetic isolation. Migration analyses indicated a considerable genetic exchange with an overall positive migration balance from Myanmar to neighboring countries. Age estimates of the newly described haplogroups point to the existence of evolutionary windows where climatic and cultural changes gave rise to mitochondrial haplogroup diversification in Asia.


Biology Direct | 2015

Experiences with workflows for automating data-intensive bioinformatics

Ola Spjuth; Erik Bongcam-Rudloff; Guillermo Carrasco Hernández; Lukas Forer; Mario Giovacchini; Roman Valls Guimera; Aleksi Kallio; Eija Korpelainen; Maciej M. Kańduła; Milko Krachunov; David P. Kreil; Ognyan Kulev; Paweł P. Łabaj; Samuel Lampa; Luca Pireddu; Sebastian Schönherr; Alexey Siretskiy; Dimitar Vassilev

AbstractHigh-throughput technologies, such as next-generation sequencing, have turned molecular biology into a data-intensive discipline, requiring bioinformaticians to use high-performance computing resources and carry out data management and analysis tasks on large scale. Workflow systems can be useful to simplify construction of analysis pipelines that automate tasks, support reproducibility and provide measures for fault-tolerance. However, workflow systems can incur significant development and administration overhead so bioinformatics pipelines are often still built without them. We present the experiences with workflows and workflow systems within the bioinformatics community participating in a series of hackathons and workshops of the EU COST action SeqAhead. The organizations are working on similar problems, but we have addressed them with different strategies and solutions. This fragmentation of efforts is inefficient and leads to redundant and incompatible solutions. Based on our experiences we define a set of recommendations for future systems to enable efficient yet simple bioinformatics workflow construction and execution. Reviewers This article was reviewed by Dr Andrew Clark.


Atherosclerosis | 2015

Association of relative telomere length with cardiovascular disease in a large chronic kidney disease cohort: The GCKD study

Julia Raschenberger; Barbara Kollerits; Stephanie Titze; Anna Köttgen; Barbara Bärthlein; Arif B. Ekici; Lukas Forer; Sebastian Schönherr; Hansi Weissensteiner; Margot Haun; Christoph Wanner; Kai-Uwe Eckardt; Florian Kronenberg

BACKGROUND Chronic kidney disease (CKD) affects 10-15% of the general population and affected individuals are at an increased risk for cardiovascular disease (CVD). Since telomere length is considered to be involved in biological aging, we tested whether relative telomere length (RTL) might be a marker for these two diseases. METHODS The German Chronic Kidney Disease (GCKD) study is an ongoing prospective cohort study including patients with CKD of moderate severity. RTL was measured by qPCR in 4955 out of 5217 GCKD patients at baseline. RESULTS RTL was distributed in the cohort with a mean ± SD of 0.95 ± 0.19. CVD was present in 1266 patients. Each decrease of RTL by 0.1 unit was associated with a higher probability for prevalent CVD: OR = 1.06, 95% CI 1.02-1.11, p = 0.007 (adjusted for age, sex, eGFR, BMI, ln-CRP, smoking, hypertension, diabetes, and lipids). Similar findings were observed for history of specific CVD entities, such as coronary artery disease (OR = 1.05, p = 0.025), myocardial infarction (OR = 1.08, p = 0.013) and percutaneous transluminal coronary angioplasty (OR = 1.06, p = 0.032). The strongest associations were found for interventions at the carotid arteries (OR = 1.25, p = 0.001) as well as aortic aneurysms (OR = 1.22, p = 0.001). CONCLUSIONS In the presence of CKD there is a significant association between shorter RTL and CVD manifestations. RTL appears to be a marker reflecting changes in homeostasis associated with CKD that may contribute to the excess CVD risk.


Nucleic Acids Research | 2016

mtDNA-Server: next-generation sequencing data analysis of human mitochondrial DNA in the cloud

Hansi Weissensteiner; Lukas Forer; Christian Fuchsberger; Bernd Schöpf; Anita Kloss-Brandstätter; Günther Specht; Florian Kronenberg; Sebastian Schönherr

Next generation sequencing (NGS) allows investigating mitochondrial DNA (mtDNA) characteristics such as heteroplasmy (i.e. intra-individual sequence variation) to a higher level of detail. While several pipelines for analyzing heteroplasmies exist, issues in usability, accuracy of results and interpreting final data limit their usage. Here we present mtDNA-Server, a scalable web server for the analysis of mtDNA studies of any size with a special focus on usability as well as reliable identification and quantification of heteroplasmic variants. The mtDNA-Server workflow includes parallel read alignment, heteroplasmy detection, artefact or contamination identification, variant annotation as well as several quality control metrics, often neglected in current mtDNA NGS studies. All computational steps are parallelized with Hadoop MapReduce and executed graphically with Cloudgene. We validated the underlying heteroplasmy and contamination detection model by generating four artificial sample mix-ups on two different NGS devices. Our evaluation data shows that mtDNA-Server detects heteroplasmies and artificial recombinations down to the 1% level with perfect specificity and outperforms existing approaches regarding sensitivity. mtDNA-Server is currently able to analyze the 1000G Phase 3 data (n = 2,504) in less than 5 h and is freely accessible at https://mtdna-server.uibk.ac.at.


PLOS ONE | 2015

Validation of Next-Generation Sequencing of Entire Mitochondrial Genomes and the Diversity of Mitochondrial DNA Mutations in Oral Squamous Cell Carcinoma

Anita Kloss-Brandstätter; Hansi Weissensteiner; Gertraud Erhart; Georg Schäfer; Lukas Forer; Sebastian Schönherr; Dominic Pacher; Christof Seifarth; Andrea Stöckl; Liane Fendt; Irma Sottsas; Helmut Klocker; Christian W. Huck; Michael Rasse; Florian Kronenberg; Frank Kloss

Background Oral squamous cell carcinoma (OSCC) is mainly caused by smoking and alcohol abuse and shows a five-year survival rate of ~50%. We aimed to explore the variation of somatic mitochondrial DNA (mtDNA) mutations in primary oral tumors, recurrences and metastases. Methods We performed an in-depth validation of mtDNA next-generation sequencing (NGS) on an Illumina HiSeq 2500 platform for its application to cancer tissues, with the goal to detect low-level heteroplasmies and to avoid artifacts. Therefore we genotyped the mitochondrial genome (16.6 kb) from 85 tissue samples (tumors, recurrences, resection edges, metastases and blood) collected from 28 prospectively recruited OSCC patients applying both Sanger sequencing and high-coverage NGS (~35,000 reads per base). Results We observed a strong correlation between Sanger sequencing and NGS in estimating the mixture ratio of heteroplasmies (r = 0.99; p<0.001). Non-synonymous heteroplasmic variants were enriched among cancerous tissues. The proportions of somatic and inherited variants in a given gene region were strongly correlated (r = 0.85; p<0.001). Half of the patients shared mutations between benign and cancerous tissue samples. Low level heteroplasmies (<10%) were more frequent in benign samples compared to tumor samples, where heteroplasmies >10% were predominant. Four out of six patients who developed a local tumor recurrence showed mutations in the recurrence that had also been observed in the primary tumor. Three out of five patients, who had tumor metastases in the lymph nodes of their necks, shared mtDNA mutations between primary tumors and lymph node metastases. The percentage of mutation heteroplasmy increased from the primary tumor to lymph node metastases. Conclusions We conclude that Sanger sequencing is valid for heteroplasmy quantification for heteroplasmies ≥10% and that NGS is capable of reliably detecting and quantifying heteroplasmies down to the 1%-level. The finding of shared mutations between primary tumors, recurrences and metastasis indicates a clonal origin of malignant cells in oral cancer.

Collaboration


Dive into the Lukas Forer's collaboration.

Top Co-Authors

Avatar

Florian Kronenberg

Innsbruck Medical University

View shared research outputs
Top Co-Authors

Avatar

Sebastian Schönherr

Innsbruck Medical University

View shared research outputs
Top Co-Authors

Avatar

Hansi Weissensteiner

Innsbruck Medical University

View shared research outputs
Top Co-Authors

Avatar
Top Co-Authors

Avatar
Top Co-Authors

Avatar

Gertraud Erhart

Innsbruck Medical University

View shared research outputs
Top Co-Authors

Avatar

Christian Gieger

Pennington Biomedical Research Center

View shared research outputs
Top Co-Authors

Avatar
Top Co-Authors

Avatar

Kai-Uwe Eckardt

University of Erlangen-Nuremberg

View shared research outputs
Top Co-Authors

Avatar

Barbara Kollerits

Innsbruck Medical University

View shared research outputs
Researchain Logo
Decentralizing Knowledge