Po-Ru Loh | Researchain

Archive Network Publication Hotspot Collaboration

Network

Latest external collaboration on country level. Dive into details by clicking on the dots.

Explore More

Hotspot

Dive into the research topics where Po-Ru Loh is active.

Explore More

Publication

Featured researches published by Po-Ru Loh.

Nature Genetics | 2015

LD Score regression distinguishes confounding from polygenicity in genome-wide association studies

Brendan Bulik-Sullivan; Po-Ru Loh; Hilary Finucane; Stephan Ripke; Jian Yang; Nick Patterson; Mark J. Daly; Alkes L. Price; Benjamin M. Neale

Both polygenicity (many small genetic effects) and confounding biases, such as cryptic relatedness and population stratification, can yield an inflated distribution of test statistics in genome-wide association studies (GWAS). However, current methods cannot distinguish between inflation from a true polygenic signal and bias. We have developed an approach, LD Score regression, that quantifies the contribution of each by examining the relationship between test statistics and linkage disequilibrium (LD). The LD Score regression intercept can be used to estimate a more powerful and accurate correction factor than genomic control. We find strong evidence that polygenicity accounts for the majority of the inflation in test statistics in many GWAS of large sample size.

Nature Genetics | 2015

An atlas of genetic correlations across human diseases and traits

Brendan Bulik-Sullivan; Hilary Finucane; Verneri Anttila; Alexander Gusev; Felix R. Day; Po-Ru Loh; Laramie Duncan; John Perry; Nick Patterson; Elise B. Robinson; Mark J. Daly; Alkes L. Price; Benjamin M. Neale

Identifying genetic correlations between complex traits and diseases can provide useful etiological insights and help prioritize likely causal relationships. The major challenges preventing estimation of genetic correlation from genome-wide association study (GWAS) data with current methods are the lack of availability of individual-level genotype data and widespread sample overlap among meta-analyses. We circumvent these difficulties by introducing a technique—cross-trait LD Score regression—for estimating genetic correlation that requires only GWAS summary statistics and is not biased by sample overlap. We use this method to estimate 276 genetic correlations among 24 traits. The results include genetic correlations between anorexia nervosa and schizophrenia, anorexia and obesity, and educational attainment and several diseases. These results highlight the power of genome-wide analyses, as there currently are no significantly associated SNPs for anorexia nervosa and only three for educational attainment.

Nature Genetics | 2015

Partitioning heritability by functional annotation using genome-wide association summary statistics

Hilary Finucane; Brendan Bulik-Sullivan; Alexander Gusev; Gosia Trynka; Yakir A. Reshef; Po-Ru Loh; Verneri Anttila; Han Xu; Chongzhi Zang; Kyle Kai-How Farh; Stephan Ripke; Felix R. Day; Shaun Purcell; Eli A. Stahl; Sara Lindström; John Perry; Yukinori Okada; Soumya Raychaudhuri; Mark J. Daly; Nick Patterson; Benjamin M. Neale; Alkes L. Price

Recent work has demonstrated that some functional categories of the genome contribute disproportionately to the heritability of complex diseases. Here we analyze a broad set of functional elements, including cell type–specific elements, to estimate their polygenic contributions to heritability in genome-wide association studies (GWAS) of 17 complex diseases and traits with an average sample size of 73,599. To enable this analysis, we introduce a new method, stratified LD score regression, for partitioning heritability from GWAS summary statistics while accounting for linked markers. This new method is computationally tractable at very large sample sizes and leverages genome-wide information. Our findings include a large enrichment of heritability in conserved regions across many traits, a very large immunological disease–specific enrichment of heritability in FANTOM5 enhancers and many cell type–specific enrichments, including significant enrichment of central nervous system cell types in the heritability of body mass index, age at menarche, educational attainment and smoking behavior.

Nature Genetics | 2016

Next-generation genotype imputation service and methods

Sayantan Das; Lukas Forer; Sebastian Schönherr; Carlo Sidore; Adam E. Locke; Alan Kwong; Scott I. Vrieze; Emily Y. Chew; Shawn Levy; Matt McGue; David Schlessinger; Dwight Stambolian; Po-Ru Loh; William G. Iacono; Anand Swaroop; Laura J. Scott; Francesco Cucca; Florian Kronenberg; Michael Boehnke; Gonçalo R. Abecasis; Christian Fuchsberger

Genotype imputation is a key component of genetic association studies, where it increases power, facilitates meta-analysis, and aids interpretation of signals. Genotype imputation is computationally demanding and, with current tools, typically requires access to a high-performance computing cluster and to a reference panel of sequenced genomes. Here we describe improvements to imputation machinery that reduce computational requirements by more than an order of magnitude with no loss of accuracy in comparison to standard imputation tools. We also describe a new web-based service for imputation that facilitates access to new reference panels and greatly improves user experience and productivity.

Nature Genetics | 2015

Efficient Bayesian mixed model analysis increases association power in large cohorts

Po-Ru Loh; George Tucker; Brendan Bulik-Sullivan; Bjarni J. Vilhjálmsson; Hilary Finucane; Rany M. Salem; Daniel I. Chasman; Paul M. Ridker; Benjamin M. Neale; Bonnie Berger; Nick Patterson; Alkes L. Price

Linear mixed models are a powerful statistical tool for identifying genetic associations and avoiding confounding. However, existing methods are computationally intractable in large cohorts and may not optimize power. All existing methods require time cost O(MN2) (where N is the number of samples and M is the number of SNPs) and implicitly assume an infinitesimal genetic architecture in which effect sizes are normally distributed, which can limit power. Here we present a far more efficient mixed-model association method, BOLT-LMM, which requires only a small number of O(MN) time iterations and increases power by modeling more realistic, non-infinitesimal genetic architectures via a Bayesian mixture prior on marker effect sizes. We applied BOLT-LMM to 9 quantitative traits in 23,294 samples from the Womens Genome Health Study (WGHS) and observed significant increases in power, consistent with simulations. Theory and simulations show that the boost in power increases with cohort size, making BOLT-LMM appealing for genome-wide association studies in large cohorts.

Genetics | 2013

Inferring Admixture Histories of Human Populations Using Linkage Disequilibrium

Po-Ru Loh; Mark Lipson; Nick Patterson; Priya Moorjani; Joseph K. Pickrell; David Reich; Bonnie Berger

Long-range migrations and the resulting admixtures between populations have been important forces shaping human genetic diversity. Most existing methods for detecting and reconstructing historical admixture events are based on allele frequency divergences or patterns of ancestry segments in chromosomes of admixed individuals. An emerging new approach harnesses the exponential decay of admixture-induced linkage disequilibrium (LD) as a function of genetic distance. Here, we comprehensively develop LD-based inference into a versatile tool for investigating admixture. We present a new weighted LD statistic that can be used to infer mixture proportions as well as dates with fewer constraints on reference populations than previous methods. We define an LD-based three-population test for admixture and identify scenarios in which it can detect admixture events that previous formal tests cannot. We further show that we can uncover phylogenetic relationships among populations by comparing weighted LD curves obtained using a suite of references. Finally, we describe several improvements to the computation and fitting of weighted LD curves that greatly increase the robustness and speed of the calculations. We implement all of these advances in a software package, ALDER, which we validate in simulations and apply to test for admixture among all populations from the Human Genome Diversity Project (HGDP), highlighting insights into the admixture history of Central African Pygmies, Sardinians, and Japanese.

Nature Communications | 2012

The genetic prehistory of southern Africa

Joseph K. Pickrell; Nick Patterson; Chiara Barbieri; Falko Berthold; Linda Gerlach; Tom Güldemann; Blesswell Kure; Sununguko W. Mpoloka; Hirosi Nakagawa; Christfried Naumann; Mark Lipson; Po-Ru Loh; Joseph Lachance; Joanna L. Mountain; Carlos Bustamante; Bonnie Berger; Sarah A. Tishkoff; Brenna M. Henn; Mark Stoneking; David Reich; Brigitte Pakendorf

Southern and eastern African populations that speak non-Bantu languages with click consonants are known to harbour some of the most ancient genetic lineages in humans, but their relationships are poorly understood. Here, we report data from 23 populations analysed at over half a million single-nucleotide polymorphisms, using a genome-wide array designed for studying human history. The southern African Khoisan fall into two genetic groups, loosely corresponding to the northwestern and southeastern Kalahari, which we show separated within the last 30,000 years. We find that all individuals derive at least a few percent of their genomes from admixture with non-Khoisan populations that began ∼1,200 years ago. In addition, the East African Hadza and Sandawe derive a fraction of their ancestry from admixture with a population related to the Khoisan, supporting the hypothesis of an ancient link between southern and eastern Africa.

Proceedings of the National Academy of Sciences of the United States of America | 2014

Ancient west Eurasian ancestry in southern and eastern Africa

Joseph K. Pickrell; Nick Patterson; Po-Ru Loh; Mark Lipson; Bonnie Berger; Mark Stoneking; Brigitte Pakendorf; David Reich

Significance The hunter–gatherer and pastoralist populations of southern Africa are among the culturally, linguistically, and genetically most diverse human populations. However, little is known about their history. We show that all of these populations have some ancestry most closely related to Europeans and Middle Easterners and use this to reconstruct the history of population movements between Eurasia, eastern Africa, and southern Africa. The history of southern Africa involved interactions between indigenous hunter–gatherers and a range of populations that moved into the region. Here we use genome-wide genetic data to show that there are at least two admixture events in the history of Khoisan populations (southern African hunter–gatherers and pastoralists who speak non-Bantu languages with click consonants). One involved populations related to Niger–Congo-speaking African populations, and the other introduced ancestry most closely related to west Eurasian (European or Middle Eastern) populations. We date this latter admixture event to ∼900–1,800 y ago and show that it had the largest demographic impact in Khoisan populations that speak Khoe–Kwadi languages. A similar signal of west Eurasian ancestry is present throughout eastern Africa. In particular, we also find evidence for two admixture events in the history of Kenyan, Tanzanian, and Ethiopian populations, the earlier of which involved populations related to west Eurasians and which we date to ∼2,700–3,300 y ago. We reconstruct the allele frequencies of the putative west Eurasian population in eastern Africa and show that this population is a good proxy for the west Eurasian ancestry in southern Africa. The most parsimonious explanation for these findings is that west Eurasian ancestry entered southern Africa indirectly through eastern Africa.

Nature Genetics | 2015

Contrasting genetic architectures of schizophrenia and other complex diseases using fast variance-components analysis

Po-Ru Loh; Gaurav Bhatia; Alexander Gusev; Hilary Finucane; Brendan Bulik-Sullivan; Samuela Pollack; Teresa R. de Candia; Sang Hong Lee; Naomi R. Wray; Kenneth S. Kendler; Michael Conlon O'Donovan; Benjamin M. Neale; Nick Patterson; Alkes L. Price

Heritability analyses of genome-wide association study (GWAS) cohorts have yielded important insights into complex disease architecture, and increasing sample sizes hold the promise of further discoveries. Here we analyze the genetic architectures of schizophrenia in 49,806 samples from the PGC and nine complex diseases in 54,734 samples from the GERA cohort. For schizophrenia, we infer an overwhelmingly polygenic disease architecture in which ≥71% of 1-Mb genomic regions harbor ≥1 variant influencing schizophrenia risk. We also observe significant enrichment of heritability in GC-rich regions and in higher-frequency SNPs for both schizophrenia and GERA diseases. In bivariate analyses, we observe significant genetic correlations (ranging from 0.18 to 0.85) for several pairs of GERA diseases; genetic correlations were on average 1.3 tunes stronger than the correlations of overall disease liabilities. To accomplish these analyses, we developed a fast algorithm for multicomponent, multi-trait variance-components analysis that overcomes prior computational barriers that made such analyses intractable at this scale.

Nature Genetics | 2016

Reference-based phasing using the Haplotype Reference Consortium panel

Po-Ru Loh; Petr Danecek; Pier Francesco Palamara; Christian Fuchsberger; Yakir A. Reshef; Hilary Finucane; Sebastian Schoenherr; Lukas Forer; Shane McCarthy; Gonçalo R. Abecasis; Richard Durbin; Alkes L. Price

Haplotype phasing is a fundamental problem in medical and population genetics. Phasing is generally performed via statistical phasing in a genotyped cohort, an approach that can yield high accuracy in very large cohorts but attains lower accuracy in smaller cohorts. Here we instead explore the paradigm of reference-based phasing. We introduce a new phasing algorithm, Eagle2, that attains high accuracy across a broad range of cohort sizes by efficiently leveraging information from large external reference panels (such as the Haplotype Reference Consortium; HRC) using a new data structure based on the positional Burrows-Wheeler transform. We demonstrate that Eagle2 attains a ∼20× speedup and ∼10% increase in accuracy compared to reference-based phasing using SHAPEIT2. On European-ancestry samples, Eagle2 with the HRC panel achieves >2× the accuracy of 1000 Genomes–based phasing. Eagle2 is open source and freely available for HRC-based phasing via the Sanger Imputation Service and the Michigan Imputation Server.

Explore More