Network


Latest external collaboration on country level. Dive into details by clicking on the dots.

Hotspot


Dive into the research topics where Hilary Finucane is active.

Publication


Featured researches published by Hilary Finucane.


Nature Genetics | 2015

LD Score regression distinguishes confounding from polygenicity in genome-wide association studies

Brendan Bulik-Sullivan; Po-Ru Loh; Hilary Finucane; Stephan Ripke; Jian Yang; Nick Patterson; Mark J. Daly; Alkes L. Price; Benjamin M. Neale

Both polygenicity (many small genetic effects) and confounding biases, such as cryptic relatedness and population stratification, can yield an inflated distribution of test statistics in genome-wide association studies (GWAS). However, current methods cannot distinguish between inflation from a true polygenic signal and bias. We have developed an approach, LD Score regression, that quantifies the contribution of each by examining the relationship between test statistics and linkage disequilibrium (LD). The LD Score regression intercept can be used to estimate a more powerful and accurate correction factor than genomic control. We find strong evidence that polygenicity accounts for the majority of the inflation in test statistics in many GWAS of large sample size.


Nature Genetics | 2015

An atlas of genetic correlations across human diseases and traits

Brendan Bulik-Sullivan; Hilary Finucane; Verneri Anttila; Alexander Gusev; Felix R. Day; Po-Ru Loh; Laramie Duncan; John Perry; Nick Patterson; Elise B. Robinson; Mark J. Daly; Alkes L. Price; Benjamin M. Neale

Identifying genetic correlations between complex traits and diseases can provide useful etiological insights and help prioritize likely causal relationships. The major challenges preventing estimation of genetic correlation from genome-wide association study (GWAS) data with current methods are the lack of availability of individual-level genotype data and widespread sample overlap among meta-analyses. We circumvent these difficulties by introducing a technique—cross-trait LD Score regression—for estimating genetic correlation that requires only GWAS summary statistics and is not biased by sample overlap. We use this method to estimate 276 genetic correlations among 24 traits. The results include genetic correlations between anorexia nervosa and schizophrenia, anorexia and obesity, and educational attainment and several diseases. These results highlight the power of genome-wide analyses, as there currently are no significantly associated SNPs for anorexia nervosa and only three for educational attainment.


Nature Genetics | 2015

Partitioning heritability by functional annotation using genome-wide association summary statistics

Hilary Finucane; Brendan Bulik-Sullivan; Alexander Gusev; Gosia Trynka; Yakir A. Reshef; Po-Ru Loh; Verneri Anttila; Han Xu; Chongzhi Zang; Kyle Kai-How Farh; Stephan Ripke; Felix R. Day; Shaun Purcell; Eli A. Stahl; Sara Lindström; John Perry; Yukinori Okada; Soumya Raychaudhuri; Mark J. Daly; Nick Patterson; Benjamin M. Neale; Alkes L. Price

Recent work has demonstrated that some functional categories of the genome contribute disproportionately to the heritability of complex diseases. Here we analyze a broad set of functional elements, including cell type–specific elements, to estimate their polygenic contributions to heritability in genome-wide association studies (GWAS) of 17 complex diseases and traits with an average sample size of 73,599. To enable this analysis, we introduce a new method, stratified LD score regression, for partitioning heritability from GWAS summary statistics while accounting for linked markers. This new method is computationally tractable at very large sample sizes and leverages genome-wide information. Our findings include a large enrichment of heritability in conserved regions across many traits, a very large immunological disease–specific enrichment of heritability in FANTOM5 enhancers and many cell type–specific enrichments, including significant enrichment of central nervous system cell types in the heritability of body mass index, age at menarche, educational attainment and smoking behavior.


Nature Genetics | 2015

Efficient Bayesian mixed model analysis increases association power in large cohorts

Po-Ru Loh; George Tucker; Brendan Bulik-Sullivan; Bjarni J. Vilhjálmsson; Hilary Finucane; Rany M. Salem; Daniel I. Chasman; Paul M. Ridker; Benjamin M. Neale; Bonnie Berger; Nick Patterson; Alkes L. Price

Linear mixed models are a powerful statistical tool for identifying genetic associations and avoiding confounding. However, existing methods are computationally intractable in large cohorts and may not optimize power. All existing methods require time cost O(MN2) (where N is the number of samples and M is the number of SNPs) and implicitly assume an infinitesimal genetic architecture in which effect sizes are normally distributed, which can limit power. Here we present a far more efficient mixed-model association method, BOLT-LMM, which requires only a small number of O(MN) time iterations and increases power by modeling more realistic, non-infinitesimal genetic architectures via a Bayesian mixture prior on marker effect sizes. We applied BOLT-LMM to 9 quantitative traits in 23,294 samples from the Womens Genome Health Study (WGHS) and observed significant increases in power, consistent with simulations. Theory and simulations show that the boost in power increases with cohort size, making BOLT-LMM appealing for genome-wide association studies in large cohorts.


Bioinformatics | 2017

LD Hub: a centralized database and web interface to perform LD score regression that maximizes the potential of summary level GWAS data for SNP heritability and genetic correlation analysis

Jie Zheng; A. Mesut Erzurumluoglu; Benjamin Elsworth; John P. Kemp; Laurence J Howe; Philip Haycock; Gibran Hemani; Katherine E. Tansey; Charles Laurin; Early Genetics; Beate St Pourcain; Nicole M. Warrington; Hilary Finucane; Alkes L. Price; Brendan Bulik-Sullivan; Verneri Anttila; Lavinia Paternoster; Tom R. Gaunt; David Evans; Benjamin M. Neale

Motivation: LD score regression is a reliable and efficient method of using genome-wide association study (GWAS) summary-level results data to estimate the SNP heritability of complex traits and diseases, partition this heritability into functional categories, and estimate the genetic correlation between different phenotypes. Because the method relies on summary level results data, LD score regression is computationally tractable even for very large sample sizes. However, publicly available GWAS summary-level data are typically stored in different databases and have different formats, making it difficult to apply LD score regression to estimate genetic correlations across many different traits simultaneously. Results: In this manuscript, we describe LD Hub - a centralized database of summary-level GWAS results for 173 diseases/traits from different publicly available resources/consortia and a web interface that automates the LD score regression analysis pipeline. To demonstrate functionality and validate our software, we replicated previously reported LD score regression analyses of 49 traits/diseases using LD Hub; and estimated SNP heritability and the genetic correlation across the different phenotypes. We also present new results obtained by uploading a recent atopic dermatitis GWAS meta-analysis to examine the genetic correlation between the condition and other potentially related traits. In response to the growing availability of publicly accessible GWAS summary-level results data, our database and the accompanying web interface will ensure maximal uptake of the LD score regression methodology, provide a useful database for the public dissemination of GWAS results, and provide a method for easily screening hundreds of traits for overlapping genetic aetiologies. Availability and Implementation: The web interface and instructions for using LD Hub are available at http://ldsc.broadinstitute.org/ Contact: [email protected] Supplementary information: Supplementary data are available at Bioinformatics online.


Nature Genetics | 2015

Contrasting genetic architectures of schizophrenia and other complex diseases using fast variance-components analysis

Po-Ru Loh; Gaurav Bhatia; Alexander Gusev; Hilary Finucane; Brendan Bulik-Sullivan; Samuela Pollack; Teresa R. de Candia; Sang Hong Lee; Naomi R. Wray; Kenneth S. Kendler; Michael Conlon O'Donovan; Benjamin M. Neale; Nick Patterson; Alkes L. Price

Heritability analyses of genome-wide association study (GWAS) cohorts have yielded important insights into complex disease architecture, and increasing sample sizes hold the promise of further discoveries. Here we analyze the genetic architectures of schizophrenia in 49,806 samples from the PGC and nine complex diseases in 54,734 samples from the GERA cohort. For schizophrenia, we infer an overwhelmingly polygenic disease architecture in which ≥71% of 1-Mb genomic regions harbor ≥1 variant influencing schizophrenia risk. We also observe significant enrichment of heritability in GC-rich regions and in higher-frequency SNPs for both schizophrenia and GERA diseases. In bivariate analyses, we observe significant genetic correlations (ranging from 0.18 to 0.85) for several pairs of GERA diseases; genetic correlations were on average 1.3 tunes stronger than the correlations of overall disease liabilities. To accomplish these analyses, we developed a fast algorithm for multicomponent, multi-trait variance-components analysis that overcomes prior computational barriers that made such analyses intractable at this scale.


Nature Genetics | 2016

Reference-based phasing using the Haplotype Reference Consortium panel

Po-Ru Loh; Petr Danecek; Pier Francesco Palamara; Christian Fuchsberger; Yakir A. Reshef; Hilary Finucane; Sebastian Schoenherr; Lukas Forer; Shane McCarthy; Gonçalo R. Abecasis; Richard Durbin; Alkes L. Price

Haplotype phasing is a fundamental problem in medical and population genetics. Phasing is generally performed via statistical phasing in a genotyped cohort, an approach that can yield high accuracy in very large cohorts but attains lower accuracy in smaller cohorts. Here we instead explore the paradigm of reference-based phasing. We introduce a new phasing algorithm, Eagle2, that attains high accuracy across a broad range of cohort sizes by efficiently leveraging information from large external reference panels (such as the Haplotype Reference Consortium; HRC) using a new data structure based on the positional Burrows-Wheeler transform. We demonstrate that Eagle2 attains a ∼20× speedup and ∼10% increase in accuracy compared to reference-based phasing using SHAPEIT2. On European-ancestry samples, Eagle2 with the HRC panel achieves >2× the accuracy of 1000 Genomes–based phasing. Eagle2 is open source and freely available for HRC-based phasing via the Sanger Imputation Service and the Michigan Imputation Server.


Cell | 2015

Clinical Sequencing Uncovers Origins and Evolution of Lassa Virus

Kristian G. Andersen; B. Jesse Shapiro; Christian B. Matranga; Rachel Sealfon; Aaron E. Lin; Lina M. Moses; Onikepe A. Folarin; Augustine Goba; Ikponmwonsa Odia; Philomena E. Ehiane; Mambu Momoh; Eleina M. England; Sarah M. Winnicki; Luis M. Branco; Stephen K. Gire; Eric Phelan; Ridhi Tariyal; Ryan Tewhey; Omowunmi Omoniwa; Mohammed Fullah; Richard Fonnie; Mbalu Fonnie; Lansana Kanneh; Simbirie Jalloh; Michael Gbakie; Sidiki Saffa; Kandeh Karbo; Adrianne D. Gladden; James Qu; Matthew Stremlau

The 2013-2015 West African epidemic of Ebola virus disease (EVD) reminds us of how little is known about biosafety level 4 viruses. Like Ebola virus, Lassa virus (LASV) can cause hemorrhagic fever with high case fatality rates. We generated a genomic catalog of almost 200 LASV sequences from clinical and rodent reservoir samples. We show that whereas the 2013-2015 EVD epidemic is fueled by human-to-human transmissions, LASV infections mainly result from reservoir-to-human infections. We elucidated the spread of LASV across West Africa and show that this migration was accompanied by changes in LASV genome abundance, fatality rates, codon adaptation, and translational efficiency. By investigating intrahost evolution, we found that mutations accumulate in epitopes of viral surface proteins, suggesting selection for immune escape. This catalog will serve as a foundation for the development of vaccines and diagnostics. VIDEO ABSTRACT.


allerton conference on communication, control, and computing | 2008

Designing floating codes for expected performance

Flavio Chierichetti; Hilary Finucane; Zhenming Liu; Michaell Mitzenmacher

Floating codes are codes designed to store multiple values in a Write Asymmetric Memory, with applications to flash memory. In this model, a memory consists of a block of n cells, with each cell in one of q states {0,1,...,q -1}. The cells are used to represent k variable values from an ¿-ary alphabet. Cells can move from lower values to higher values easily, but moving any cell from a higher value to a lower value requires first resetting the entire block to an all 0 state. Reset operations are to be avoided; generally a block can only experience a large but finite number of resets before wearing out entirely. A code here corresponds to a mapping from cell states to variable values, and a transition function that gives how to rewrite cell states when a variable is changed. Previous work has focused on developing codes that maximize the worst-case number of variable changes, or equivalently cell rewrites, that can be experienced before resetting. In this paper, we introduce the problem of maximizing the expected number of variable changes before resetting, given an underlying Markov chain that models variable changes. We demonstrate that codes designed for expected performance can differ substantially from optimal worst-case codes, and suggest constructions for some simple cases. We then study the related question of the performance of random codes, again focusing on the issue of expected behavior.


Nature Communications | 2015

Shared genetic aetiology of puberty timing between sexes and with health-related outcomes

Felix R. Day; Brendan Bulik-Sullivan; David A. Hinds; Hilary Finucane; Joanne M. Murabito; Joyce Y. Tung; Ken K. Ong; John Perry

Understanding of the genetic regulation of puberty timing has come largely from studies of rare disorders and population-based studies in women. Here, we report the largest genomic analysis for puberty timing in 55,871 men, based on recalled age at voice breaking. Analysis across all genomic variants reveals strong genetic correlation (0.74, P=2.7 × 10−70) between male and female puberty timing. However, some loci show sex-divergent effects, including directionally opposite effects between sexes at the SIM1/MCHR2 locus (Pheterogeneity=1.6 × 10−12). We find five novel loci for puberty timing (P<5 × 10−8), in addition to nine signals in men that were previously reported in women. Newly implicated genes include two retinoic acid-related receptors, RORB and RXRA, and two genes reportedly disrupted in rare disorders of puberty, LEPR and KAL1. Finally, we identify genetic correlations that indicate shared aetiologies in both sexes between puberty timing and body mass index, fasting insulin levels, lipid levels, type 2 diabetes and cardiovascular disease.

Collaboration


Dive into the Hilary Finucane's collaboration.

Top Co-Authors

Avatar
Top Co-Authors

Avatar
Top Co-Authors

Avatar
Top Co-Authors

Avatar
Top Co-Authors

Avatar
Top Co-Authors

Avatar
Top Co-Authors

Avatar
Top Co-Authors

Avatar
Top Co-Authors

Avatar

Mark J. Daly

University of North Carolina at Chapel Hill

View shared research outputs
Top Co-Authors

Avatar
Researchain Logo
Decentralizing Knowledge