Network


Latest external collaboration on country level. Dive into details by clicking on the dots.

Hotspot


Dive into the research topics where Stephen F. Schaffner is active.

Publication


Featured researches published by Stephen F. Schaffner.


Nature | 2010

Integrating common and rare genetic variation in diverse human populations.

David Altshuler; Richard A. Gibbs; Leena Peltonen; Emmanouil T. Dermitzakis; Stephen F. Schaffner; Fuli Yu; Penelope E. Bonnen; de Bakker Pi; Panos Deloukas; Stacey Gabriel; R. Gwilliam; Sarah Hunt; Michael Inouye; Xiaoming Jia; Aarno Palotie; Melissa Parkin; Pamela Whittaker; Kyle Chang; Alicia Hawes; Lora Lewis; Yanru Ren; David A. Wheeler; Donna M. Muzny; C. Barnes; Katayoon Darvishi; Joshua M. Korn; Kristiansson K; Cin-Ty A. Lee; McCarrol Sa; James Nemesh

Despite great progress in identifying genetic variants that influence human disease, most inherited risk remains unexplained. A more complete understanding requires genome-wide studies that fully examine less common alleles in populations with a wide range of ancestry. To inform the design and interpretation of such studies, we genotyped 1.6 million common single nucleotide polymorphisms (SNPs) in 1,184 reference individuals from 11 global populations, and sequenced ten 100-kilobase regions in 692 of these individuals. This integrated data set of common and rare alleles, called ‘HapMap 3’, includes both SNPs and copy number polymorphisms (CNPs). We characterized population-specific differences among low-frequency variants, measured the improvement in imputation accuracy afforded by the larger reference panel, especially in imputing SNPs with a minor allele frequency of ≤5%, and demonstrated the feasibility of imputing newly discovered CNPs and SNPs. This expanded public resource of genome variants in global populations supports deeper interrogation of genomic variation and its role in human disease, and serves as a step towards a high-resolution map of the landscape of human genetic variation.


Nature | 2005

Initial sequence of the chimpanzee genome and comparison with the human genome

Tarjei S. Mikkelsen; LaDeana W. Hillier; Evan E. Eichler; Michael C. Zody; David B. Jaffe; Shiaw-Pyng Yang; Wolfgang Enard; Ines Hellmann; Kerstin Lindblad-Toh; Tasha K. Altheide; Nicoletta Archidiacono; Peer Bork; Jonathan Butler; Jean L. Chang; Ze Cheng; Asif T. Chinwalla; Pieter J. de Jong; Kimberley D. Delehaunty; Catrina C. Fronick; Lucinda L. Fulton; Yoav Gilad; Gustavo Glusman; Sante Gnerre; Tina Graves; Toshiyuki Hayakawa; Karen E. Hayden; Xiaoqiu Huang; Hongkai Ji; W. James Kent; Mary Claire King

Here we present a draft genome sequence of the common chimpanzee (Pan troglodytes). Through comparison with the human genome, we have generated a largely complete catalogue of the genetic differences that have accumulated since the human and chimpanzee species diverged from our common ancestor, constituting approximately thirty-five million single-nucleotide changes, five million insertion/deletion events, and various chromosomal rearrangements. We use this catalogue to explore the magnitude and regional variation of mutational forces shaping these two genomes, and the strength of positive and negative selection acting on their genes. In particular, we find that the patterns of evolution in human and chimpanzee protein-coding genes are highly correlated and dominated by the fixation of neutral and slightly deleterious alleles. We also use the chimpanzee genome as an outgroup to investigate human population genetics and identify signatures of selective sweeps in recent human evolution.Here we present a draft genome sequence of the common chimpanzee (Pan troglodytes). Through comparison with the human genome, we have generated a largely complete catalogue of the genetic differences that have accumulated since the human and chimpanzee species diverged from our common ancestor, constituting approximately thirty-five million single-nucleotide changes, five million insertion/deletion events, and various chromosomal rearrangements. We use this catalogue to explore the magnitude and regional variation of mutational forces shaping these two genomes, and the strength of positive and negative selection acting on their genes. In particular, we find that the patterns of evolution in human and chimpanzee protein-coding genes are highly correlated and dominated by the fixation of neutral and slightly deleterious alleles. We also use the chimpanzee genome as an outgroup to investigate human population genetics and identify signatures of selective sweeps in recent human evolution.


Nature Genetics | 2000

The common PPARgamma Pro12Ala polymorphism is associated with decreased risk of type 2 diabetes.

David Altshuler; Joel N. Hirschhorn; Mia Klannemark; Cecilia M. Lindgren; Marie-Claude Vohl; James Nemesh; Charles R. Lane; Stephen F. Schaffner; Stacey Bolk; Carl Brewer; Tiinamaija Tuomi; Daniel Gaudet; Thomas J. Hudson; Mark J. Daly; Leif Groop; Eric S. Lander

Genetic association studies are viewed as problematic and plagued by irreproducibility. Many associations have been reported for type 2 diabetes, but none have been confirmed in multiple samples and with comprehensive controls. We evaluated 16 published genetic associations to type 2 diabetes and related sub-phenotypes using a family-based design to control for population stratification, and replication samples to increase power. We were able to confirm only one association, that of the common Pro12Ala polymorphism in peroxisome proliferator-activated receptor-γ (PPARγ) with type 2 diabetes. By analysing over 3,000 individuals, we found a modest (1.25-fold) but significant (P=0.002) increase in diabetes risk associated with the more common proline allele (∼85% frequency). Moreover, our results resolve a controversy about common variation in PPARγ. An initial study found a threefold effect, but four of five subsequent publications failed to confirm the association. All six studies are consistent with the odds ratio we describe. The data implicate inherited variation in PPARγ in the pathogenesis of type 2 diabetes. Because the risk allele occurs at such high frequency, its modest effect translates into a large population attributable risk—influencing as much as 25% of type 2 diabetes in the general population.


Nature Genetics | 2001

High-resolution haplotype structure in the human genome.

Mark J. Daly; John D. Rioux; Stephen F. Schaffner; Thomas J. Hudson; Eric S. Lander

Linkage disequilibrium (LD) analysis is traditionally based on individual genetic markers and often yields an erratic, non-monotonic picture, because the power to detect allelic associations depends on specific properties of each marker, such as frequency and population history. Ideally, LD analysis should be based directly on the underlying haplotype structure of the human genome, but this structure has remained poorly understood. Here we report a high-resolution analysis of the haplotype structure across 500 kilobases on chromosome 5q31 using 103 single-nucleotide polymorphisms (SNPs) in a European-derived population. The results show a picture of discrete haplotype blocks (of tens to hundreds of kilobases), each with limited diversity punctuated by apparent sites of recombination. In addition, we develop an analytical model for LD mapping based on such haplotype blocks. If our observed structure is general (and published data suggest that it may be), it offers a coherent framework for creating a haplotype map of the human genome.


Nature | 2002

Detecting recent positive selection in the human genome from haplotype structure

Pardis C. Sabeti; David Reich; John M. Higgins; Haninah Z. P. Levine; Daniel J. Richter; Stephen F. Schaffner; Stacey Gabriel; Jill Platko; Nick Patterson; Gavin J. McDonald; Hans Ackerman; S J Campbell; David Altshuler; Richard S. Cooper; Dominic P. Kwiatkowski; Ryk Ward; Eric S. Lander

The ability to detect recent natural selection in the human population would have profound implications for the study of human history and for medicine. Here, we introduce a framework for detecting the genetic imprint of recent positive selection by analysing long-range haplotypes in human populations. We first identify haplotypes at a locus of interest (core haplotypes). We then assess the age of each core haplotype by the decay of its association to alleles at various distances from the locus, as measured by extended haplotype homozygosity (EHH). Core haplotypes that have unusually high EHH and a high population frequency indicate the presence of a mutation that rose to prominence in the human gene pool faster than expected under neutral evolution. We applied this approach to investigate selection at two genes carrying common variants implicated in resistance to malaria: G6PD and CD40 ligand. At both loci, the core haplotypes carrying the proposed protective mutation stand out and show significant evidence of selection. More generally, the method could be used to scan the entire genome for evidence of recent positive selection.


Science | 2006

Positive Natural Selection in the Human Lineage

Pardis C. Sabeti; Stephen F. Schaffner; Benjamin Fry; Jason Lohmueller; Patrick Varilly; O. Shamovsky; Alejandro Palma; Tarjei S. Mikkelsen; David Altshuler; Eric S. Lander

Positive natural selection is the force that drives the increase in prevalence of advantageous traits, and it has played a central role in our development as a species. Until recently, the study of natural selection in humans has largely been restricted to comparing individual candidate genes to theoretical expectations. The advent of genome-wide sequence and polymorphism data brings fundamental new tools to the study of natural selection. It is now possible to identify new candidates for selection and to reevaluate previous claims by comparison with empirical distributions of DNA sequence variation across the human genome and among populations. The flood of data and analytical methods, however, raises many new challenges. Here, we review approaches to detect positive natural selection, describe results from recent analyses of genome-wide data, and discuss the prospects and challenges ahead as we expand our understanding of the role of natural selection in shaping the human genome.


Science | 2014

Genomic surveillance elucidates Ebola virus origin and transmission during the 2014 outbreak

Stephen K. Gire; Augustine Goba; Kristian G. Andersen; Rachel Sealfon; Daniel J. Park; Lansana Kanneh; Simbirie Jalloh; Mambu Momoh; Mohamed Fullah; Gytis Dudas; Shirlee Wohl; Lina M. Moses; Nathan L. Yozwiak; Sarah M. Winnicki; Christian B. Matranga; Christine M. Malboeuf; James Qu; Adrianne D. Gladden; Stephen F. Schaffner; Xiao Yang; Pan Pan Jiang; Mahan Nekoui; Andres Colubri; Moinya Ruth Coomber; Mbalu Fonnie; Alex Moigboi; Michael Gbakie; Fatima K. Kamara; Veronica Tucker; Edwin Konuwa

In its largest outbreak, Ebola virus disease is spreading through Guinea, Liberia, Sierra Leone, and Nigeria. We sequenced 99 Ebola virus genomes from 78 patients in Sierra Leone to ~2000× coverage. We observed a rapid accumulation of interhost and intrahost genetic variation, allowing us to characterize patterns of viral transmission over the initial weeks of the epidemic. This West African variant likely diverged from central African lineages around 2004, crossed from Guinea to Sierra Leone in May 2014, and has exhibited sustained human-to-human transmission subsequently, with no evidence of additional zoonotic sources. Because many of the mutations alter protein sequences and other biologically meaningful targets, they should be monitored for impact on diagnostics, vaccines, and therapies critical to outbreak response.


Molecular Psychiatry | 2002

Family-based association study of 76 candidate genes in bipolar disorder: BDNF is a potential risk locus

Pamela Sklar; Stacey Gabriel; P. Bennett; Y-M Lim; G. Tsan; Stephen F. Schaffner; George Kirov; Ian Richard Jones; Michael John Owen; N. Craddock; J. R. DePaulo; Eric S. Lander

Identification of the genetic bases for bipolar disorder remains a challenge for the understanding of this disease. Association between 76 candidate genes and bipolar disorder was tested by genotyping 90 single-nucleotide polymorphisms (SNPs) in these genes in 136 parent-proband trios. In this preliminary analysis, SNPs in two genes, brain-derived neurotrophic factor (BDNF) and the alpha subunit of the voltage-dependent calcium channel were associated with bipolar disorder at the P<0.05 level. In view of the large number of hypotheses tested, the two nominally positive associations were then tested in independent populations of bipolar patients and only BDNF remains a potential risk gene. In the replication samples, excess transmission of the valine allele of amino acid 66 of BDNF was observed in the direction of the original result in an additional sample of 334 parent-proband trios (T/U=108/87, P=0.066). Resequencing of 29 kb surrounding the BDNF gene identified 44 additional SNPs. Genotyping eight common SNPs identified three additional markers transmitted to bipolar probands at the P < 0.05 level. Strong LD was observed across this region and all adjacent pairwise haplotypes showed excess transmission to the bipolar proband. Analysis of these haplotypes using TRANSMIT revealed a global P value of 0.03. A single haplotype was identified that is shared by both the original dataset and the replication sample that is uniquely marked by both the rare A allele of the original SNP and a novel allele 11.5 kb 3′. Therefore, this study of 76 candidate genes has identified BDNF as a potential risk allele that will require additional study to confirm.


Science | 2010

A Composite of Multiple Signals Distinguishes Causal Variants in Regions of Positive Selection

Sharon R. Grossman; Ilya Shylakhter; Elinor K. Karlsson; Elizabeth H. Byrne; Shannon Morales; Gabriel Frieden; Elizabeth Hostetter; Elaine Angelino; Manuel Garber; Or Zuk; Eric S. Lander; Stephen F. Schaffner; Pardis C. Sabeti

Pinpointing Genetic Selection The human genome contains hundreds of regions with evidence of recent positive natural selection, yet, for all but a handful of cases, the underlying advantageous mutation remains unknown. Current methods to detect the signal of selection often results in the identification of a broad genomic region containing many candidate regions that vary among individuals. By combining existing statistical methods, Grossman et al. (p. 883, published online 7 January) developed a method, termed Composite of Multiple Signals, which can increase the ability to pinpoint the specific variant under selection. Several candidate regions under selection in human populations were identified. Combining statistical methods detects signals of selection with increased sensitivity and a lower false-positive rate. The human genome contains hundreds of regions whose patterns of genetic variation indicate recent positive natural selection, yet for most the underlying gene and the advantageous mutation remain unknown. We developed a method, composite of multiple signals (CMS), that combines tests for multiple signals of selection and increases resolution by up to 100-fold. By applying CMS to candidate regions from the International Haplotype Map, we localized population-specific selective signals to 55 kilobases (median), identifying known and novel causal variants. CMS can not just identify individual loci but implicates precise variants selected by evolution.


Proceedings of the National Academy of Sciences of the United States of America | 2014

Searching for missing heritability: Designing rare variant association studies

Or Zuk; Stephen F. Schaffner; Kaitlin E. Samocha; Ron Do; Eliana Hechter; Sekar Kathiresan; Mark J. Daly; Benjamin M. Neale; Shamil R. Sunyaev; Eric S. Lander

Significance Discovering the genetic basis of common diseases, such as diabetes, heart disease, and schizophrenia, is a key goal in biomedicine. Genomic studies have revealed thousands of common genetic variants underlying disease, but these variants explain only a portion of the heritability. Rare variants are also likely to play an important role, but few examples are known thus far, and initial discovery efforts with small sample sizes have had only limited success. In this paper, we describe an analytical framework for the design of rare variant association studies of disease. It provides guidance with respect to sample size, as well as the roles of selection, disruptive and missense alleles, gene-specific allele frequency thresholds, isolated populations, gene sets, and coding vs. noncoding regions. Genetic studies have revealed thousands of loci predisposing to hundreds of human diseases and traits, revealing important biological pathways and defining novel therapeutic hypotheses. However, the genes discovered to date typically explain less than half of the apparent heritability. Because efforts have largely focused on common genetic variants, one hypothesis is that much of the missing heritability is due to rare genetic variants. Studies of common variants are typically referred to as genomewide association studies, whereas studies of rare variants are often simply called sequencing studies. Because they are actually closely related, we use the terms common variant association study (CVAS) and rare variant association study (RVAS). In this paper, we outline the similarities and differences between RVAS and CVAS and describe a conceptual framework for the design of RVAS. We apply the framework to address key questions about the sample sizes needed to detect association, the relative merits of testing disruptive alleles vs. missense alleles, frequency thresholds for filtering alleles, the value of predictors of the functional impact of missense alleles, the potential utility of isolated populations, the value of gene-set analysis, and the utility of de novo mutations. The optimal design depends critically on the selection coefficient against deleterious alleles and thus varies across genes. The analysis shows that common variant and rare variant studies require similarly large sample collections. In particular, a well-powered RVAS should involve discovery sets with at least 25,000 cases, together with a substantial replication set.

Collaboration


Dive into the Stephen F. Schaffner's collaboration.

Top Co-Authors

Avatar
Top Co-Authors

Avatar
Top Co-Authors

Avatar
Top Co-Authors

Avatar
Top Co-Authors

Avatar
Top Co-Authors

Avatar
Top Co-Authors

Avatar
Top Co-Authors

Avatar
Top Co-Authors

Avatar
Researchain Logo
Decentralizing Knowledge