Paul Scheet
University of Texas MD Anderson Cancer Center
Network
Latest external collaboration on country level. Dive into details by clicking on the dots.
Publication
Featured researches published by Paul Scheet.
Genetic Epidemiology | 2010
Yun Li; Cristen J. Willer; Jun Ding; Paul Scheet; Gonçalo R. Abecasis
Genome‐wide association studies (GWAS) can identify common alleles that contribute to complex disease susceptibility. Despite the large number of SNPs assessed in each study, the effects of most common SNPs must be evaluated indirectly using either genotyped markers or haplotypes thereof as proxies. We have previously implemented a computationally efficient Markov Chain framework for genotype imputation and haplotyping in the freely available MaCH software package. The approach describes sampled chromosomes as mosaics of each other and uses available genotype and shotgun sequence data to estimate unobserved genotypes and haplotypes, together with useful measures of the quality of these estimates. Our approach is already widely used to facilitate comparison of results across studies as well as meta‐analyses of GWAS. Here, we use simulations and experimental genotypes to evaluate its accuracy and utility, considering choices of genotyping panels, reference panel configurations, and designs where genotyping is replaced with shotgun sequencing. Importantly, we show that genotype imputation not only facilitates cross study analyses but also increases power of genetic association studies. We show that genotype imputation of common variants using HapMap haplotypes as a reference is very accurate using either genome‐wide SNP data or smaller amounts of data typical in fine‐mapping studies. Furthermore, we show the approach is applicable in a variety of populations. Finally, we illustrate how association analyses of unobserved variants will benefit from ongoing advances such as larger HapMap reference panels and whole genome shotgun sequencing technologies. Genet. Epidemiol. 34: 816‐834, 2010.
Nature Genetics | 2008
Cristen J. Willer; Serena Sanna; Anne U. Jackson; Angelo Scuteri; Lori L. Bonnycastle; Robert Clarke; Simon Heath; Nicholas J. Timpson; Samer S. Najjar; Heather M. Stringham; James B. Strait; William L. Duren; Andrea Maschio; Fabio Busonero; Antonella Mulas; Giuseppe Albai; Amy J. Swift; Mario A. Morken; Derrick Bennett; Sarah Parish; Haiqing Shen; Pilar Galan; Pierre Meneton; Serge Hercberg; Diana Zelenika; Wei-Min Chen; Yun Li; Laura J. Scott; Paul Scheet; Jouko Sundvall
To identify genetic variants influencing plasma lipid concentrations, we first used genotype imputation and meta-analysis to combine three genome-wide scans totaling 8,816 individuals and comprising 6,068 individuals specific to our study (1,874 individuals from the FUSION study of type 2 diabetes and 4,184 individuals from the SardiNIA study of aging-associated variables) and 2,758 individuals from the Diabetes Genetics Initiative, reported in a companion study in this issue. We subsequently examined promising signals in 11,569 additional individuals. Overall, we identify strongly associated variants in eleven loci previously implicated in lipid metabolism (ABCA1, the APOA5-APOA4-APOC3-APOA1 and APOE-APOC clusters, APOB, CETP, GCKR, LDLR, LPL, LIPC, LIPG and PCSK9) and also in several newly identified loci (near MVK-MMAB and GALNT2, with variants primarily associated with high-density lipoprotein (HDL) cholesterol; near SORT1, with variants primarily associated with low-density lipoprotein (LDL) cholesterol; near TRIB1, MLXIPL and ANGPTL3, with variants primarily associated with triglycerides; and a locus encompassing several genes near NCAN, with variants strongly associated with both triglycerides and LDL cholesterol). Notably, the 11 independent variants associated with increased LDL cholesterol concentrations in our study also showed increased frequency in a sample of coronary artery disease cases versus controls.
Nature Genetics | 2009
Sekar Kathiresan; Cristen J. Willer; Gina M. Peloso; Serkalem Demissie; Kiran Musunuru; Eric E. Schadt; Lee M. Kaplan; Derrick Bennett; Yun Li; Toshiko Tanaka; Benjamin F. Voight; Lori L. Bonnycastle; Anne U. Jackson; Gabriel Crawford; Aarti Surti; Candace Guiducci; Noël P. Burtt; Sarah Parish; Robert Clarke; Diana Zelenika; Kari Kubalanza; Mario A. Morken; Laura J. Scott; Heather M. Stringham; Pilar Galan; Amy J. Swift; Johanna Kuusisto; Richard N. Bergman; Jouko Sundvall; Markku Laakso
Blood low-density lipoprotein (LDL) cholesterol, high-density lipoprotein (HDL) cholesterol and triglyceride levels are risk factors for cardiovascular disease. To dissect the polygenic basis of these traits, we conducted genome-wide association screens in 19,840 individuals and replication in up to 20,623 individuals. We identified 30 distinct loci associated with lipoprotein concentrations (each with P < 5 × 10−8), including 11 loci that reached genome-wide significance for the first time. The 11 newly defined loci include common variants associated with LDL cholesterol near ABCG8, MAFB, HNF1A and TIMD4; with HDL cholesterol near ANGPTL4, FADS1-FADS2-FADS3, HNF4A, LCAT, PLTP and TTC39B; and with triglycerides near AMAC1L2, FADS1-FADS2-FADS3 and PLTP. The proportion of individuals exceeding clinical cut points for high LDL cholesterol, low HDL cholesterol and high triglycerides varied according to an allelic dosage score (P < 10−15 for each trend). These results suggest that the cumulative effect of multiple common variants contributes to polygenic dyslipidemia.
Nature | 2008
Mattias Jakobsson; Sonja W. Scholz; Paul Scheet; J. Raphael Gibbs; Jenna M. VanLiere; Hon Chung Fung; Zachary A. Szpiech; James H. Degnan; Kai Wang; Rita Guerreiro; Jose Bras; Jennifer C. Schymick; Dena Hernandez; Bryan J. Traynor; Javier Simón-Sánchez; Mar Matarin; Angela Britton; Joyce van de Leemput; Ian Rafferty; Maja Bucan; Howard M. Cann; John Hardy; Noah A. Rosenberg; Andrew Singleton
Genome-wide patterns of variation across individuals provide a powerful source of data for uncovering the history of migration, range expansion, and adaptation of the human species. However, high-resolution surveys of variation in genotype, haplotype and copy number have generally focused on a small number of population groups. Here we report the analysis of high-quality genotypes at 525,910 single-nucleotide polymorphisms (SNPs) and 396 copy-number-variable loci in a worldwide sample of 29 populations. Analysis of SNP genotypes yields strongly supported fine-scale inferences about population structure. Increasing linkage disequilibrium is observed with increasing geographic distance from Africa, as expected under a serial founder effect for the out-of-Africa spread of human populations. New approaches for haplotype analysis produce inferences about population structure that complement results based on unphased SNPs. Despite a difference from SNPs in the frequency spectrum of the copy-number variants (CNVs) detected—including a comparatively large number of CNVs in previously unexamined populations from Oceania and the Americas—the global distribution of CNVs largely accords with population structure analyses for SNP data sets of similar size. Our results produce new inferences about inter-population variation, support the utility of CNVs in human population-genetic research, and serve as a genomic resource for human-genetic studies in diverse worldwide populations.
PLOS Medicine | 2009
Julian Little; Julian P. T. Higgins; John P. A. Ioannidis; David Moher; Erik von Elm; Muin J. Khoury; Barbara Cohen; George Davey-Smith; Jeremy Grimshaw; Paul Scheet; Marta Gwinn; Robin E. Williamson; Guang Yong Zou; Kim Hutchings; Candice Y. Johnson; Valerie Tait; Miriam Wiens; Jean Golding; Cornelia V. van Duijn; John R. McLaughlin; Andrew D. Paterson; George Wells; Isabel Fortier; Matthew L. Freedman; Maja Zecevic; Richard A. King; Claire Infante-Rivard; Alex Stewart; Nick Birkett
Julian Little and colleagues present the STREGA recommendations, which are aimed at improving the reporting of genetic association studies.
Nature | 2014
Yong Wang; Jill Waters; Marco L. Leung; Anna K. Unruh; Whijae Roh; Xiuqing Shi; Ken Chen; Paul Scheet; Selina Vattathil; Han Liang; Asha S. Multani; Hong Zhang; Rui Zhao; Franziska Michor; Funda Meric-Bernstam; Nicholas Navin
Sequencing studies of breast tumour cohorts have identified many prevalent mutations, but provide limited insight into the genomic diversity within tumours. Here we developed a whole-genome and exome single cell sequencing approach called nuc-seq that uses G2/M nuclei to achieve 91% mean coverage breadth. We applied this method to sequence single normal and tumour nuclei from an oestrogen-receptor-positive (ER+) breast cancer and a triple-negative ductal carcinoma. In parallel, we performed single nuclei copy number profiling. Our data show that aneuploid rearrangements occurred early in tumour evolution and remained highly stable as the tumour masses clonally expanded. In contrast, point mutations evolved gradually, generating extensive clonal diversity. Using targeted single-molecule sequencing, many of the diverse mutations were shown to occur at low frequencies (<10%) in the tumour mass. Using mathematical modelling we found that the triple-negative tumour cells had an increased mutation rate (13.3×), whereas the ER+ tumour cells did not. These findings have important implications for the diagnosis, therapeutic treatment and evolution of chemoresistance in breast cancer.
Nature Genetics | 2011
Jun Yang; Cheng Cheng; Meenakshi Devidas; Xueyuan Cao; Yiping Fan; Dario Campana; Wenjian Yang; Geoff Neale; Nancy J. Cox; Paul Scheet; Michael J. Borowitz; Naomi J. Winick; Paul L. Martin; Cheryl L. Willman; W. Paul Bowman; Bruce M. Camitta; Andrew J. Carroll; Gregory H. Reaman; William L. Carroll; Mignon L. Loh; Stephen P. Hunger; Ching-Hon Pui; William E. Evans; Mary V. Relling
Although five-year survival rates for childhood acute lymphoblastic leukemia (ALL) are now over 80% in most industrialized countries, not all children have benefited equally from this progress. Ethnic differences in survival after childhood ALL have been reported in many clinical studies, with poorer survival observed among African Americans or those with Hispanic ethnicity when compared with European Americans or Asians. The causes of ethnic differences remain uncertain, although both genetic and non-genetic factors are likely important. Interrogating genome-wide germline SNP genotypes in an unselected large cohort of children with ALL, we observed that the component of genomic variation that co-segregated with Native American ancestry was associated with risk of relapse (P = 0.0029) even after adjusting for known prognostic factors (P = 0.017). Ancestry-related differences in relapse risk were abrogated by the addition of a single extra phase of chemotherapy, indicating that modifications to therapy can mitigate the ancestry-related risk of relapse.
American Journal of Human Genetics | 2009
Toshiko Tanaka; Paul Scheet; Betti Giusti; Stefania Bandinelli; Maria Grazia Piras; Gianluca Usala; Sandra Lai; Antonella Mulas; Anna Maria Corsi; Anna Vestrini; Francesco Sofi; Anna Maria Gori; Rosanna Abbate; Jack M. Guralnik; Andrew Singleton; Gonçalo R. Abecasis; David Schlessinger; Manuela Uda; Luigi Ferrucci
The B vitamins are components of one-carbon metabolism (OCM) that contribute to DNA synthesis and methylation. Homocysteine, a by-product of OCM, has been associated with coronary heart disease, stroke and neurological disease. To investigate genetic factors that affect circulating vitamin B6, vitamin B12, folate and homocysteine, a genome-wide association analysis was conducted in the InCHIANTI (N = 1175), SardiNIA (N = 1115), and BLSA (N = 640) studies. The top loci were replicated in an independent sample of 687 participants in the Progetto Nutrizione study. Polymorphisms in the ALPL gene (rs4654748, p = 8.30 x 10(-18)) were associated with vitamin B6 and FUT2 (rs602662, [corrected] p = 2.83 x 10(-20)) with vitamin B12 serum levels. The association of MTHFR, a gene consistently associated with homocysteine, was confirmed in this meta-analysis. The ALPL gene likely influences the catabolism of vitamin B6 while FUT2 interferes with absorption of vitamin B12. These findings highlight mechanisms that affect vitamin B6, vitamin B12 and homocysteine serum levels.
Nature Genetics | 2006
Matthew Stephens; James S. Sloan; Peggy D. Robertson; Paul Scheet; Deborah A. Nickerson
The detection of sequence variation, for which DNA sequencing has emerged as the most sensitive and automated approach, forms the basis of all genetic analysis. Here we describe and illustrate an algorithm that accurately detects and genotypes SNPs from fluorescence-based sequence data. Because the algorithm focuses particularly on detecting SNPs through the identification of heterozygous individuals, it is especially well suited to the detection of SNPs in diploid samples obtained after DNA amplification. It is substantially more accurate than existing approaches and, notably, provides a useful quantitative measure of its confidence in each potential SNP detected and in each genotype called. Calls assigned the highest confidence are sufficiently reliable to remove the need for manual review in several contexts. For example, for sequence data from 47–90 individuals sequenced on both the forward and reverse strands, the highest-confidence calls from our algorithm detected 93% of all SNPs and 100% of high-frequency SNPs, with no false positive SNPs identified and 99.9% genotyping accuracy. This algorithm is implemented in a software package, PolyPhred version 5.0, which is freely available for academic use.
European Journal of Clinical Investigation | 2009
Julian Little; Julian P. T. Higgins; John P. A. Ioannidis; David Moher; Erik von Elm; Muin J. Khoury; Barbara Cohen; George Davey-Smith; Jeremy Grimshaw; Paul Scheet; Marta Gwinn; Robin E. Williamson; Guang Yong Zou; Kim Hutchings; Candice Y. Johnson; Valerie Tait; Miriam Wiens; Jean Golding; Cornelia van Duijn; John R. McLaughlin; Andrew D. Paterson; George Wells; Isabel Fortier; Matthew L. Freedman; Maja Zecevic; Richard A. King; Claire Infante-Rivard; Alex Stewart; Nick Birkett
Making sense of rapidly evolving evidence on genetic associations is crucial to making genuine advances in human genomics and the eventual integration of this information in the practice of medicine and public health. Assessment of the strengths and weaknesses of this evidence, and hence the ability to synthesize it, has been limited by inadequate reporting of results. The STrengthening the REporting of Genetic Association studies (STREGA) initiative builds on the STrengthening the Reporting of OBservational Studies in Epidemiology (STROBE) Statement and provides additions to 12 of the 22 items on the STROBE checklist. The additions concern population stratification, genotyping errors, modelling haplotype variation, Hardy–Weinberg equilibrium, replication, selection of participants, rationale for choice of genes and variants, treatment effects in studying quantitative traits, statistical methods, relatedness, reporting of descriptive and outcome data and the volume of data issues that are important to consider in genetic association studies. The STREGA recommendations do not prescribe or dictate how a genetic association study should be designed, but seek to enhance the transparency of its reporting, regardless of choices made during design, conduct or analysis.