Amy Williams
Cornell University
Network
Latest external collaboration on country level. Dive into details by clicking on the dots.
Publication
Featured researches published by Amy Williams.
JAMA | 2014
Karol Estrada; Ingvild Aukrust; Lise Bjørkhaug; Noël P. Burtt; Josep M. Mercader; Humberto García-Ortiz; Alicia Huerta-Chagoya; Hortensia Moreno-Macías; Geoffrey A. Walford; Jason Flannick; Amy Williams; María J. Gómez-Vázquez; Juan Carlos Fernández-López; Angélica Martínez-Hernández; Silvia Jiménez-Morales; Federico Centeno-Cruz; Elvia Mendoza-Caamal; Cristina Revilla-Monsalve; Sergio Islas-Andrade; Emilio J. Córdova; Xavier Soberón; María Elena González-Villalpando; E. Henderson; Lynne R. Wilkens; Loic Le Marchand; Olimpia Arellano-Campos; María Luisa Ordóñez-Sánchez; Maribel Rodríguez-Torres; Rosario Rodríguez-Guillén; Laura Riba
IMPORTANCE Latino populations have one of the highest prevalences of type 2 diabetes worldwide. OBJECTIVES To investigate the association between rare protein-coding genetic variants and prevalence of type 2 diabetes in a large Latino population and to explore potential molecular and physiological mechanisms for the observed relationships. DESIGN, SETTING, AND PARTICIPANTS Whole-exome sequencing was performed on DNA samples from 3756 Mexican and US Latino individuals (1794 with type 2 diabetes and 1962 without diabetes) recruited from 1993 to 2013. One variant was further tested for allele frequency and association with type 2 diabetes in large multiethnic data sets of 14,276 participants and characterized in experimental assays. MAIN OUTCOME AND MEASURES Prevalence of type 2 diabetes. Secondary outcomes included age of onset, body mass index, and effect on protein function. RESULTS A single rare missense variant (c.1522G>A [p.E508K]) was associated with type 2 diabetes prevalence (odds ratio [OR], 5.48; 95% CI, 2.83-10.61; P = 4.4 × 10(-7)) in hepatocyte nuclear factor 1-α (HNF1A), the gene responsible for maturity onset diabetes of the young type 3 (MODY3). This variant was observed in 0.36% of participants without type 2 diabetes and 2.1% of participants with it. In multiethnic replication data sets, the p.E508K variant was seen only in Latino patients (n = 1443 with type 2 diabetes and 1673 without it) and was associated with type 2 diabetes (OR, 4.16; 95% CI, 1.75-9.92; P = .0013). In experimental assays, HNF-1A protein encoding the p.E508K mutant demonstrated reduced transactivation activity of its target promoter compared with a wild-type protein. In our data, carriers and noncarriers of the p.E508K mutation with type 2 diabetes had no significant differences in compared clinical characteristics, including age at onset. The mean (SD) age for carriers was 45.3 years (11.2) vs 47.5 years (11.5) for noncarriers (P = .49) and the mean (SD) BMI for carriers was 28.2 (5.5) vs 29.3 (5.3) for noncarriers (P = .19). CONCLUSIONS AND RELEVANCE Using whole-exome sequencing, we identified a single low-frequency variant in the MODY3-causing gene HNF1A that is associated with type 2 diabetes in Latino populations and may affect protein function. This finding may have implications for screening and therapeutic modification in this population, but additional studies are required.
American Journal of Human Genetics | 2012
Amy Williams; Nick Patterson; Joseph T. Glessner; Hakon Hakonarson; David Reich
Haplotypes are an important resource for a large number of applications in human genetics, but computationally inferred haplotypes are subject to switch errors that decrease their utility. The accuracy of computationally inferred haplotypes increases with sample size, and although ever larger genotypic data sets are being generated, the fact that existing methods require substantial computational resources limits their applicability to data sets containing tens or hundreds of thousands of samples. Here, we present HAPI-UR (haplotype inference for unrelated samples), an algorithm that is designed to handle unrelated and/or trio and duo family data, that has accuracy comparable to or greater than existing methods, and that is computationally efficient and can be applied to 100,000 samples or more. We use HAPI-UR to phase a data set with 58,207 samples and show that it achieves practical runtime and that switch errors decrease with sample size even with the use of samples from multiple ethnicities. Using a data set with 16,353 samples, we compare HAPI-UR to Beagle, MaCH, IMPUTE2, and SHAPEIT and show that HAPI-UR runs 18× faster than all methods and has a lower switch-error rate than do other methods except for Beagle; with the use of consensus phasing, running HAPI-UR three times gives a slightly lower switch-error rate than Beagle does and is more than six times faster. We demonstrate results similar to those from Beagle on another data set with a higher marker density. Lastly, we show that HAPI-UR has better runtime scaling properties than does Beagle so that for larger data sets, HAPI-UR will be practical and will have an even larger runtime advantage. HAPI-UR is available online (see Web Resources).
Journal of Graphics Tools | 2005
Amy Williams; Steve Barrus; R. Keith Morley; Peter Shirley
The computational bottleneck in a ray tracer using bounding volume hierarchies is often the ray intersection routine with axis-aligned bounding boxes. We describe a version of this routine that uses IEEE numerical properties to ensure that those tests are both robust and efficient. Sample source code is available online.
eLife | 2015
Amy Williams; Giulio Genovese; Thomas D. Dyer; Nicolas Altemose; Katherine Truax; Goo Jun; Nick Patterson; Simon Myers; Joanne E. Curran; Ravindranath Duggirala; John Blangero; David Reich; Molly Przeworski
Although the past decade has seen tremendous progress in our understanding of fine-scale recombination, little is known about non-crossover (NCO) gene conversion. We report the first genome-wide study of NCO events in humans. Using SNP array data from 98 meioses, we identified 103 sites affected by NCO, of which 50/52 were confirmed in sequence data. Overlap with double strand break (DSB) hotspots indicates that most of the events are likely of meiotic origin. We estimate that a site is involved in a NCO at a rate of 5.9 × 10−6/bp/generation, consistent with sperm-typing studies, and infer that tract lengths span at least an order of magnitude. Observed NCO events show strong allelic bias at heterozygous AT/GC SNPs, with 68% (58–78%) transmitting GC alleles (p = 5 × 10−4). Strikingly, in 4 of 15 regions with resequencing data, multiple disjoint NCO tracts cluster in close proximity (∼20–30 kb), a phenomenon not previously seen in mammals. DOI: http://dx.doi.org/10.7554/eLife.04637.001
Diabetes | 2015
Nicholette D. Palmer; Mark O. Goodarzi; Carl D. Langefeld; Nan Wang; Xiuqing Guo; Kent D. Taylor; Tasha E. Fingerlin; Jill M. Norris; Thomas A. Buchanan; Anny H. Xiang; Talin Haritunians; Julie T. Ziegler; Adrienne H. Williams; Darko Stefanovski; Jinrui Cui; Adrienne MacKay; Leora Henkin; Richard N. Bergman; Xiaoyi Gao; James Gauderman; Rohit Varma; Craig L. Hanis; Nancy J. Cox; Heather M Highland; Jennifer E. Below; Amy Williams; Noël P. Burtt; Carlos A. Aguilar-Salinas; Alicia Huerta-Chagoya; Clicerio González-Villalpando
Insulin sensitivity, insulin secretion, insulin clearance, and glucose effectiveness exhibit strong genetic components, although few studies have examined their genetic architecture or influence on type 2 diabetes (T2D) risk. We hypothesized that loci affecting variation in these quantitative traits influence T2D. We completed a multicohort genome-wide association study to search for loci influencing T2D-related quantitative traits in 4,176 Mexican Americans. Quantitative traits were measured by the frequently sampled intravenous glucose tolerance test (four cohorts) or euglycemic clamp (three cohorts), and random-effects models were used to test the association between loci and quantitative traits, adjusting for age, sex, and admixture proportions (Discovery). Analysis revealed a significant (P < 5.00 × 10−8) association at 11q14.3 (MTNR1B) with acute insulin response. Loci with P < 0.0001 among the quantitative traits were examined for translation to T2D risk in 6,463 T2D case and 9,232 control subjects of Mexican ancestry (Translation). Nonparametric meta-analysis of the Discovery and Translation cohorts identified significant associations at 6p24 (SLC35B3/TFAP2A) with glucose effectiveness/T2D, 11p15 (KCNQ1) with disposition index/T2D, and 6p22 (CDKAL1) and 11q14 (MTNR1B) with acute insulin response/T2D. These results suggest that T2D and insulin secretion and sensitivity have both shared and distinct genetic factors, potentially delineating genomic components of these quantitative traits that drive the risk for T2D.
international conference on computer graphics and interactive techniques | 2005
Amy Williams; Steve Barrus; R. Keith Morley; Peter Shirley
The computational bottleneck in a ray tracer using bounding volume hierarchies is often the ray intersection routine with axis-aligned bounding boxes. We describe a version of this routine that uses IEEE numerical properties to ensure that those tests are both robust and efficient. Sample source code is available online.
Genome Biology | 2010
Amy Williams; David E. Housman; Martin C. Rinard; David K. Gifford
Hapi is a new dynamic programming algorithm that ignores uninformative states and state transitions in order to efficiently compute minimum-recombinant and maximum likelihood haplotypes. When applied to a dataset containing 103 families, Hapi performs 3.8 and 320 times faster than state-of-the-art algorithms. Because Hapi infers both minimum-recombinant and maximum likelihood haplotypes and applies to related individuals, the haplotypes it infers are highly accurate over extended genomic distances.
Diabetes | 2017
Josep M. Mercader; Rachel G. Liao; Avery Davis; Zachary Dymek; Karol Estrada; Taru Tukiainen; Alicia Huerta-Chagoya; Hortensia Moreno-Macías; Kathleen A. Jablonski; Robert L. Hanson; Geoffrey A. Walford; Ignasi Moran; Ling Chen; Vineeta Agarwala; María Luisa Ordóñez-Sánchez; Rosario Rodríguez-Guillén; Maribel Rodríguez-Torres; Yayoi Segura-Kato; Humberto García-Ortiz; Federico Centeno-Cruz; Francisco Martin Barajas-Olmos; Lizz Caulkins; Sobha Puppala; Pierre Fontanillas; Amy Williams; Sílvia Bonàs-Guarch; Chris Hartl; Stephan Ripke; Katherine Tooley; Jacqueline M. Lane
Type 2 diabetes (T2D) affects more than 415 million people worldwide, and its costs to the health care system continue to rise. To identify common or rare genetic variation with potential therapeutic implications for T2D, we analyzed and replicated genome-wide protein coding variation in a total of 8,227 individuals with T2D and 12,966 individuals without T2D of Latino descent. We identified a novel genetic variant in the IGF2 gene associated with ∼20% reduced risk for T2D. This variant, which has an allele frequency of 17% in the Mexican population but is rare in Europe, prevents splicing between IGF2 exons 1 and 2. We show in vitro and in human liver and adipose tissue that the variant is associated with a specific, allele-dosage–dependent reduction in the expression of IGF2 isoform 2. In individuals who do not carry the protective allele, expression of IGF2 isoform 2 in adipose is positively correlated with both incidence of T2D and increased plasma glycated hemoglobin in individuals without T2D, providing support that the protective effects are mediated by reductions in IGF2 isoform 2. Broad phenotypic examination of carriers of the protective variant revealed no association with other disease states or impaired reproductive health. These findings suggest that reducing IGF2 isoform 2 expression in relevant tissues has potential as a new therapeutic strategy for T2D, even beyond the Latin American population, with no major adverse effects on health or reproduction.
Genetics | 2017
Monica D. Ramstetter; Thomas D. Dyer; Donna M. Lehman; Joanne E. Curran; Ravindranath Duggirala; John Blangero; Jason G. Mezey; Amy Williams
Relatedness inference is an essential component of many genetic analyses and popular in consumer genetic testing. Ramstetter et al. evaluate twelve..... Inferring relatedness from genomic data is an essential component of genetic association studies, population genetics, forensics, and genealogy. While numerous methods exist for inferring relatedness, thorough evaluation of these approaches in real data has been lacking. Here, we report an assessment of 12 state-of-the-art pairwise relatedness inference methods using a data set with 2485 individuals contained in several large pedigrees that span up to six generations. We find that all methods have high accuracy (92–99%) when detecting first- and second-degree relationships, but their accuracy dwindles to <43% for seventh-degree relationships. However, most identical by descent (IBD) segment-based methods inferred seventh-degree relatives correct to within one relatedness degree for >76% of relative pairs. Overall, the most accurate methods are Estimation of Recent Shared Ancestry (ERSA) and approaches that compute total IBD sharing using the output from GERMLINE and Refined IBD to infer relatedness. Combining information from the most accurate methods provides little accuracy improvement, indicating that novel approaches, such as new methods that leverage relatedness signals from multiple samples, are needed to achieve a sizeable jump in performance.
bioRxiv | 2017
Monica D. Ramstetter; Thomas D. Dyer; Donna M. Lehman; Joanne E. Curran; Ravindranath Duggirala; John Blangero; Jason G. Mezey; Amy Williams
Inferring relatedness from genomic data is an essential component of genetic association studies, population genetics, forensics, and genealogy. While numerous methods exist for inferring relatedness, thorough evaluation of these methods in real data has been lacking. Here, we report an assessment of 11 state-of-the-art relatedness inference methods using a dataset with 2,485 individuals contained in several large pedigrees that span up to six generations. We nd that all methods have high accuracy (~93% – 99%) when reporting first and second degree relationships, but their accuracy dwindles to less than 60% for fifth degree relationships. However, the inferred relationships were correct to within one relatedness degree at a rate of 83% – 99% across all methods and considered relationship degrees. Furthermore, most methods infer unrelated individuals correctly at a rate of ~99%, suggesting a low rate of false positives. Overall, the most accurate methods were ERSA 2.0 and approaches that classify relationships using the IBD segments inferred by Refined IBD and IBDseq. Combining results from the most accurate methods provides little accuracy improvement, indicating that novel approaches for relatedness inference may be needed to achieve a sizeable jump in performance.
Collaboration
Dive into the Amy Williams's collaboration.
University of Texas Health Science Center at San Antonio
View shared research outputs