Molly A. Hall
Pennsylvania State University
Network
Latest external collaboration on country level. Dive into details by clicking on the dots.
Publication
Featured researches published by Molly A. Hall.
PLOS Genetics | 2014
Molly A. Hall; Anurag Verma; Kristin Brown-Gentry; Robert Goodloe; Jonathan Boston; Sarah Wilson; Bob McClellan; Cara Sutcliffe; Holly H. Dilks; Nila B. Gillani; Hailing Jin; Ping Mayo; Melissa Allen; Nathalie Schnetz-Boutaud; Dana C. Crawford; Marylyn D. Ritchie; Sarah A. Pendergrass
We performed a Phenome-wide association study (PheWAS) utilizing diverse genotypic and phenotypic data existing across multiple populations in the National Health and Nutrition Examination Surveys (NHANES), conducted by the Centers for Disease Control and Prevention (CDC), and accessed by the Epidemiological Architecture for Genes Linked to Environment (EAGLE) study. We calculated comprehensive tests of association in Genetic NHANES using 80 SNPs and 1,008 phenotypes (grouped into 184 phenotype classes), stratified by race-ethnicity. Genetic NHANES includes three surveys (NHANES III, 1999–2000, and 2001–2002) and three race-ethnicities: non-Hispanic whites (n = 6,634), non-Hispanic blacks (n = 3,458), and Mexican Americans (n = 3,950). We identified 69 PheWAS associations replicating across surveys for the same SNP, phenotype-class, direction of effect, and race-ethnicity at p<0.01, allele frequency >0.01, and sample size >200. Of these 69 PheWAS associations, 39 replicated previously reported SNP-phenotype associations, 9 were related to previously reported associations, and 21 were novel associations. Fourteen results had the same direction of effect across more than one race-ethnicity: one result was novel, 11 replicated previously reported associations, and two were related to previously reported results. Thirteen SNPs showed evidence of pleiotropy. We further explored results with gene-based biological networks, contrasting the direction of effect for pleiotropic associations across phenotypes. One PheWAS result was ABCG2 missense SNP rs2231142, associated with uric acid levels in both non-Hispanic whites and Mexican Americans, protoporphyrin levels in non-Hispanic whites and Mexican Americans, and blood pressure levels in Mexican Americans. Another example was SNP rs1800588 near LIPC, significantly associated with the novel phenotypes of folate levels (Mexican Americans), vitamin E levels (non-Hispanic whites) and triglyceride levels (non-Hispanic whites), and replication for cholesterol levels. The results of this PheWAS show the utility of this approach for exposing more of the complex genetic architecture underlying multiple traits, through generating novel hypotheses for future research.
Annual Review of Public Health | 2017
Arjun K. Manrai; Yuxia Cui; Pierre R. Bushel; Molly A. Hall; Carolyn J. Mattingly; Marylyn D. Ritchie; Charles Schmitt; D. Sarigiannis; Duncan C. Thomas; David S. Wishart; David M. Balshaw; Chirag Patel
The complexity of the human exposome-the totality of environmental exposures encountered from birth to death-motivates systematic, high-throughput approaches to discover new environmental determinants of disease. In this review, we describe the state of science in analyzing the human exposome and provide recommendations for the public health community to consider in dealing with analytic challenges of exposome-based biomedical research. We describe extant and novel analytic methods needed to associate the exposome with critical health outcomes and contextualize the data-centered challenges by drawing parallels to other research endeavors such as human genomics research. We discuss efforts for training scientists who can bridge public health, genomics, and biomedicine in informatics and statistics. If an exposome data ecosystem is brought to fruition, it will likely play a role as central as genomic science has had in molding the current and new generations of biomedical researchers, computational scientists, and public health research programs.
pacific symposium on biocomputing | 2013
Molly A. Hall; Scott M. Dudek; Robert Goodloe; Dana C. Crawford; Sarah A. Pendergrass; Peggy L. Peissig; Murray H. Brilliant; Catherine A. McCarty; Marylyn D. Ritchie
Environment-wide association studies (EWAS) provide a way to uncover the environmental mechanisms involved in complex traits in a high-throughput manner. Genome-wide association studies have led to the discovery of genetic variants associated with many common diseases but do not take into account the environmental component of complex phenotypes. This EWAS assesses the comprehensive association between environmental variables and the outcome of type 2 diabetes (T2D) in the Marshfield Personalized Medicine Research Project Biobank (Marshfield PMRP). We sought replication in two National Health and Nutrition Examination Surveys (NHANES). The Marshfield PMRP currently uses four tools for measuring environmental exposures and outcome traits: 1) the PhenX Toolkit includes standardized exposure and phenotypic measures across several domains, 2) the Diet History Questionnaire (DHQ) is a food frequency questionnaire, 3) the Measurement of a Persons Habitual Physical Activity scores the level of an individuals physical activity, and 4) electronic health records (EHR) employs validated algorithms to establish T2D case-control status. Using PLATO software, 314 environmental variables were tested for association with T2D using logistic regression, adjusting for sex, age, and BMI in over 2,200 European Americans. When available, similar variables were tested with the same methods and adjustment in samples from NHANES III and NHANES 1999-2002. Twelve and 31 associations were identified in the Marshfield samples at p<0.01 and p<0.05, respectively. Seven and 13 measures replicated in at least one of the NHANES at p<0.01 and p<0.05, respectively, with the same direction of effect. The most significant environmental exposures associated with T2D status included decreased alcohol use as well as increased smoking exposure in childhood and adulthood. The results demonstrate the utility of the EWAS method and survey tools for identifying environmental components of complex diseases like type 2 diabetes. These high-throughput and comprehensive investigation methods can easily be applied to investigate the relation between environmental exposures and multiple phenotypes in future analyses.
Genetic Epidemiology | 2015
Molly A. Hall; Shefali S. Verma; John R. Wallace; Anastasia Lucas; Richard L. Berg; John J. Connolly; Dana C. Crawford; David R. Crosslin; Mariza de Andrade; Kimberly F. Doheny; Jonathan L. Haines; John B. Harley; Gail P. Jarvik; Terrie Kitchner; Helena Kuivaniemi; Eric B. Larson; David Carrell; Gerard Tromp; Tamara R. Vrabec; Sarah A. Pendergrass; Catherine A. McCarty; Marylyn D. Ritchie
Bioinformatics approaches to examine gene‐gene models provide a means to discover interactions between multiple genes that underlie complex disease. Extensive computational demands and adjusting for multiple testing make uncovering genetic interactions a challenge. Here, we address these issues using our knowledge‐driven filtering method, Biofilter, to identify putative single nucleotide polymorphism (SNP) interaction models for cataract susceptibility, thereby reducing the number of models for analysis. Models were evaluated in 3,377 European Americans (1,185 controls, 2,192 cases) from the Marshfield Clinic, a study site of the Electronic Medical Records and Genomics (eMERGE) Network, using logistic regression. All statistically significant models from the Marshfield Clinic were then evaluated in an independent dataset of 4,311 individuals (742 controls, 3,569 cases), using independent samples from additional study sites in the eMERGE Network: Mayo Clinic, Group Health/University of Washington, Vanderbilt University Medical Center, and Geisinger Health System. Eighty‐three SNP‐SNP models replicated in the independent dataset at likelihood ratio test P < 0.05. Among the most significant replicating models was rs12597188 (intron of CDH1)–rs11564445 (intron of CTNNB1). These genes are known to be involved in processes that include: cell‐to‐cell adhesion signaling, cell‐cell junction organization, and cell‐cell communication. Further Biofilter analysis of all replicating models revealed a number of common functions among the genes harboring the 83 replicating SNP‐SNP models, which included signal transduction and PI3K‐Akt signaling pathway. These findings demonstrate the utility of Biofilter as a biology‐driven method, applicable for any genome‐wide association study dataset.
Human Heredity | 2015
Sarah A. Pendergrass; Anurag Verma; Anna Okula; Molly A. Hall; Dana C. Crawford; Marylyn D. Ritchie
The inherent complexity of biological systems can be leveraged for a greater understanding of the impact of genetic architecture on outcomes, traits, and pharmacological response. The genome-wide association study (GWAS) approach has well-developed methods and relatively straight-forward methodologies; however, the bigger picture of the impact of genetic architecture on phenotypic outcome still remains to be elucidated even with an ever-growing number of GWAS performed. Greater consideration of the complexity of biological processes, using more data from the phenome, exposome, and diverse -omic resources, including considering the interplay of pleiotropy and genetic interactions, may provide additional leverage for making the most of the incredible wealth of information available for study. Here, we describe how incorporating greater complexity into analyses through the use of additional phenotypic data and widespread deployment of phenome-wide association studies may provide new insights into genetic factors influencing diseases, traits, and pharmacological response.
Trends in Genetics | 2016
Molly A. Hall; Jason H. Moore; Marylyn D. Ritchie
Genome-wide association studies (GWAS) have identified numerous loci associated with human phenotypes. This approach, however, does not consider the richly diverse and complex environment with which humans interact throughout the life course, nor does it allow for interrelationships between genetic loci and across traits. As we move toward making precision medicine a reality, whereby we make predictions about disease risk based on genomic profiles, we need to identify improved predictive models of the relationship between genome and phenome. Methods that embrace pleiotropy (the effect of one locus on more than one trait), and gene-environment (G×E) and gene-gene (G×G) interactions, will further unveil the impact of alterations in biological pathways and identify genes that are only involved with disease in the context of the environment. This valuable information can be used to assess personal risk and choose the most appropriate medical interventions based on the genotype and environment of an individual, the whole premise of precision medicine.
Nature Communications | 2017
Molly A. Hall; John R. Wallace; Anastasia Lucas; Dokyoon Kim; Anna Okula Basile; Shefali S. Verma; Catherine A. McCarty; Murray H. Brilliant; Peggy L. Peissig; Terrie Kitchner; Anurag Verma; Sarah A. Pendergrass; Scott M. Dudek; Jason H. Moore; Marylyn D. Ritchie
Genome-wide, imputed, sequence, and structural data are now available for exceedingly large sample sizes. The needs for data management, handling population structure and related samples, and performing associations have largely been met. However, the infrastructure to support analyses involving complexity beyond genome-wide association studies is not standardized or centralized. We provide the PLatform for the Analysis, Translation, and Organization of large-scale data (PLATO), a software tool equipped to handle multi-omic data for hundreds of thousands of samples to explore complexity using genetic interactions, environment-wide association studies and gene–environment interactions, phenome-wide association studies, as well as copy number and rare variant analyses. Using the data from the Marshfield Personalized Medicine Research Project, a site in the electronic Medical Records and Genomics Network, we apply each feature of PLATO to type 2 diabetes and demonstrate how PLATO can be used to uncover the complex etiology of common traits.Centralized infrastructure to support analyses involving complexity beyond genome-wide association studies is broadly needed. Here, Ritchie and colleagues develop PLATO, a software tool to process and integrate various methods for this task.
Autism Research | 2017
Dokyoon Kim; Heather E. Volk; Santhosh Girirajan; Sarah A. Pendergrass; Molly A. Hall; Shefali S. Verma; Rebecca J. Schmidt; Robin L. Hansen; Debashis Ghosh; Yunin Ludena-Rodriguez; Kyoungmi Kim; Marylyn D. Ritchie; Irva Hertz-Picciotto; Scott B. Selleck
Autism spectrum disorder is a complex trait with a high degree of heritability as well as documented susceptibility from environmental factors. In this study the contributions of copy number variation, exposure to air pollutants, and the interaction between the two on autism risk, were evaluated in the population‐based case‐control Childhood Autism Risks from Genetics and Environment (CHARGE) Study. For the current investigation, we included only those CHARGE children (a) who met criteria for autism or typical development and (b) for whom our team had conducted both genetic evaluation of copy number burden and determination of environmental air pollution exposures based on mapping addresses from the pregnancy and early childhood. This sample consisted of 158 cases of children with autism and 147 controls with typical development. Multiple logistic regression models were fit with and without environmental variable‐copy number burden interactions. We found no correlation between average air pollution exposure from conception to age 2 years and the childs CNV burden. We found a significant interaction in which a 1SD increase in duplication burden combined with a 1SD increase in ozone exposure was associated with an elevated autism risk (OR 3.4, P < 0.005) much greater than the increased risks associated with either genomic duplication (OR 1.85, 95% CI 1.25–2.73) or ozone (OR 1.20, 95% CI 0.93–1.54) alone. Similar results were obtained when CNV and ozone were dichotomized to compare those in the top quartile relative to those having a smaller CNV burden and lower exposure to ozone, and when exposures were assessed separately for pregnancy, the first year of life, and the second year of life. No interactions were observed for other air pollutants, even those that demonstrated main effects; ozone tends to be negatively correlated with the other pollutants examined. While earlier work has demonstrated interactions between the presence of a pathogenic CNV and an environmental exposure [Webb et al., 2016], these findings appear to be the first indication that global copy number variation may increase susceptibility to certain environmental factors, and underscore the need to consider both genomics and environmental exposures as well as the mechanisms by which each may amplify the risks for autism associated with the other. Autism Res 2017, 10: 1470–1480.
Proceedings of the Pacific Symposium | 2018
Elisabetta Manduchi; Alessandra Chesi; Molly A. Hall; Struan F. A. Grant; Jason H. Moore
We utilized evidence for enhancer-promoter interactions from functional genomics data in order to build biological filters to narrow down the search space for two-way Single Nucleotide Polymorphism (SNP) interactions in Type 2 Diabetes (T2D) Genome Wide Association Studies (GWAS). This has led us to the identification of a reproducible statistically significant SNP pair associated with T2D. As more functional genomics data are being generated that can help identify potentially interacting enhancer-promoter pairs in larger collection of tissues/cells, this approach has implications for investigation of epistasis from GWAS in general.
Archive | 2016
Molly A. Hall; Brian S. Cole; Jason H. Moore
Association methods in the biomedical sciences often focus on one genetic locus at a time. Alternately, studying gene-gene interactions, or epistasis, allows discovery of genetic regions that modulate the effect of other loci. With precision medicine as the goal of biomedical research, genetic interactions are essential to uncovering the still unknown complex mechanisms which lead to common diseases. This article delves into the fundamental background of genetic interactions, a history of this field of study, major challenges to identifying gene-gene models, and common methods to overcome these challenges. Finally, further complexity beyond genetic interactions will be considered.