Network


Latest external collaboration on country level. Dive into details by clicking on the dots.

Hotspot


Dive into the research topics where Caitlin P. McHugh is active.

Publication


Featured researches published by Caitlin P. McHugh.


Nature Genetics | 2012

Detectable clonal mosaicism from birth to old age and its relationship to cancer

Cathy C. Laurie; Cecelia A. Laurie; Kenneth Rice; Kimberly F. Doheny; Leila R. Zelnick; Caitlin P. McHugh; Hua Ling; Kurt N. Hetrick; Elizabeth W. Pugh; Christopher I. Amos; Qingyi Wei; Li-E Wang; Jeffrey E. Lee; Kathleen C. Barnes; Nadia N. Hansel; Rasika A. Mathias; Denise Daley; Terri H. Beaty; Alan F. Scott; Ingo Ruczinski; Rob Scharpf; Laura J. Bierut; Sarah M. Hartz; Maria Teresa Landi; Neal D. Freedman; Lynn R. Goldin; David Ginsburg; Jun-Jun Li; Karl C. Desch; Sara S. Strom

We detected clonal mosaicism for large chromosomal anomalies (duplications, deletions and uniparental disomy) using SNP microarray data from over 50,000 subjects recruited for genome-wide association studies. This detection method requires a relatively high frequency of cells with the same abnormal karyotype (>5–10%; presumably of clonal origin) in the presence of normal cells. The frequency of detectable clonal mosaicism in peripheral blood is low (<0.5%) from birth until 50 years of age, after which it rapidly rises to 2–3% in the elderly. Many of the mosaic anomalies are characteristic of those found in hematological cancers and identify common deleted regions with genes previously associated with these cancers. Although only 3% of subjects with detectable clonal mosaicism had any record of hematological cancer before DNA sampling, those without a previous diagnosis have an estimated tenfold higher risk of a subsequent hematological cancer (95% confidence interval = 6–18).


Genetic Epidemiology | 2010

Quality control and quality assurance in genotypic data for genome-wide association studies

Cathy C. Laurie; Kimberly F. Doheny; Daniel B. Mirel; Elizabeth W. Pugh; Laura J. Bierut; Tushar Bhangale; Frederick Boehm; Neil E. Caporaso; Marilyn C. Cornelis; Howard J. Edenberg; Stacy B. Gabriel; Emily L. Harris; Frank B. Hu; Kevin B. Jacobs; Peter Kraft; Maria Teresa Landi; Thomas Lumley; Teri A. Manolio; Caitlin P. McHugh; Ian Painter; Justin Paschall; John P. Rice; Kenneth Rice; Xiuwen Zheng; Bruce S. Weir

Genome‐wide scans of nucleotide variation in human subjects are providing an increasing number of replicated associations with complex disease traits. Most of the variants detected have small effects and, collectively, they account for a small fraction of the total genetic variance. Very large sample sizes are required to identify and validate findings. In this situation, even small sources of systematic or random error can cause spurious results or obscure real effects. The need for careful attention to data quality has been appreciated for some time in this field, and a number of strategies for quality control and quality assurance (QC/QA) have been developed. Here we extend these methods and describe a system of QC/QA for genotypic data in genome‐wide association studies (GWAS). This system includes some new approaches that (1) combine analysis of allelic probe intensities and called genotypes to distinguish gender misidentification from sex chromosome aberrations, (2) detect autosomal chromosome aberrations that may affect genotype calling accuracy, (3) infer DNA sample quality from relatedness and allelic intensities, (4) use duplicate concordance to infer SNP quality, (5) detect genotyping artifacts from dependence of Hardy‐Weinberg equilibrium test P‐values on allelic frequency, and (6) demonstrate sensitivity of principal components analysis to SNP selection. The methods are illustrated with examples from the “Gene Environment Association Studies” (GENEVA) program. The results suggest several recommendations for QC/QA in the design and execution of GWAS. Genet. Epidemiol. 34: 591–602, 2010.


Human Molecular Genetics | 2011

Genome-wide association study identifies novel loci predisposing to cutaneous melanoma

Christopher I. Amos; Li-E Wang; Jeffrey E. Lee; Jeffrey E. Gershenwald; Wei Chen; Shenying Fang; Roman Kosoy; Mingfeng Zhang; Abrar A. Qureshi; Selina Vattathil; Christopher W. Schacherer; Julie M. Gardner; Yuling Wang; D. Tim Bishop; Jennifer H. Barrett; Stuart Macgregor; Nicholas K. Hayward; Nicholas G. Martin; David L. Duffy; Graham J. Mann; Anne E. Cust; John L. Hopper; Kevin M. Brown; Elizabeth A. Grimm; Yaji Xu; Younghun Han; Kaiyan Jing; Caitlin P. McHugh; Cathy C. Laurie; Kim Doheny

We performed a multistage genome-wide association study of melanoma. In a discovery cohort of 1804 melanoma cases and 1026 controls, we identified loci at chromosomes 15q13.1 (HERC2/OCA2 region) and 16q24.3 (MC1R) regions that reached genome-wide significance within this study and also found strong evidence for genetic effects on susceptibility to melanoma from markers on chromosome 9p21.3 in the p16/ARF region and on chromosome 1q21.3 (ARNT/LASS2/ANXA9 region). The most significant single-nucleotide polymorphisms (SNPs) in the 15q13.1 locus (rs1129038 and rs12913832) lie within a genomic region that has profound effects on eye and skin color; notably, 50% of variability in eye color is associated with variation in the SNP rs12913832. Because eye and skin colors vary across European populations, we further evaluated the associations of the significant SNPs after carefully adjusting for European substructure. We also evaluated the top 10 most significant SNPs by using data from three other genome-wide scans. Additional in silico data provided replication of the findings from the most significant region on chromosome 1q21.3 rs7412746 (P = 6 × 10(-10)). Together, these data identified several candidate genes for additional studies to identify causal variants predisposing to increased risk for developing melanoma.


Bioinformatics | 2012

GWASTools: an R/Bioconductor package for quality control and analysis of Genome-Wide Association Studies

Stephanie M. Gogarten; Tushar Bhangale; Matthew P. Conomos; Cecelia A. Laurie; Caitlin P. McHugh; Ian Painter; Xiuwen Zheng; David R. Crosslin; David K. Levine; Thomas Lumley; Sarah Nelson; Kenneth Rice; Jess Shen; Rohit Swarnkar; Bruce S. Weir; Cathy C. Laurie

GWASTools is an R/Bioconductor package for quality control and analysis of genome-wide association studies (GWAS). GWASTools brings the interactive capability and extensive statistical libraries of R to GWAS. Data are stored in NetCDF format to accommodate extremely large datasets that cannot fit within Rs memory limits. The documentation includes instructions for converting data from multiple formats, including variants called from sequencing. GWASTools provides a convenient interface for linking genotypes and intensity data with sample and single nucleotide polymorphism annotation.


Proceedings of the National Academy of Sciences of the United States of America | 2013

Linkage analysis identifies a locus for plasma von Willebrand factor undetected by genome-wide association

Karl C. Desch; Ayse Bilge Ozel; David Siemieniak; Yossi Kalish; Jordan A. Shavit; Courtney D. Thornburg; Anjali Sharathkumar; Caitlin P. McHugh; Cathy C. Laurie; Andrew Crenshaw; Daniel B. Mirel; Yoonhee Kim; Cheryl D. Cropp; Anne M. Molloy; Peadar N. Kirke; Joan E. Bailey-Wilson; Alexander F. Wilson; James L. Mills; John M. Scott; Lawrence C. Brody; Jun Li; David Ginsburg

The plasma glycoprotein von Willebrand factor (VWF) exhibits fivefold antigen level variation across the normal human population determined by both genetic and environmental factors. Low levels of VWF are associated with bleeding and elevated levels with increased risk for thrombosis, myocardial infarction, and stroke. To identify additional genetic determinants of VWF antigen levels and to minimize the impact of age and illness-related environmental factors, we performed genome-wide association analysis in two young and healthy cohorts (n = 1,152 and n = 2,310) and identified signals at ABO (P < 7.9E-139) and VWF (P < 5.5E-16), consistent with previous reports. Additionally, linkage analysis based on sibling structure within the cohorts, identified significant signals at chromosome 2q12–2p13 (LOD score 5.3) and at the ABO locus on chromosome 9q34 (LOD score 2.9) that explained 19.2% and 24.5% of the variance in VWF levels, respectively. Given its strong effect, the linkage region on chromosome 2 could harbor a potentially important determinant of bleeding and thrombosis risk. The absence of a chromosome 2 association signal in this or previous association studies suggests a causative gene harboring many genetic variants that are individually rare, but in aggregate common. These results raise the possibility that similar loci could explain a significant portion of the “missing heritability” for other complex genetic traits.


Diabetes | 2013

Identification of HKDC1 and BACE2 as Genes Influencing Glycemic Traits During Pregnancy Through Genome-Wide Association Studies

M. Geoffrey Hayes; Margrit Urbanek; Marie-France Hivert; Loren L. Armstrong; Jean Morrison; Cong Guo; Lynn P. Lowe; Douglas A. Scheftner; Anna Pluzhnikov; David M. Levine; Caitlin P. McHugh; Christine M. Ackerman; Luigi Bouchard; Diane Brisson; Brian T. Layden; Daniel B. Mirel; Kimberly F. Doheny; Marysa V. Leya; Rachel N. Lown-Hecht; Alan R. Dyer; Boyd E. Metzger; Timothy E. Reddy; Nancy J. Cox; William L. Lowe

Maternal metabolism during pregnancy impacts the developing fetus, affecting offspring birth weight and adiposity. This has important implications for metabolic health later in life (e.g., offspring of mothers with pre-existing or gestational diabetes mellitus have an increased risk of metabolic disorders in childhood). To identify genetic loci associated with measures of maternal metabolism obtained during an oral glucose tolerance test at ∼28 weeks’ gestation, we performed a genome-wide association study of 4,437 pregnant mothers of European (n = 1,367), Thai (n = 1,178), Afro-Caribbean (n = 1,075), and Hispanic (n = 817) ancestry, along with replication of top signals in three additional European ancestry cohorts. In addition to identifying associations with genes previously implicated with measures of glucose metabolism in nonpregnant populations, we identified two novel genome-wide significant associations: 2-h plasma glucose and HKDC1, and fasting C-peptide and BACE2. These results suggest that the genetic architecture underlying glucose metabolism may differ, in part, in pregnancy.


Arthritis & Rheumatism | 2017

Genome-Wide Association Analysis Reveals Genetic Heterogeneity of Sjögren's Syndrome According to Ancestry

Kimberly E. Taylor; Quenna Wong; David M. Levine; Caitlin P. McHugh; Cathy C. Laurie; Kimberly F. Doheny; Mi Y. Lam; Alan N. Baer; Stephen Challacombe; Hector Lanfranchi; Morten Schiødt; Muthiah Srinivasan; Hisanori Umehara; Frederick B. Vivino; Yan Zhao; Stephen Shiboski; Troy E. Daniels; John S. Greenspan; Caroline H. Shiboski; Lindsey A. Criswell

The Sjögrens International Collaborative Clinical Alliance (SICCA) is an international data registry and biorepository derived from a multisite observational study of participants in whom genotyping was performed on the Omni2.5M platform and who had undergone deep phenotyping using common protocol‐directed methods. The aim of this study was to examine the genetic etiology of Sjögrens syndrome (SS) across ancestry and disease subsets.


PLOS Genetics | 2017

Genome-wide association study of red blood cell traits in Hispanics/Latinos: The Hispanic Community Health Study/Study of Latinos

Chani J. Hodonsky; Deepti Jain; Ursula M. Schick; Jean Morrison; Lisa Brown; Caitlin P. McHugh; Diane D. Chen; Yongmei Liu; Paul L. Auer; Cecilia A. Laurie; Kent D. Taylor; Brian L. Browning; Yun Li; George J. Papanicolaou; Jerome I. Rotter; Ryo Kurita; Yukio Nakamura; Sharon R. Browning; Ruth J. F. Loos; Kari E. North; Cathy C. Laurie; Timothy A. Thornton; Nathan Pankratz; Daniel E. Bauer; Tamar Sofer; Alex P. Reiner

Prior GWAS have identified loci associated with red blood cell (RBC) traits in populations of European, African, and Asian ancestry. These studies have not included individuals with an Amerindian ancestral background, such as Hispanics/Latinos, nor evaluated the full spectrum of genomic variation beyond single nucleotide variants. Using a custom genotyping array enriched for Amerindian ancestral content and 1000 Genomes imputation, we performed GWAS in 12,502 participants of Hispanic Community Health Study and Study of Latinos (HCHS/SOL) for hematocrit, hemoglobin, RBC count, RBC distribution width (RDW), and RBC indices. Approximately 60% of previously reported RBC trait loci generalized to HCHS/SOL Hispanics/Latinos, including African ancestral alpha- and beta-globin gene variants. In addition to the known 3.8kb alpha-globin copy number variant, we identified an Amerindian ancestral association in an alpha-globin regulatory region on chromosome 16p13.3 for mean corpuscular volume and mean corpuscular hemoglobin. We also discovered and replicated three genome-wide significant variants in previously unreported loci for RDW (SLC12A2 rs17764730, PSMB5 rs941718), and hematocrit (PROX1 rs3754140). Among the proxy variants at the SLC12A2 locus we identified rs3812049, located in a bi-directional promoter between SLC12A2 (which encodes a red cell membrane ion-transport protein) and an upstream anti-sense long-noncoding RNA, LINC01184, as the likely causal variant. We further demonstrate that disruption of the regulatory element harboring rs3812049 affects transcription of SLC12A2 and LINC01184 in human erythroid progenitor cells. Together, these results reinforce the importance of genetic study of diverse ancestral populations, in particular Hispanics/Latinos.


Genetic Epidemiology | 2012

Using family data as a verification standard to evaluate copy number variation calling strategies for genetic association studies.

Xiaojing Zheng; John R. Shaffer; Caitlin P. McHugh; Cathy C. Laurie; Bjarke Feenstra; Mads Melbye; Jeffrey C. Murray; Mary L. Marazita; Eleanor Feingold

A major concern for all copy number variation (CNV) detection algorithms is their reliability and repeatability. However, it is difficult to evaluate the reliability of CNV‐calling strategies due to the lack of gold‐standard data that would tell us which CNVs are real. We propose that if CNVs are called in duplicate samples, or inherited from parent to child, then these can be considered validated CNVs. We used two large family‐based genome‐wide association study (GWAS) datasets from the GENEVA consortium to look at concordance rates of CNV calls between duplicate samples, parent‐child pairs, and unrelated pairs. Our goal was to make recommendations for ways to filter and use CNV calls in GWAS datasets that do not include family data. We used PennCNV as our primary CNV‐calling algorithm, and tested CNV calls using different datasets and marker sets, and with various filters on CNVs and samples. Using the Illumina core HumanHap550 single nucleotide polymorphism (SNP) set, we saw duplicate concordance rates of approximately 55% and parent‐child transmission rates of approximately 28% in our datasets. GC model adjustment and sample quality filtering had little effect on these reliability measures. Stratification on CNV size and DNA sample type did have some effect. Overall, our results show that it is probably not possible to find a CNV‐calling strategy (including filtering and algorithm) that will give us a set of “reliable” CNV calls using current chip technologies. But if we understand the error process, we can still use CNV calls appropriately in genetic association studies.


Genetics | 2016

Detecting Heterogeneity in Population Structure Across the Genome in Admixed Populations

Caitlin P. McHugh; Lisa Brown; Timothy A. Thornton

The genetic structure of human populations is often characterized by aggregating measures of ancestry across the autosomal chromosomes. While it may be reasonable to assume that population structure patterns are similar genome-wide in relatively homogeneous populations, this assumption may not be appropriate for admixed populations, such as Hispanics and African-Americans, with recent ancestry from two or more continents. Recent studies have suggested that systematic ancestry differences can arise at genomic locations in admixed populations as a result of selection and nonrandom mating. Here, we propose a method, which we refer to as the chromosomal ancestry differences (CAnD) test, for detecting heterogeneity in population structure across the genome. CAnD can incorporate either local or chromosome-wide ancestry inferred from SNP genotype data to identify chromosomes harboring genomic regions with ancestry contributions that are significantly different than expected. In simulation studies with real genotype data from phase III of the HapMap Project, we demonstrate the validity and power of CAnD. We apply CAnD to the HapMap Mexican-American (MXL) and African-American (ASW) population samples; in this analysis the software RFMix is used to infer local ancestry at genomic regions, assuming admixing from Europeans, West Africans, and Native Americans. The CAnD test provides strong evidence of heterogeneity in population structure across the genome in the MXL sample (p=1e−5), which is largely driven by elevated Native American ancestry and deficit of European ancestry on the X chromosomes. Among the ASW, all chromosomes are largely African derived and no heterogeneity in population structure is detected in this sample.

Collaboration


Dive into the Caitlin P. McHugh's collaboration.

Top Co-Authors

Avatar
Top Co-Authors

Avatar
Top Co-Authors

Avatar
Top Co-Authors

Avatar

Jean Morrison

University of Washington

View shared research outputs
Top Co-Authors

Avatar
Top Co-Authors

Avatar
Top Co-Authors

Avatar

Alan R. Dyer

Northwestern University

View shared research outputs
Top Co-Authors

Avatar
Top Co-Authors

Avatar
Top Co-Authors

Avatar
Researchain Logo
Decentralizing Knowledge