Network


Latest external collaboration on country level. Dive into details by clicking on the dots.

Hotspot


Dive into the research topics where Thomas J. Hoffmann is active.

Publication


Featured researches published by Thomas J. Hoffmann.


Genomics | 2011

Next generation genome-wide association tool: Design and coverage of a high-throughput European-optimized SNP array

Thomas J. Hoffmann; Mark N. Kvale; Stephanie Hesselson; Yiping Zhan; Christine Aquino; Yang Cao; Simon Cawley; Elaine Chung; Sheryl Connell; Jasmin Eshragh; Marcia Ewing; Jeremy Gollub; Mary Henderson; Earl Hubbell; Carlos Iribarren; Jay Kaufman; Richard Lao; Yontao Lu; Dana Ludwig; Gurpreet K. Mathauda; William B. McGuire; Gangwu Mei; Sunita Miles; Matthew M. Purdy; Charles P. Quesenberry; Dilrini Ranatunga; Sarah Rowell; Marianne Sadler; Michael H. Shapero; Ling Shen

The success of genome-wide association studies has paralleled the development of efficient genotyping technologies. We describe the development of a next-generation microarray based on the new highly-efficient Affymetrix Axiom genotyping technology that we are using to genotype individuals of European ancestry from the Kaiser Permanente Research Program on Genes, Environment and Health (RPGEH). The array contains 674,517 SNPs, and provides excellent genome-wide as well as gene-based and candidate-SNP coverage. Coverage was calculated using an approach based on imputation and cross validation. Preliminary results for the first 80,301 saliva-derived DNA samples from the RPGEH demonstrate very high quality genotypes, with sample success rates above 94% and over 98% of successful samples having SNP call rates exceeding 98%. At steady state, we have produced 462 million genotypes per week for each Axiom system. The new array provides a valuable addition to the repertoire of tools for large scale genome-wide association studies.


PLOS ONE | 2010

Comprehensive Approach to Analyzing Rare Genetic Variants

Thomas J. Hoffmann; Nicholas J. Marini; John S. Witte

Recent findings suggest that rare variants play an important role in both monogenic and common diseases. Due to their rarity, however, it remains unclear how to appropriately analyze the association between such variants and disease. A common approach entails combining rare variants together based on a priori information and analyzing them as a single group. Here one must make some assumptions about what to aggregate. Instead, we propose two approaches to empirically determine the most efficient grouping of rare variants. The first considers multiple possible groupings using existing information. The second is an agnostic “step-up” approach that determines an optimal grouping of rare variants analytically and does not rely on prior information. To evaluate these approaches, we undertook a simulation study using sequence data from genes in the one-carbon folate metabolic pathway. Our results show that using prior information to group rare variants is advantageous only when information is quite accurate, but the step-up approach works well across a broad range of plausible scenarios. This agnostic approach allows one to efficiently analyze the association between rare variants and disease while avoiding assumptions required by other approaches for grouping such variants.


American Journal of Human Genetics | 2012

Estimating Kinship in Admixed Populations

Timothy A. Thornton; Hua Tang; Thomas J. Hoffmann; Heather M. Ochs-Balcom; Bette J. Caan; Neil Risch

Genome-wide association studies (GWASs) are commonly used for the mapping of genetic loci that influence complex traits. A problem that is often encountered in both population-based and family-based GWASs is that of identifying cryptic relatedness and population stratification because it is well known that failure to appropriately account for both pedigree and population structure can lead to spurious association. A number of methods have been proposed for identifying relatives in samples from homogeneous populations. A strong assumption of population homogeneity, however, is often untenable, and many GWASs include samples from structured populations. Here, we consider the problem of estimating relatedness in structured populations with admixed ancestry. We propose a method, REAP (relatedness estimation in admixed populations), for robust estimation of identity by descent (IBD)-sharing probabilities and kinship coefficients in admixed populations. REAP appropriately accounts for population structure and ancestry-related assortative mating by using individual-specific allele frequencies at SNPs that are calculated on the basis of ancestry derived from whole-genome analysis. In simulation studies with related individuals and admixture from highly divergent populations, we demonstrate that REAP gives accurate IBD-sharing probabilities and kinship coefficients. We apply REAP to the Mexican Americans in Los Angeles, California (MXL) population sample of release 3 of phase III of the International Haplotype Map Project; in this sample, we identify third- and fourth-degree relatives who have not previously been reported. We also apply REAP to the African American and Hispanic samples from the Womens Health Initiative SNP Health Association Resource (WHI-SHARe) study, in which hundreds of pairs of cryptically related individuals have been identified.


Genomics | 2011

Design and coverage of high throughput genotyping arrays optimized for individuals of East Asian, African American, and Latino race/ethnicity using imputation and a novel hybrid SNP selection algorithm.

Thomas J. Hoffmann; Yiping Zhan; Mark N. Kvale; Stephanie Hesselson; Jeremy Gollub; Carlos Iribarren; Yontao Lu; Gangwu Mei; Matthew M. Purdy; Charles P. Quesenberry; Sarah Rowell; Michael H. Shapero; David Smethurst; Carol P. Somkin; Stephen K. Van Den Eeden; Larry Walter; Teresa Webster; Rachel A. Whitmer; Andrea Finn; Catherine Schaefer; Pui-Yan Kwok; Neil Risch

Four custom Axiom genotyping arrays were designed for a genome-wide association (GWA) study of 100,000 participants from the Kaiser Permanente Research Program on Genes, Environment and Health. The array optimized for individuals of European race/ethnicity was previously described. Here we detail the development of three additional microarrays optimized for individuals of East Asian, African American, and Latino race/ethnicity. For these arrays, we decreased redundancy of high-performing SNPs to increase SNP capacity. The East Asian array was designed using greedy pairwise SNP selection. However, removing SNPs from the target set based on imputation coverage is more efficient than pairwise tagging. Therefore, we developed a novel hybrid SNP selection method for the African American and Latino arrays utilizing rounds of greedy pairwise SNP selection, followed by removal from the target set of SNPs covered by imputation. The arrays provide excellent genome-wide coverage and are valuable additions for large-scale GWA studies.


Genetics | 2015

Characterizing Race/Ethnicity and Genetic Ancestry for 100,000 Subjects in the Genetic Epidemiology Research on Adult Health and Aging (GERA) Cohort

Yambazi Banda; Mark N. Kvale; Thomas J. Hoffmann; Stephanie Hesselson; Dilrini Ranatunga; Hua Tang; Chiara Sabatti; Lisa A. Croen; Brad Dispensa; Mary Henderson; Carlos Iribarren; Eric Jorgenson; Lawrence H. Kushi; Dana Ludwig; Diane Olberg; Charles P. Quesenberry; Sarah Rowell; Marianne Sadler; Lori C. Sakoda; Stanley Sciortino; Ling Shen; David Smethurst; Carol P. Somkin; Stephen K. Van Den Eeden; Lawrence Walter; Rachel A. Whitmer; Pui-Yan Kwok; Catherine Schaefer; Neil Risch

Using genome-wide genotypes, we characterized the genetic structure of 103,006 participants in the Kaiser Permanente Northern California multi-ethnic Genetic Epidemiology Research on Adult Health and Aging Cohort and analyzed the relationship to self-reported race/ethnicity. Participants endorsed any of 23 race/ethnicity/nationality categories, which were collapsed into seven major race/ethnicity groups. By self-report the cohort is 80.8% white and 19.2% minority; 93.8% endorsed a single race/ethnicity group, while 6.2% endorsed two or more. Principal component (PC) and admixture analyses were generally consistent with prior studies. Approximately 17% of subjects had genetic ancestry from more than one continent, and 12% were genetically admixed, considering only nonadjacent geographical origins. Self-reported whites were spread on a continuum along the first two PCs, indicating extensive mixing among European nationalities. Self-identified East Asian nationalities correlated with genetic clustering, consistent with extensive endogamy. Individuals of mixed East Asian–European genetic ancestry were easily identified; we also observed a modest amount of European genetic ancestry in individuals self-identified as Filipinos. Self-reported African Americans and Latinos showed extensive European and African genetic ancestry, and Native American genetic ancestry for the latter. Among 3741 genetically identified parent–child pairs, 93% were concordant for self-reported race/ethnicity; among 2018 genetically identified full-sib pairs, 96% were concordant; the lower rate for parent–child pairs was largely due to intermarriage. The parent–child pairs revealed a trend toward increasing exogamy over time; the presence in the cohort of individuals endorsing multiple race/ethnicity categories creates interesting challenges and future opportunities for genetic epidemiologic studies.


American Journal of Human Genetics | 2013

Genome-wide Characterization of Shared and Distinct Genetic Components that Influence Blood Lipid Levels in Ethnically Diverse Human Populations

Marc A. Coram; Qing Duan; Thomas J. Hoffmann; Timothy A. Thornton; Joshua W. Knowles; Nicholas A. Johnson; Heather M. Ochs-Balcom; Timothy A. Donlon; Lisa W. Martin; Charles B. Eaton; Jennifer G. Robinson; Neil Risch; Xiaofeng Zhu; Charles Kooperberg; Yun Li; Alex P. Reiner; Hua Tang

Blood lipid concentrations are heritable risk factors associated with atherosclerosis and cardiovascular diseases. Lipid traits exhibit considerable variation among populations of distinct ancestral origin as well as between individuals within a population. We performed association analyses to identify genetic loci influencing lipid concentrations in African American and Hispanic American women in the Womens Health Initiative SNP Health Association Resource. We validated one African-specific high-density lipoprotein cholesterol locus at CD36 as well as 14 known lipid loci that have been previously implicated in studies of European populations. Moreover, we demonstrate striking similarities in genetic architecture (loci influencing the trait, direction and magnitude of genetic effects, and proportions of phenotypic variation explained) of lipid traits across populations. In particular, we found that a disproportionate fraction of lipid variation in African Americans and Hispanic Americans can be attributed to genomic loci exhibiting statistical evidence of association in Europeans, even though the precise genes and variants remain unknown. At the same time, we found substantial allelic heterogeneity within shared loci, characterized both by population-specific rare variants and variants shared among multiple populations that occur at disparate frequencies. The allelic heterogeneity emphasizes the importance of including diverse populations in future genetic association studies of complex traits such as lipids; furthermore, the overlap in lipid loci across populations of diverse ancestral origin argues that additional knowledge can be gleaned from multiple populations.


Nature Genetics | 2017

Genome-wide association analyses using electronic health records identify new loci influencing blood pressure variation

Thomas J. Hoffmann; Georg B. Ehret; Priyanka Nandakumar; Dilrini Ranatunga; Catherine Schaefer; Pui-Yan Kwok; Carlos Iribarren; Aravinda Chakravarti; Neil Risch

Longitudinal electronic health records on 99,785 Genetic Epidemiology Research on Adult Health and Aging (GERA) cohort individuals provided 1,342,814 systolic and diastolic blood pressure measurements for a genome-wide association study on long-term average systolic, diastolic, and pulse pressure. We identified 39 new loci among 75 genome-wide significant loci (P ≤ 5 × 10−8), with most replicating in the combined International Consortium for Blood Pressure (ICBP; n = 69,396) and UK Biobank (UKB; n = 152,081) studies. Combining GERA with ICBP yielded 36 additional new loci, with most replicating in UKB. Combining all three studies (n = 321,262) yielded 241 additional genome-wide significant loci, although no replication sample was available for these. All associated loci explained 2.9%, 2.5%, and 3.1% of variation in systolic, diastolic, and pulse pressure, respectively, in GERA non-Hispanic whites. Using multiple blood pressure measurements in GERA doubled the variance explained. A normalized risk score was associated with time to onset of hypertension (hazards ratio = 1.18, P = 8.2 × 10−45). Expression quantitative trait locus analysis of blood pressure loci showed enrichment in aorta and tibial artery.


Genetics | 2015

Genotyping Informatics and Quality Control for 100,000 Subjects in the Genetic Epidemiology Research on Adult Health and Aging (GERA) Cohort

Mark N. Kvale; Stephanie Hesselson; Thomas J. Hoffmann; Yang Cao; David Chan; Sheryl Connell; Lisa A. Croen; Brad Dispensa; Jasmin Eshragh; Andrea Finn; Jeremy Gollub; Carlos Iribarren; Eric Jorgenson; Lawrence H. Kushi; Richard Lao; Yontao Lu; Dana Ludwig; Gurpreet K. Mathauda; William B. McGuire; Gangwu Mei; Sunita Miles; Michael Mittman; Mohini Patil; Charles P. Quesenberry; Dilrini Ranatunga; Sarah Rowell; Marianne Sadler; Lori C. Sakoda; Michael H. Shapero; Ling Shen

The Kaiser Permanente (KP) Research Program on Genes, Environment and Health (RPGEH), in collaboration with the University of California—San Francisco, undertook genome-wide genotyping of >100,000 subjects that constitute the Genetic Epidemiology Research on Adult Health and Aging (GERA) cohort. The project, which generated >70 billion genotypes, represents the first large-scale use of the Affymetrix Axiom Genotyping Solution. Because genotyping took place over a short 14-month period, creating a near-real-time analysis pipeline for experimental assay quality control and final optimized analyses was critical. Because of the multi-ethnic nature of the cohort, four different ethnic-specific arrays were employed to enhance genome-wide coverage. All assays were performed on DNA extracted from saliva samples. To improve sample call rates and significantly increase genotype concordance, we partitioned the cohort into disjoint packages of plates with similar assay contexts. Using strict QC criteria, the overall genotyping success rate was 103,067 of 109,837 samples assayed (93.8%), with a range of 92.1–95.4% for the four different arrays. Similarly, the SNP genotyping success rate ranged from 98.1 to 99.4% across the four arrays, the variation depending mostly on how many SNPs were included as single copy vs. double copy on a particular array. The high quality and large scale of genotype data created on this cohort, in conjunction with comprehensive longitudinal data from the KP electronic health records of participants, will enable a broad range of highly powered genome-wide association studies on a diversity of traits and conditions.


Pediatrics | 2013

A Genome-Wide Association Study (GWAS) for Bronchopulmonary Dysplasia

Hui Wang; Krystal R. St. Julien; David K. Stevenson; Thomas J. Hoffmann; John S. Witte; Laura C. Lazzeroni; Mark A. Krasnow; Cecele Quaintance; John Oehlert; Laura L. Jelliffe-Pawlowski; Jeffrey B. Gould; Gary M. Shaw; Hugh M. O’Brodovich

OBJECTIVE: Twin studies suggest that heritability of moderate-severe bronchopulmonary dysplasia (BPD) is 53% to 79%, we conducted a genome-wide association study (GWAS) to identify genetic variants associated with the risk for BPD. METHODS: The discovery GWAS was completed on 1726 very low birth weight infants (gestational age = 250–296/7 weeks) who had a minimum of 3 days of intermittent positive pressure ventilation and were in the hospital at 36 weeks’ postmenstrual age. At 36 weeks’ postmenstrual age, moderate-severe BPD cases (n = 899) were defined as requiring continuous supplemental oxygen, whereas controls (n = 827) inhaled room air. An additional 795 comparable infants (371 cases, 424 controls) were a replication population. Genomic DNA from case and control newborn screening bloodspots was used for the GWAS. The replication study interrogated single-nucleotide polymorphisms (SNPs) identified in the discovery GWAS and those within the HumanExome beadchip. RESULTS: Genotyping using genomic DNA was successful. We did not identify SNPs associated with BPD at the genome-wide significance level (5 × 10−8) and no SNP identified in previous studies reached statistical significance (Bonferroni-corrected P value threshold .0018). Pathway analyses were not informative. CONCLUSIONS: We did not identify genomic loci or pathways that account for the previously described heritability for BPD. Potential explanations include causal mutations that are genetic variants and were not assayed or are mapped to many distributed loci, inadequate sample size, race ethnicity of our study population, or case-control differences investigated are not attributable to underlying common genetic variation.


American Journal of Psychiatry | 2014

Familial Recurrence of Autism Spectrum Disorder: Evaluating Genetic and Environmental Contributions

Neil Risch; Thomas J. Hoffmann; Meredith Anderson; Lisa A. Croen; Judith K. Grether; Gayle C. Windham

OBJECTIVE This study was designed to examine the pattern of familial recurrence of autism spectrum disorder (ASD) in terms of genetic and environmental contributions related to timing of birth. METHOD The authors linked California Department of Developmental Services records with state birth certificates to identify all siblings and half siblings of individuals affected with ASD born between 1990 and 2003. A total of 6,616 full siblings, 644 maternal half siblings, and 299 paternal half siblings born after ASD index cases were used to calculate recurrence risks. Control families, identified through matching to cases, were included for comparison (a total of 29,384 siblings). RESULTS The overall sibling recurrence risk was 10.1%, compared with a prevalence of 0.52% in siblings of controls. The recurrence risk in second-born children was higher (11.5%) than in later-born siblings (7.3%); a similar pattern was observed for maternal half siblings (6.5% for second-born compared with 3.0% for later-born siblings; 4.8% overall). The recurrence risk was significantly higher for siblings who immediately followed the index case in birth order compared with those later in birth order. The recurrence risk for paternal half siblings (2.3%) was half the overall recurrence risk for maternal half siblings but was similar to that for later-born maternal half siblings. An exponential effect of short interbirth interval was observed, with the recurrence risk reaching 14.4% for an interbirth interval of 18 months or less, compared with 6.8% for an interval of 4 years or more. An identical phenomenon was observed in maternal half siblings. CONCLUSIONS The results support genetic susceptibility in the familial recurrence of ASD along with factors related to timing of birth.

Collaboration


Dive into the Thomas J. Hoffmann's collaboration.

Top Co-Authors

Avatar

Neil Risch

University of California

View shared research outputs
Top Co-Authors

Avatar
Top Co-Authors

Avatar

John S. Witte

University of California

View shared research outputs
Top Co-Authors

Avatar
Top Co-Authors

Avatar

Mark N. Kvale

University of California

View shared research outputs
Top Co-Authors

Avatar
Top Co-Authors

Avatar
Top Co-Authors

Avatar
Top Co-Authors

Avatar

Yambazi Banda

University of California

View shared research outputs
Top Co-Authors

Avatar
Researchain Logo
Decentralizing Knowledge