Xiaoquan Wen
University of Michigan
Network
Latest external collaboration on country level. Dive into details by clicking on the dots.
Publication
Featured researches published by Xiaoquan Wen.
Nature Genetics | 2006
Donald F. Conrad; Mattias Jakobsson; Graham Coop; Xiaoquan Wen; Jeffrey D. Wall; Noah A. Rosenberg; Jonathan K. Pritchard
Recent genomic surveys have produced high-resolution haplotype information, but only in a small number of human populations. We report haplotype structure across 12 Mb of DNA sequence in 927 individuals representing 52 populations. The geographic distribution of haplotypes reflects human history, with a loss of haplotype diversity as distance increases from Africa. Although the extent of linkage disequilibrium (LD) varies markedly across populations, considerable sharing of haplotype structure exists, and inferred recombination hotspot locations generally match across groups. The four samples in the International HapMap Project contain the majority of common haplotypes found in most populations: averaging across populations, 83% of common 20-kb haplotypes in a population are also common in the most similar HapMap sample. Consequently, although the portability of tag SNPs based on the HapMap is reduced in low-LD Africans, the HapMap will be helpful for the design of genome-wide association mapping studies in nearly all human populations.
Science | 2008
Graham Coop; Xiaoquan Wen; Carole Ober; Jonathan K. Pritchard; Molly Przeworski
Recombination plays a crucial role in meiosis, ensuring the proper segregation of chromosomes. Recent linkage disequilibrium (LD) and sperm-typing studies suggest that recombination rates vary tremendously across the human genome, with most events occurring in narrow “hotspots.” To examine variation in fine-scale recombination patterns among individuals, we used dense, genome-wide single-nucleotide polymorphism data collected in nuclear families to localize crossovers with high spatial resolution. This analysis revealed that overall recombination hotspot usage is similar in males and females, with individual hotspots often active in both sexes. Across the genome, roughly 60% of crossovers occurred in hotspots inferred from LD studies. Notably, however, we found extensive and heritable variation among both males and females in the proportion of crossovers occurring in these hotspots.
PLOS Genetics | 2013
Timothée Flutre; Xiaoquan Wen; Jonathan K. Pritchard; Matthew Stephens
Mapping expression Quantitative Trait Loci (eQTLs) represents a powerful and widely adopted approach to identifying putative regulatory variants and linking them to specific genes. Up to now eQTL studies have been conducted in a relatively narrow range of tissues or cell types. However, understanding the biology of organismal phenotypes will involve understanding regulation in multiple tissues, and ongoing studies are collecting eQTL data in dozens of cell types. Here we present a statistical framework for powerfully detecting eQTLs in multiple tissues or cell types (or, more generally, multiple subgroups). The framework explicitly models the potential for each eQTL to be active in some tissues and inactive in others. By modeling the sharing of active eQTLs among tissues, this framework increases power to detect eQTLs that are present in more than one tissue compared with “tissue-by-tissue” analyses that examine each tissue separately. Conversely, by modeling the inactivity of eQTLs in some tissues, the framework allows the proportion of eQTLs shared across different tissues to be formally estimated as parameters of a model, addressing the difficulties of accounting for incomplete power when comparing overlaps of eQTLs identified by tissue-by-tissue analyses. Applying our framework to re-analyze data from transformed B cells, T cells, and fibroblasts, we find that it substantially increases power compared with tissue-by-tissue analysis, identifying 63% more genes with eQTLs (at FDR = 0.05). Further, the results suggest that, in contrast to previous analyses of the same data, the majority of eQTLs detectable in these data are shared among all three tissues.
American Journal of Human Genetics | 2004
Sebastian Zöllner; Xiaoquan Wen; Neil A. Hanchard; Mark A. Herbert; Carole Ober; Jonathan K. Pritchard
It is a basic principle of genetics that each chromosome is transmitted from parent to offspring with a probability that is given by Mendels laws. However, several known biological processes lead to skewed transmission probabilities among surviving offspring and, therefore, to excess genetic sharing among relatives. Examples include in utero selection against deleterious mutations, meiotic drive, and maternal-fetal incompatibility. Although these processes affect our basic understanding of inheritance, little is known about their overall impact in humans or other mammals. In this study, we examined genome screen data from 148 nuclear families, collected without reference to phenotype, to look for departures from Mendelian transmission proportions. Using single-point and multipoint linkage analysis, we detected a modest but significant genomewide shift towards excess genetic sharing among siblings (average sharing of 50.43% for the autosomes; P=.009). Our calculations indicate that many loci with skewed transmission are required to produce a genomewide shift of this magnitude. Since transmission distortion loci are subject to strong selection, this raises interesting questions about the evolutionary forces that keep them polymorphic. Finally, our results also have implications for mapping disease genes and for the genetics of fertility.
PLOS Genetics | 2011
Joseph C. Maranville; Francesca Luca; Allison L. Richards; Xiaoquan Wen; David B. Witonsky; Shaneen S. Baxter; Matthew Stephens; Anna Di Rienzo
Glucocorticoids (GCs) mediate physiological responses to environmental stress and are commonly used as pharmaceuticals. GCs act primarily through the GC receptor (GR, a transcription factor). Despite their clear biomedical importance, little is known about the genetic architecture of variation in GC response. Here we provide an initial assessment of variability in the cellular response to GC treatment by profiling gene expression and protein secretion in 114 EBV-transformed B lymphocytes of African and European ancestry. We found that genetic variation affects the response of nearby genes and exhibits distinctive patterns of genotype-treatment interactions, with genotypic effects evident in either only GC-treated or only control-treated conditions. Using a novel statistical framework, we identified interactions that influence the expression of 26 genes known to play central roles in GC-related pathways (e.g. NQO1, AIRE, and SGK1) and that influence the secretion of IL6.
American Journal of Human Genetics | 2015
Philip E. Stuart; Rajan P. Nair; Lam C. Tsoi; Trilokraj Tejasvi; Sayantan Das; Hyun Min Kang; Eva Ellinghaus; Vinod Chandran; Kristina Callis-Duffin; Robert W. Ike; Yanming Li; Xiaoquan Wen; Charlotta Enerbäck; Johann E. Gudjonsson; Sulev Kõks; Külli Kingo; Tonu Esko; Ulrich Mrowietz; André Reis; H.-Erich Wichmann; Christian Gieger; Per Hoffmann; Markus M. Nöthen; Juliane Winkelmann; Manfred Kunz; Elvia G. Moreta; Philip J. Mease; Christopher T. Ritchlin; Anne M. Bowcock; Gerald G. Krueger
Psoriasis vulgaris (PsV) is a common inflammatory and hyperproliferative skin disease. Up to 30% of people with PsV eventually develop psoriatic arthritis (PsA), an inflammatory musculoskeletal condition. To discern differences in genetic risk factors for PsA and cutaneous-only psoriasis (PsC), we carried out a genome-wide association study (GWAS) of 1,430 PsA case subjects and 1,417 unaffected control subjects. Meta-analysis of this study with three other GWASs and two targeted genotyping studies, encompassing a total of 9,293 PsV case subjects, 3,061 PsA case subjects, 3,110 PsC case subjects, and 13,670 unaffected control subjects of European descent, detected 10 regions associated with PsA and 11 with PsC at genome-wide (GW) significance. Several of these association signals (IFNLR1, IFIH1, NFKBIA for PsA; TNFRSF9, LCE3C/B, TRAF3IP2, IL23A, NFKBIA for PsC) have not previously achieved GW significance. After replication, we also identified a PsV-associated SNP near CDKAL1 (rs4712528, odds ratio [OR] = 1.16, p = 8.4 × 10(-11)). Among identified psoriasis risk variants, three were more strongly associated with PsC than PsA (rs12189871 near HLA-C, p = 5.0 × 10(-19); rs4908742 near TNFRSF9, p = 0.00020; rs10888503 near LCE3A, p = 0.0014), and two were more strongly associated with PsA than PsC (rs12044149 near IL23R, p = 0.00018; rs9321623 near TNFAIP3, p = 0.00022). The PsA-specific variants were independent of previously identified psoriasis variants near IL23R and TNFAIP3. We also found multiple independent susceptibility variants in the IL12B, NOS2, and IFIH1 regions. These results provide insights into the pathogenetic similarities and differences between PsC and PsA.
PLOS Genetics | 2009
Adi Fledel-Alon; Daniel J. Wilson; Karl W. Broman; Xiaoquan Wen; Carole Ober; Graham Coop; Molly Przeworski
Although recombination is essential to the successful completion of human meiosis, it remains unclear how tightly the process is regulated and over what scale. To assess the nature and stringency of constraints on human recombination, we examined crossover patterns in transmissions to viable, non-trisomic offspring, using dense genotyping data collected in a large set of pedigrees. Our analysis supports a requirement for one chiasma per chromosome rather than per arm to ensure proper disjunction, with additional chiasmata occurring in proportion to physical length. The requirement is not absolute, however, as chromosome 21 seems to be frequently transmitted properly in the absence of a chiasma in females, a finding that raises the possibility of a back-up mechanism aiding in its correct segregation. We also found a set of double crossovers in surprisingly close proximity, as expected from a second pathway that is not subject to crossover interference. These findings point to multiple mechanisms that shape the distribution of crossovers, influencing proper disjunction in humans.
Genetic Epidemiology | 2009
Omar De la Cruz; Xiaoquan Wen; Baoguan Ke; Minsun Song; Dan L. Nicolae
In the setting of genome‐wide association studies, we propose a method for assigning a measure of significance to pre‐defined sets of markers in the genome. The sets can be genes, conserved regions, or groups of genes such as pathways. Using the proposed methods and algorithms, evidence for association between a particular functional unit and a disease status can be obtained not just by the presence of a strong signal from a SNP within it, but also by the combination of several simultaneous weaker signals that are not strongly correlated. This approach has several advantages. First, moderately strong signals from different SNPs are combined to obtain a much stronger signal for the set, therefore increasing power. Second, in combination with methods that provide information on untyped markers, it leads to results that can be readily combined across studies and platforms that might use different SNPs. Third, the results are easy to interpret, since they refer to functional sets of markers that are likely to behave as a unit in their phenotypic effect. Finally, the availability of gene‐level P‐values for association is the first step in developing methods that integrate information from pathways and networks with genome‐wide association data, and these can lead to a better understanding of the complex traits genetic architecture. The power of the approach is investigated in simulated and real datasets. Novel Crohns disease associations are found using the WTCCC data. Genet. Epidemiol. 34: 222–231, 2010.
PLOS Genetics | 2005
Dan L. Nicolae; Xiaoquan Wen; Benjamin F. Voight; Nancy J. Cox
Improvements in technology have made it possible to conduct genome-wide association mapping at costs within reach of academic investigators, and experiments are currently being conducted with a variety of high-throughput platforms. To provide an appropriate context for interpreting results of such studies, we summarize here results of an investigation of one of the first of these technologies to be publicly available, the Affymetrix GeneChip Human Mapping 100K set of single nucleotide polymorphisms (SNPs). In a systematic analysis of the pattern and distribution of SNPs in the Mapping 100K set, we find that SNPs in this set are undersampled from coding regions (both nonsynonymous and synonymous) and oversampled from regions outside genes, relative to SNPs in the overall HapMap database. In addition, we utilize a novel multilocus linkage disequilibrium (LD) coefficient based on information content (analogous to the information content scores commonly used for linkage mapping) that is equivalent to the familiar measure r 2 in the special case of two loci. Using this approach, we are able to summarize for any subset of markers, such as the Affymetrix Mapping 100K set, the information available for association mapping in that subset, relative to the information available in the full set of markers included in the HapMap, and highlight circumstances in which this multilocus measure of LD provides substantial additional insight about the haplotype structure in a region over pairwise measures of LD.
The Annals of Applied Statistics | 2010
Xiaoquan Wen; Matthew Stephens
Recently-developed genotype imputation methods are a powerful tool for detecting untyped genetic variants that affect disease susceptibility in genetic association studies. However, existing imputation methods require individual-level genotype data, whereas in practice it is often the case that only summary data are available. For example this may occur because, for reasons of privacy or politics, only summary data are made available to the research community at large; or because only summary data are collected, as in DNA pooling experiments. In this article, we introduce a new statistical method that can accurately infer the frequencies of untyped genetic variants in these settings, and indeed substantially improve frequency estimates at typed variants in pooling experiments where observations are noisy. Our approach, which predicts each allele frequency using a linear combination of observed frequencies, is statistically straight-forward, and related to a long history of the use of linear methods for estimating missing values (e.g. Kriging). The main statistical novelty is our approach to regularizing the covariance matrix estimates, and the resulting linear predictors, which is based on methods from population genetics. We find that, besides being both fast and flexible - allowing new problems to be tackled that cannot be handled by existing imputation approaches purpose-built for the genetic context - these linear methods are also very accurate. Indeed, imputation accuracy using this approach is similar to that obtained by state-of-the art imputation methods that use individual-level data, but at a fraction of the computational cost.