Katherine E. Varley
Washington University in St. Louis
Network
Latest external collaboration on country level. Dive into details by clicking on the dots.
Publication
Featured researches published by Katherine E. Varley.
Genome Research | 2013
Katherine E. Varley; Jason Gertz; Kevin M. Bowling; Stephanie L. Parker; Timothy E. Reddy; Florencia Pauli-Behn; Marie K. Cross; Brian A. Williams; John A. Stamatoyannopoulos; Gregory E. Crawford; Devin Absher; Barbara J. Wold; Richard M. Myers
As studies of DNA methylation increase in scope, it has become evident that methylation has a complex relationship with gene expression, plays an important role in defining cell types, and is disrupted in many diseases. We describe large-scale single-base resolution DNA methylation profiling on a diverse collection of 82 human cell lines and tissues using reduced representation bisulfite sequencing (RRBS). Analysis integrating RNA-seq and ChIP-seq data illuminates the functional role of this dynamic mark. Loci that are hypermethylated across cancer types are enriched for sites bound by NANOG in embryonic stem cells, which supports and expands the model of a stem/progenitor cell signature in cancer. CpGs that are hypomethylated across cancer types are concentrated in megabase-scale domains that occur near the telomeres and centromeres of chromosomes, are depleted of genes, and are enriched for cancer-specific EZH2 binding and H3K27me3 (repressive chromatin). In noncancer samples, there are cell-type specific methylation signatures preserved in primary cell lines and tissues as well as methylation differences induced by cell culture. The relationship between methylation and expression is context-dependent, and we find that CpG-rich enhancers bound by EP300 in the bodies of expressed genes are unmethylated despite the dense gene-body methylation surrounding them. Non-CpG cytosine methylation occurs in human somatic tissue, is particularly prevalent in brain tissue, and is reproducible across many individuals. This study provides an atlas of DNA methylation across diverse and well-characterized samples and enables new discoveries about DNA methylation and its role in gene regulation and disease.
Genome Research | 2012
Hao Wang; Matthew T. Maurano; Hongzhu Qu; Katherine E. Varley; Jason Gertz; Florencia Pauli; Kristen Lee; Theresa K. Canfield; Molly Weaver; Richard Sandstrom; Robert E. Thurman; Rajinder Kaul; Richard M. Myers; John A. Stamatoyannopoulos
CTCF is a ubiquitously expressed regulator of fundamental genomic processes including transcription, intra- and interchromosomal interactions, and chromatin structure. Because of its critical role in genome function, CTCF binding patterns have long been assumed to be largely invariant across different cellular environments. Here we analyze genome-wide occupancy patterns of CTCF by ChIP-seq in 19 diverse human cell types, including normal primary cells and immortal lines. We observed highly reproducible yet surprisingly plastic genomic binding landscapes, indicative of strong cell-selective regulation of CTCF occupancy. Comparison with massively parallel bisulfite sequencing data indicates that 41% of variable CTCF binding is linked to differential DNA methylation, concentrated at two critical positions within the CTCF recognition sequence. Unexpectedly, CTCF binding patterns were markedly different in normal versus immortal cells, with the latter showing widespread disruption of CTCF binding associated with increased methylation. Strikingly, this disruption is accompanied by up-regulation of CTCF expression, with the result that both normal and immortal cells maintain the same average number of CTCF occupancy sites genome-wide. These results reveal a tight linkage between DNA methylation and the global occupancy patterns of a major sequence-specific regulatory factor.
PLOS Genetics | 2011
Jason Gertz; Katherine E. Varley; Timothy E. Reddy; Kevin M. Bowling; Florencia Pauli; Stephanie L. Parker; Katerina S. Kucera; Huntington F. Willard; Richard M. Myers
The methylation of cytosines in CpG dinucleotides is essential for cellular differentiation and the progression of many cancers, and it plays an important role in gametic imprinting. To assess variation and inheritance of genome-wide patterns of DNA methylation simultaneously in humans, we applied reduced representation bisulfite sequencing (RRBS) to somatic DNA from six members of a three-generation family. We observed that 8.1% of heterozygous SNPs are associated with differential methylation in cis, which provides a robust signature for Mendelian transmission and relatedness. The vast majority of differential methylation between homologous chromosomes (>92%) occurs on a particular haplotype as opposed to being associated with the gender of the parent of origin, indicating that genotype affects DNA methylation of far more loci than does gametic imprinting. We found that 75% of genotype-dependent differential methylation events in the family are also seen in unrelated individuals and that overall genotype can explain 80% of the variation in DNA methylation. These events are under-represented in CpG islands, enriched in intergenic regions, and located in regions of low evolutionary conservation. Even though they are generally not in functionally constrained regions, 22% (twice as many as expected by chance) of genes harboring genotype-dependent DNA methylation exhibited allele-specific gene expression as measured by RNA-seq of a lymphoblastoid cell line, indicating that some of these events are associated with gene expression differences. Overall, our results demonstrate that the influence of genotype on patterns of DNA methylation is widespread in the genome and greatly exceeds the influence of imprinting on genome-wide methylation patterns.
Nature Methods | 2009
Todd E. Druley; Francesco Vallania; Daniel J. Wegner; Katherine E. Varley; Olivia L. Knowles; Jacqueline A. Bonds; Sarah W. Robison; Scott W. Doniger; Aaron Hamvas; F. Sessions Cole; Justin C. Fay; Robi D. Mitra
We report a targeted, cost-effective method to quantify rare single-nucleotide polymorphisms from pooled human genomic DNA using second-generation sequencing. We pooled DNA from 1,111 individuals and targeted four genes to identify rare germline variants. Our base-calling algorithm, SNPSeeker, derived from large deviation theory, detected single-nucleotide polymorphisms present at frequencies below the raw error rate of the sequencing platform.
Proceedings of the National Academy of Sciences of the United States of America | 2012
Ya Guo; Kevin Monahan; Haiyang Wu; Jason Gertz; Katherine E. Varley; Wei Li; Richard M. Myers; Tom Maniatis; Qiang Wu
The closely linked human protocadherin (Pcdh) α, β, and γ gene clusters encode 53 distinct protein isoforms, which are expressed in a combinatorial manner to generate enormous diversity on the surface of individual neurons. This diversity is a consequence of stochastic promoter choice and alternative pre-mRNA processing. Here, we show that Pcdhα promoter choice is achieved by DNA looping between two downstream transcriptional enhancers and individual promoters driving the expression of alternate Pcdhα isoforms. In addition, we show that this DNA looping requires specific binding of the CTCF/cohesin complex to two symmetrically aligned binding sites in both the transcriptionally active promoters and in one of the enhancers. These findings have important implications regarding enhancer/promoter interactions in the generation of complex Pcdh cell surface codes for the establishment of neuronal identity and self-avoidance in individual neurons.
Genome Research | 2012
Timothy E. Reddy; Jason Gertz; Florencia Pauli; Katerina S. Kucera; Katherine E. Varley; Kimberly M. Newberry; Georgi K. Marinov; Ali Mortazavi; Brian A. Williams; Lingyun Song; Gregory E. Crawford; Barbara J. Wold; Huntington F. Willard; Richard M. Myers
A complex interplay between transcription factors (TFs) and the genome regulates transcription. However, connecting variation in genome sequence with variation in TF binding and gene expression is challenging due to environmental differences between individuals and cell types. To address this problem, we measured genome-wide differential allelic occupancy of 24 TFs and EP300 in a human lymphoblastoid cell line GM12878. Overall, 5% of human TF binding sites have an allelic imbalance in occupancy. At many sites, TFs clustered in TF-binding hubs on the same homolog in especially open chromatin. While genetic variation in core TF binding motifs generally resulted in large allelic differences in TF occupancy, most allelic differences in occupancy were subtle and associated with disruption of weak or noncanonical motifs. We also measured genome-wide differential allelic expression of genes with and without heterozygous exonic variants in the same cells. We found that genes with differential allelic expression were overall less expressed both in GM12878 cells and in unrelated human cell lines. Comparing TF occupancy with expression, we found strong association between allelic occupancy and expression within 100 bp of transcription start sites (TSSs), and weak association up to 100 kb from TSSs. Sites of differential allelic occupancy were significantly enriched for variants associated with disease, particularly autoimmune disease, suggesting that allelic differences in TF occupancy give functional insights into intergenic variants associated with disease. Our results have the potential to increase the power and interpretability of association studies by targeting functional intergenic variants in addition to protein coding sequences.
Genome Research | 2008
Katherine E. Varley; Robi D. Mitra
Medical resequencing of candidate genes in individual patient samples is becoming increasingly important in the clinic and in clinical research. Medical resequencing requires the amplification and sequencing of many candidate genes in many patient samples. Here we introduce Nested Patch PCR, a novel method for highly multiplexed PCR that is very specific, can sensitively detect SNPs and mutations, and is easy to implement. This is the first method that couples multiplex PCR with sample-specific DNA barcodes and next-generation sequencing to enable highly multiplex mutation discovery in candidate genes for multiple samples in parallel. In our pilot study, we amplified exons from colon cancer and matched normal human genomic DNA. From each sample, we successfully amplified 96% (90 of 94) targeted exons from across the genome, totaling 21.6 kbp of sequence. Ninety percent of all sequencing reads were from targeted exons, demonstrating that Nested Patch PCR is highly specific. We found that the abundance of reads per exon was reproducible across samples. We reliably detected germline SNPs and discovered a colon tumor specific nonsense mutation in APC, a gene causally implicated in colorectal cancer. With Nested Patch PCR, candidate gene mutation discovery across multiple individual patient samples can now utilize the power of second-generation sequencing.
Molecular Cell | 2013
Jason Gertz; Daniel Savic; Katherine E. Varley; E. Christopher Partridge; Alexias Safi; Preti Jain; Gregory M. Cooper; Timothy E. Reddy; Gregory E. Crawford; Richard M. Myers
Most human transcription factors bind a small subset of potential genomic sites and often use different subsets in different cell types. To identify mechanisms that govern cell-type-specific transcription factor binding, we used an integrative approach to study estrogen receptor α (ER). We found that ER exhibits two distinct modes of binding. Shared sites, bound in multiple cell types, are characterized by high-affinity estrogen response elements (EREs), inaccessible chromatin, and a lack of DNA methylation, while cell-specific sites are characterized by a lack of EREs, co-occurrence with other transcription factors, and cell-type-specific chromatin accessibility and DNA methylation. These observations enabled accurate quantitative models of ER binding that suggest tethering of ER to one-third of cell-specific sites. The distinct properties of cell-specific binding were also observed with glucocorticoid receptor and for ER in primary mouse tissues, representing an elegant genomic encoding scheme for generating cell-type-specific gene regulation.
Genome Research | 2012
Jason Gertz; Katherine E. Varley; Nicholas S. Davis; Bradley J. Baas; Igor Y. Goryshin; Ramesh Vaidyanathan; Scott Kuersten; Richard M. Myers
RNA-seq has been widely adopted as a gene-expression measurement tool due to the detail, resolution, and sensitivity of transcript characterization that the technique provides. Here we present two transposon-based methods that efficiently construct high-quality RNA-seq libraries. We first describe a method that creates RNA-seq libraries for Illumina sequencing from double-stranded cDNA with only two enzymatic reactions. We generated high-quality RNA-seq libraries from as little as 10 pg of mRNA (∼1 ng of total RNA) with this approach. We also present a strand-specific RNA-seq library construction protocol that combines transposon-based library construction with uracil DNA glycosylase and endonuclease VIII to specifically degrade the second strand constructed during cDNA synthesis. The directional RNA-seq libraries maintain the same quality as the nondirectional libraries, while showing a high degree of strand specificity, such that 99.5% of reads map to the expected genomic strand. Each transposon-based library construction method performed well when compared with standard RNA-seq library construction methods with regard to complexity of the libraries, correlation between biological replicates, and the percentage of reads that align to the genome as well as exons. Our results show that high-quality RNA-seq libraries can be constructed efficiently and in an automatable fashion using transposition technology.
Genome Research | 2012
Jason Gertz; Timothy E. Reddy; Katherine E. Varley; Michael J. Garabedian; Richard M. Myers
Endogenous estrogens that are synthesized in the body impact gene regulation by activating estrogen receptors in diverse cell types. Exogenous compounds that have estrogenic properties can also be found circulating in the blood in both children and adults. The genome-wide impact of these environmental estrogens on gene regulation is unclear. To obtain an integrated view of gene regulation in response to environmental and endogenous estrogens on a genome-wide scale, we performed ChIP-seq to identify estrogen receptor 1 (ESR1; previously estrogen receptor α) binding sites, and RNA-seq in endometrial cancer cells exposed to bisphenol A (BPA; found in plastics), genistein (GEN; found in soybean), or 17β-estradiol (E2; an endogenous estrogen). GEN and BPA treatment induces thousands of ESR1 binding sites and >50 gene expression changes, representing a subset of E2-induced gene regulation changes. Genes affected by E2 were highly enriched for ribosome-associated proteins; however, GEN and BPA failed to regulate most ribosome-associated proteins and instead enriched for transporters of carboxylic acids. Treatment-dependent changes in gene expression were associated with treatment-dependent ESR1 binding sites, with the exception that many genes up-regulated by E2 harbored a BPA-induced ESR1 binding site but failed to show any expression change after BPA treatment. GEN and BPA exhibited a similar relationship to E2 in the breast cancer line T-47D, where cell type specificity played a much larger role than treatment specificity. Overall, both environmental estrogens clearly regulate gene expression through ESR1 on a genome-wide scale, although with lower potency resulting in less ESR1 binding sites and less gene expression changes compared to the endogenous estrogen, E2.