Elliott H. Margulies | Researchain

Archive Network Publication Hotspot Collaboration

Network

Latest external collaboration on country level. Dive into details by clicking on the dots.

Explore More

Hotspot

Dive into the research topics where Elliott H. Margulies is active.

Explore More

Publication

Featured researches published by Elliott H. Margulies.

Cell | 2008

High-Resolution Mapping and Characterization of Open Chromatin across the Genome

Alan P. Boyle; Sean Davis; Hennady P. Shulha; Paul S. Meltzer; Elliott H. Margulies; Zhiping Weng; Terrence S. Furey; Gregory E. Crawford

Mapping DNase I hypersensitive (HS) sites is an accurate method of identifying the location of genetic regulatory elements, including promoters, enhancers, silencers, insulators, and locus control regions. We employed high-throughput sequencing and whole-genome tiled array strategies to identify DNase I HS sites within human primary CD4+ T cells. Combining these two technologies, we have created a comprehensive and accurate genome-wide open chromatin map. Surprisingly, only 16%-21% of the identified 94,925 DNase I HS sites are found in promoters or first exons of known genes, but nearly half of the most open sites are in these regions. In conjunction with expression, motif, and chromatin immunoprecipitation data, we find evidence of cell-type-specific characteristics, including the ability to identify transcription start sites and locations of different chromatin marks utilized in these cells. In addition, and unexpectedly, our analyses have uncovered detailed features of nucleosome structure.

Nature | 2011

A high-resolution map of human evolutionary constraint using 29 mammals

Kerstin Lindblad-Toh; Manuel Garber; Or Zuk; Michael F. Lin; Brian J. Parker; Stefan Washietl; Pouya Kheradpour; Jason Ernst; Gregory Jordan; Evan Mauceli; Lucas D. Ward; Craig B. Lowe; Alisha K. Holloway; Michele Clamp; Sante Gnerre; Jessica Alföldi; Kathryn Beal; Jean Chang; Hiram Clawson; James Cuff; Federica Di Palma; Stephen Fitzgerald; Paul Flicek; Mitchell Guttman; Melissa J. Hubisz; David B. Jaffe; Irwin Jungreis; W. James Kent; Dennis Kostka; Marcia Lara

The comparison of related genomes has emerged as a powerful lens for genome interpretation. Here we report the sequencing and comparative analysis of 29 eutherian genomes. We confirm that at least 5.5% of the human genome has undergone purifying selection, and locate constrained elements covering ∼4.2% of the genome. We use evolutionary signatures and comparisons with experimental data sets to suggest candidate functions for ∼60% of constrained bases. These elements reveal a small number of new coding exons, candidate stop codon readthrough events and over 10,000 regions of overlapping synonymous constraint within protein-coding exons. We find 220 candidate RNA structural families, and nearly a million elements overlapping potential promoter, enhancer and insulator regions. We report specific amino acid residues that have undergone positive selection, 280,000 non-coding elements exapted from mobile elements and more than 1,000 primate- and human-accelerated elements. Overlap with disease-associated variants indicates that our findings will be relevant for studies of human biology, health and disease.

Nature | 2003

Comparative analyses of multi-species sequences from targeted genomic regions

James W. Thomas; Jeffrey W. Touchman; Robert W. Blakesley; Gerard G. Bouffard; Stephen M. Beckstrom-Sternberg; Elliott H. Margulies; Mathieu Blanchette; Adam Siepel; Pamela J. Thomas; Jennifer C. McDowell; Baishali Maskeri; Nancy F. Hansen; M. Schwartz; Ryan Weber; William Kent; Donna Karolchik; T. C. Bruen; R. Bevan; David J. Cutler; Scott Schwartz; Laura Elnitski; Jacquelyn R. Idol; A. B. Prasad; S. Q. Lee-Lin; Valerie Maduro; T. J. Summers; Matthew E. Portnoy; Nicole Dietrich; N. Akhter; K. Ayele

The systematic comparison of genomic sequences from different organisms represents a central focus of contemporary genome analysis. Comparative analyses of vertebrate sequences can identify coding and conserved non-coding regions, including regulatory elements, and provide insight into the forces that have rendered modern-day genomes. As a complement to whole-genome sequencing efforts, we are sequencing and comparing targeted genomic regions in multiple, evolutionarily diverse vertebrates. Here we report the generation and analysis of over 12 megabases (Mb) of sequence from 12 species, all derived from the genomic region orthologous to a segment of about 1.8 Mb on human chromosome 7 containing ten genes, including the gene mutated in cystic fibrosis. These sequences show conservation reflecting both functional constraints and the neutral mutational events that shaped this genomic region. In particular, we identify substantial numbers of conserved non-coding segments beyond those previously identified experimentally, most of which are not detectable by pair-wise sequence comparisons alone. Analysis of transposable element insertions highlights the variation in genome dynamics among these species and confirms the placement of rodents as a sister group to the primates.

Science | 2009

An Expressed Fgf4 Retrogene Is Associated with Breed-Defining Chondrodysplasia in Domestic Dogs

Heidi G. Parker; Bridgett M. vonHoldt; Pascale Quignon; Elliott H. Margulies; Stephanie Shao; Dana S. Mosher; Tyrone C. Spady; Abdel G. Elkahloun; Michele Cargill; Paul Glyn Jones; Cheryl L. Maslen; Gregory M. Acland; Nathan B. Sutter; Keiichi Kuroki; Carlos Bustamante; Robert K. Wayne; Elaine A. Ostrander

Going Retro In a year celebrating Darwin, the question of how new functional genes arise during evolution is of particular interest. Through a multibreed genetic analysis of the domestic dog, Parker et al. (p. 995, published online 16 July; see the Perspective by Kaessmann) find that the short-legged phenotype that characterizes at least 19 common dog breeds, including the corgi, dachshund, and basset hound, is specifically associated with the expression in developing bone of a gene encoding fibroblast growth factor 4 (fgf4), a member of a gene family previously implicated in dwarfism in humans. Interestingly, the culprit fgf4 gene in dogs has the hallmarks of a “retrogene,” a gene that arises when a parental gene is duplicated through an RNA-based copying mechanism. The short legs that characterize certain dog breeds are associated with a gene that arose recently by RNA-based gene duplication. Retrotransposition of processed mRNAs is a common source of novel sequence acquired during the evolution of genomes. Although the vast majority of retroposed gene copies, or retrogenes, rapidly accumulate debilitating mutations that disrupt the reading frame, a small percentage become new genes that encode functional proteins. By using a multibreed association analysis in the domestic dog, we demonstrate that expression of a recently acquired retrogene encoding fibroblast growth factor 4 (fgf4) is strongly associated with chondrodysplasia, a short-legged phenotype that defines at least 19 dog breeds including dachshund, corgi, and basset hound. These results illustrate the important role of a single evolutionary event in constraining and directing phenotypic diversity in the domestic dog.

Genome Research | 2010

Systematic comparison of three genomic enrichment methods for massively parallel DNA sequencing

Jamie K. Teer; Lori L. Bonnycastle; Peter S. Chines; Nancy F. Hansen; Natsuyo Aoyama; Amy J. Swift; Hatice Ozel Abaan; Thomas J. Albert; Nisc Comparative Sequencing Program; Elliott H. Margulies; Eric D. Green; Francis S. Collins; James C. Mullikin; Leslie G. Biesecker

Massively parallel DNA sequencing technologies have greatly increased our ability to generate large amounts of sequencing data at a rapid pace. Several methods have been developed to enrich for genomic regions of interest for targeted sequencing. We have compared three of these methods: Molecular Inversion Probes (MIP), Solution Hybrid Selection (SHS), and Microarray-based Genomic Selection (MGS). Using HapMap DNA samples, we compared each of these methods with respect to their ability to capture an identical set of exons and evolutionarily conserved regions associated with 528 genes (2.61 Mb). For sequence analysis, we developed and used a novel Bayesian genotype-assigning algorithm, Most Probable Genotype (MPG). All three capture methods were effective, but sensitivities (percentage of targeted bases associated with high-quality genotypes) varied for an equivalent amount of pass-filtered sequence: for example, 70% (MIP), 84% (SHS), and 91% (MGS) for 400 Mb. In contrast, all methods yielded similar accuracies of >99.84% when compared to Infinium 1M SNP BeadChip-derived genotypes and >99.998% when compared to 30-fold coverage whole-genome shotgun sequencing data. We also observed a low false-positive rate with all three methods; of the heterozygous positions identified by each of the capture methods, >99.57% agreed with 1M SNP BeadChip, and >98.840% agreed with the whole-genome shotgun data. In addition, we successfully piloted the genomic enrichment of a set of 12 pooled samples via the MGS method using molecular bar codes. We find that these three genomic enrichment methods are highly accurate and practical, with sensitivities comparable to that of 30-fold coverage whole-genome shotgun data.

Neuron | 2011

A Transcriptomic Atlas of Mouse Neocortical Layers

T. Grant Belgard; Ana C. Marques; Peter L. Oliver; Hatice Ozel Abaan; Tamara Sirey; Anna Hoerder-Suabedissen; Fernando García-Moreno; Zoltán Molnár; Elliott H. Margulies; Chris P. Ponting

Summary In the mammalian cortex, neurons and glia form a patterned structure across six layers whose complex cytoarchitectonic arrangement is likely to contribute to cognition. We sequenced transcriptomes from layers 1-6b of different areas (primary and secondary) of the adult (postnatal day 56) mouse somatosensory cortex to understand the transcriptional levels and functional repertoires of coding and noncoding loci for cells constituting these layers. A total of 5,835 protein-coding genes and 66 noncoding RNA loci are differentially expressed (“patterned”) across the layers, on the basis of a machine-learning model (naive Bayes) approach. Layers 2-6b are each associated with specific functional and disease annotations that provide insights into their biological roles. This new resource (http://genserv.anat.ox.ac.uk/layers) greatly extends currently available resources, such as the Allen Mouse Brain Atlas and microarray data sets, by providing quantitative expression levels, by being genome-wide, by including novel loci, and by identifying candidate alternatively spliced transcripts that are differentially expressed across layers.

Science | 2009

Local DNA Topography Correlates with Functional Noncoding Regions of the Human Genome

Stephen C.J. Parker; Loren Hansen; Hatice Ozel Abaan; Thomas D. Tullius; Elliott H. Margulies

The three-dimensional molecular structure of DNA, specifically the shape of the backbone and grooves of genomic DNA, can be dramatically affected by nucleotide changes, which can cause differences in protein-binding affinity and phenotype. We developed an algorithm to measure constraint on the basis of similarity of DNA topography among multiple species, using hydroxyl radical cleavage patterns to interrogate the solvent-accessible surface area of DNA. This algorithm found that 12% of bases in the human genome are evolutionarily constrained—double the number detected by nucleotide sequence–based algorithms. Topography-informed constrained regions correlated with functional noncoding elements, including enhancers, better than did regions identified solely on the basis of nucleotide sequence. These results support the idea that the molecular shape of DNA is under selection and can identify evolutionary history.

PLOS ONE | 2009

Pebble and Rock Band: Heuristic Resolution of Repeats and Scaffolding in the Velvet Short-Read de Novo Assembler

Daniel R. Zerbino; Gayle K. McEwen; Elliott H. Margulies; Ewan Birney

Background Despite the short length of their reads, micro-read sequencing technologies have shown their usefulness for de novo sequencing. However, especially in eukaryotic genomes, complex repeat patterns are an obstacle to large assemblies. Principal Findings We present a novel heuristic algorithm, Pebble, which uses paired-end read information to resolve repeats and scaffold contigs to produce large-scale assemblies. In simulations, we can achieve weighted median scaffold lengths (N50) of above 1 Mbp in Bacteria and above 100 kbp in more complex organisms. Using real datasets we obtained a 96 kbp N50 in Pseudomonas syringae and a unique 147 kbp scaffold of a ferret BAC clone. We also present an efficient algorithm called Rock Band for the resolution of repeats in the case of mixed length assemblies, where different sequencing platforms are combined to obtain a cost-effective assembly. Conclusions These algorithms extend the utility of short read only assemblies into large complex genomes. They have been implemented and made available within the open-source Velvet short-read de novo assembler.

Nature Reviews Genetics | 2013

Sequencing studies in human genetics: design and interpretation

David B. Goldstein; Andrew S. Allen; Jonathan Keebler; Elliott H. Margulies; Steven Petrou; Slavé Petrovski; Shamil R. Sunyaev

Next-generation sequencing is becoming the primary discovery tool in human genetics. There have been many clear successes in identifying genes that are responsible for Mendelian diseases, and sequencing approaches are now poised to identify the mutations that cause undiagnosed childhood genetic diseases and those that predispose individuals to more common complex diseases. There are, however, growing concerns that the complexity and magnitude of complete sequence data could lead to an explosion of weakly justified claims of association between genetic variants and disease. Here, we provide an overview of the basic workflow in next-generation sequencing studies and emphasize, where possible, measures and considerations that facilitate accurate inferences from human sequencing studies.

Cell Metabolism | 2010

Global Epigenomic Analysis of Primary Human Pancreatic Islets Provides Insights into Type 2 Diabetes Susceptibility Loci

Michael L. Stitzel; Praveen Sethupathy; Daniel Pearson; Peter S. Chines; Lingyun Song; Michael R. Erdos; Ryan P. Welch; Stephen C. J. Parker; Alan P. Boyle; Laura J. Scott; Elliott H. Margulies; Michael Boehnke; Terrence S. Furey; Gregory E. Crawford; Francis S. Collins

Identifying cis-regulatory elements is important to understanding how human pancreatic islets modulate gene expression in physiologic or pathophysiologic (e.g., diabetic) conditions. We conducted genome-wide analysis of DNase I hypersensitive sites, histone H3 lysine methylation modifications (K4me1, K4me3, K79me2), and CCCTC factor (CTCF) binding in human islets. This identified ∼18,000 putative promoters (several hundred unannotated and islet-active). Surprisingly, active promoter modifications were absent at genes encoding islet-specific hormones, suggesting a distinct regulatory mechanism. Of 34,039 distal (nonpromoter) regulatory elements, 47% are islet unique and 22% are CTCF bound. In the 18 type 2 diabetes (T2D)-associated loci, we identified 118 putative regulatory elements and confirmed enhancer activity for 12 of 33 tested. Among six regulatory elements harboring T2D-associated variants, two exhibit significant allele-specific differences in activity. These findings present a global snapshot of the human islet epigenome and should provide functional context for noncoding variants emerging from genetic studies of T2D and other islet disorders.

Explore More