Network


Latest external collaboration on country level. Dive into details by clicking on the dots.

Hotspot


Dive into the research topics where Erez Lieberman Aiden is active.

Publication


Featured researches published by Erez Lieberman Aiden.


Science | 2011

Quantitative Analysis of Culture Using Millions of Digitized Books

Jean-Baptiste Michel; Yuan Kui Shen; Aviva Presser Aiden; Adrian Veres; Matthew K. Gray; Joseph P. Pickett; Dale Hoiberg; Dan Clancy; Peter Norvig; Jon Orwant; Steven Pinker; Martin A. Nowak; Erez Lieberman Aiden

Linguistic and cultural changes are revealed through the analyses of words appearing in books. We constructed a corpus of digitized texts containing about 4% of all books ever printed. Analysis of this corpus enables us to investigate cultural trends quantitatively. We survey the vast terrain of ‘culturomics,’ focusing on linguistic and cultural phenomena that were reflected in the English language between 1800 and 2000. We show how this approach can provide insights about fields as diverse as lexicography, the evolution of grammar, collective memory, the adoption of technology, the pursuit of fame, censorship, and historical epidemiology. Culturomics extends the boundaries of rigorous quantitative inquiry to a wide array of new phenomena spanning the social sciences and the humanities.


Proceedings of the National Academy of Sciences of the United States of America | 2015

Chromatin extrusion explains key features of loop and domain formation in wild-type and engineered genomes

Adrian L. Sanborn; Suhas S.P. Rao; Su-Chen Huang; Neva C. Durand; Miriam Huntley; Andrew Jewett; Ivan D. Bochkov; Dharmaraj Chinnappan; Ashok Cutkosky; Jian Li; Kristopher P. Geeting; Andreas Gnirke; Alexandre Melnikov; Doug McKenna; Elena K. Stamenova; Eric S. Lander; Erez Lieberman Aiden

Significance When the human genome folds up inside the cell nucleus, it is spatially partitioned into numerous loops and contact domains. How these structures form is unknown. Here, we show that data from high-resolution spatial proximity maps are consistent with a model in which a complex, including the proteins CCCTC-binding factor (CTCF) and cohesin, mediates the formation of loops by a process of extrusion. Contact domains form as a byproduct of this process. The model accurately predicts how the genome will fold, using only information about the locations at which CTCF is bound. We demonstrate the ability to reengineer loops and domains in a predictable manner by creating highly targeted mutations, some as small as a single base pair, at CTCF sites. We recently used in situ Hi-C to create kilobase-resolution 3D maps of mammalian genomes. Here, we combine these maps with new Hi-C, microscopy, and genome-editing experiments to study the physical structure of chromatin fibers, domains, and loops. We find that the observed contact domains are inconsistent with the equilibrium state for an ordinary condensed polymer. Combining Hi-C data and novel mathematical theorems, we show that contact domains are also not consistent with a fractal globule. Instead, we use physical simulations to study two models of genome folding. In one, intermonomer attraction during polymer condensation leads to formation of an anisotropic “tension globule.” In the other, CCCTC-binding factor (CTCF) and cohesin act together to extrude unknotted loops during interphase. Both models are consistent with the observed contact domains and with the observation that contact domains tend to form inside loops. However, the extrusion model explains a far wider array of observations, such as why loops tend not to overlap and why the CTCF-binding motifs at pairs of loop anchors lie in the convergent orientation. Finally, we perform 13 genome-editing experiments examining the effect of altering CTCF-binding sites on chromatin folding. The convergent rule correctly predicts the affected loops in every case. Moreover, the extrusion model accurately predicts in silico the 3D maps resulting from each experiment using only the location of CTCF-binding sites in the WT. Thus, we show that it is possible to disrupt, restore, and move loops and domains using targeted mutations as small as a single base pair.


Cell systems | 2016

Juicer Provides a One-Click System for Analyzing Loop-Resolution Hi-C Experiments

Neva C. Durand; Muhammad S. Shamim; Ido Machol; Suhas S.P. Rao; Miriam Huntley; Eric S. Lander; Erez Lieberman Aiden

Hi-C experiments explore the 3D structure of the genome, generating terabases of data to create high-resolution contact maps. Here, we introduce Juicer, an open-source tool for analyzing terabase-scale Hi-C datasets. Juicer allows users without a computational background to transform raw sequence data into normalized contact maps with one click. Juicer produces a hic file containing compressed contact matrices at many resolutions, facilitating visualization and analysis at multiple scales. Structural features, such as loops and domains, are automatically annotated. Juicer is available as open source software at http://aidenlab.org/juicer/.


Proceedings of the National Academy of Sciences of the United States of America | 2016

Deletion of DXZ4 on the human inactive X chromosome alters higher-order genome architecture.

Emily M. Darrow; Miriam Huntley; Olga Dudchenko; Elena K. Stamenova; Neva C. Durand; Zhuo Sun; Su-Chen Huang; Adrian L. Sanborn; Ido Machol; Muhammad S. Shamim; Andrew P. Seberg; Eric S. Lander; Brian P. Chadwick; Erez Lieberman Aiden

Significance In human females, one of the two X chromosomes is inactive (Xi) and adopts an unusual 3D conformation. The Xi chromosome contains superloops, large chromatin loops that are often anchored at the macrosatellite repeat DXZ4, and is partitioned into two large intervals, called superdomains, whose boundary lies at DXZ4. Here, we use spatial proximity mapping, microscopy, and genome editing to study the Xi. We find that superloops and superdomains are conserved across humans, macaque, and mouse. By mapping proximity between three or more loci, we show that superloops tend to occur simultaneously. Deletion of DXZ4 from the human Xi disrupts superloops, eliminates superdomains, and alters chromatin modifications. Finally, we show that a model in which CCCTC-binding factor (CTCF) and cohesin extrude chromatin can explain the formation of superloops and superdomains. During interphase, the inactive X chromosome (Xi) is largely transcriptionally silent and adopts an unusual 3D configuration known as the “Barr body.” Despite the importance of X chromosome inactivation, little is known about this 3D conformation. We recently showed that in humans the Xi chromosome exhibits three structural features, two of which are not shared by other chromosomes. First, like the chromosomes of many species, Xi forms compartments. Second, Xi is partitioned into two huge intervals, called “superdomains,” such that pairs of loci in the same superdomain tend to colocalize. The boundary between the superdomains lies near DXZ4, a macrosatellite repeat whose Xi allele extensively binds the protein CCCTC-binding factor. Third, Xi exhibits extremely large loops, up to 77 megabases long, called “superloops.” DXZ4 lies at the anchor of several superloops. Here, we combine 3D mapping, microscopy, and genome editing to study the structure of Xi, focusing on the role of DXZ4. We show that superloops and superdomains are conserved across eutherian mammals. By analyzing ligation events involving three or more loci, we demonstrate that DXZ4 and other superloop anchors tend to colocate simultaneously. Finally, we show that deleting DXZ4 on Xi leads to the disappearance of superdomains and superloops, changes in compartmentalization patterns, and changes in the distribution of chromatin marks. Thus, DXZ4 is essential for proper Xi packaging.


Cell | 2017

Genome Organization Drives Chromosome Fragility

Andres Canela; Yaakov Maman; Seolkyoung Jung; Nancy Wong; Elsa Callen; Amanda Day; Kyong-Rim Kieffer-Kwon; Aleksandra Pekowska; Hongliang Zhang; Suhas S.P. Rao; Su-Chen Huang; Peter J. McKinnon; Peter D. Aplan; Yves Pommier; Erez Lieberman Aiden; Rafael Casellas; André Nussenzweig

In this study, we show that evolutionarily conserved chromosome loop anchors bound by CCCTC-binding factor (CTCF) and cohesin are vulnerable to DNA double strand breaks (DSBs) mediated by topoisomerase 2B (TOP2B). Polymorphisms in the genome that redistribute CTCF/cohesin occupancy rewire DNA cleavage sites to novel loop anchors. While transcription- and replication-coupled genomic rearrangements have been well documented, we demonstrate that DSBs formed at loop anchors are largely transcription-, replication-, and cell-type-independent. DSBs are continuously formed throughout interphase, are enriched on both sides of strong topological domain borders, and frequently occur at breakpoint clusters commonly translocated in cancer. Thus, loop anchors serve as fragile sites that generate DSBs and chromosomal rearrangements. VIDEO ABSTRACT.


Science | 2017

De novo assembly of the Aedes aegypti genome using Hi-C yields chromosome-length scaffolds

Olga Dudchenko; Sanjit S. Batra; Arina D. Omer; Sarah K. Nyquist; Marie Hoeger; Neva C. Durand; Muhammad S. Shamim; Ido Machol; Eric S. Lander; Aviva Presser Aiden; Erez Lieberman Aiden

Hi-C for mosquito genomes Most genomes sequenced today are determined through the generation of short sequenced bits of DNA that are computationally pieced together like a jigsaw puzzle. This has resulted in the need for funds and additional data to fill in gaps in order to fully assemble the many chromosomes that make up a eukaryotic genome. Dudchenko et al. used the Hi-C method, which measures the distance between contact points within and between chromosomes for scaffold validation, together with correction and ordering to more completely determine the arrangement of short sequencing reads for genome mapping. They validated their approach through the de novo generation of a complete human genome. A comparative analysis of mosquito genomes was made possible by improving the Culex quinquefasciatus genome assembly and generating the genome of Aedes aegypti, the vector of Zika virus. Science, this issue p. 92 The DNA proximity ligation method Hi-C was used to create a genome assembly for the mosquito Aedes aegypti. The Zika outbreak, spread by the Aedes aegypti mosquito, highlights the need to create high-quality assemblies of large genomes in a rapid and cost-effective way. Here we combine Hi-C data with existing draft assemblies to generate chromosome-length scaffolds. We validate this method by assembling a human genome, de novo, from short reads alone (67× coverage). We then combine our method with draft sequences to create genome assemblies of the mosquito disease vectors Ae. aegypti and Culex quinquefasciatus, each consisting of three scaffolds corresponding to the three chromosomes in each species. These assemblies indicate that almost all genomic rearrangements among these species occur within, rather than between, chromosome arms. The genome assembly procedure we describe is fast, inexpensive, and accurate, and can be applied to many species.We present an end-to-end genome assembly of a female Aedes aegypti mosquito, which spreads viral diseases such as yellow fever, dengue, chikungunya, and Zika to humans. The assembly is based on an earlier genome published in 2007 and improved in 2013. The new assembly has a scaffold N50 of 419Mb, with 96.9% of the ungapped sequence anchored to chromosomes. We used the new assembly to examine the conservation of A. aegypti chromosomes. Our results suggest that synteny is strongly conserved between Ae. aegypti and An. gambiae. Comparison to D. melanogaster highlights the extent to which the identity of entire chromosome arms is preserved across dipterans. Main text: Due to its role in the spread of the Zika virus in the Americas, A. aegypti – an important mosquito vector of many human diseases – is causing a new wave of widespread concern (Harmon 2016). The lack of an end-to-end genome assembly is limiting our understanding of the biology of this major arbovirus vector and hinders efforts at disease control. To aid in the response, we present an improved assembly. Our assembly is based on AaegL2 (Nene et al. 2007), which was generated using Sanger reads (8X coverage) assembled using ARACHNE (Jaffe et al. 2003). The AaegL2 assembly consists of 4756 scaffolds spanning 1.3Gb of sequence, with a contig N50 of 82 Kb and scaffold N50 of 1.5Mb. Our effort to improve AaegL2 resulted in the first end-to-end genome assembly of A. aegypti. We refer to our new assembly as AaegL4. (AaegL3 is frequently used to refer to a variant of AaegL2 which includes the mitochondrial genome.) In addition to providing an end-to-end assembly, AaegL4 also improves the quality of anchoring and scaffolding. The AaegL4 assembly is shared at https://tinyurl.com/AaegL4. Given the public health relevance of this work, we choose to share this assembly with the scientific community without delay. A description of the methodology used to generate AaegL4 is currently in preparation.


Proceedings of the National Academy of Sciences of the United States of America | 2016

Transferable model for chromosome architecture

Michele Di Pierro; Bin Zhang; Erez Lieberman Aiden; Peter G. Wolynes; José N. Onuchic

Significance Chromatin consists of DNA and hundreds of proteins that interact with the genetic material. In vivo, chromatin folds into nonrandom structures. The physical mechanism leading to these characteristic conformations, however, remains poorly understood. Here, we introduce a model that generates chromosome conformations by using the idea that chromatin can be subdivided into types based on its biochemical interactions. Chromatin types, which are distinct from DNA sequence, are partially epigenetically controlled and change during cell differentiation, thus constituting a link between epigenetics, chromosomal organization, and cell development. The degree of accuracy achieved by this model supports the viability of the proposed physical mechanism of chromatin folding and makes the computational model a powerful tool for future investigations. In vivo, the human genome folds into a characteristic ensemble of 3D structures. The mechanism driving the folding process remains unknown. We report a theoretical model for chromatin (Minimal Chromatin Model) that explains the folding of interphase chromosomes and generates chromosome conformations consistent with experimental data. The energy landscape of the model was derived by using the maximum entropy principle and relies on two experimentally derived inputs: a classification of loci into chromatin types and a catalog of the positions of chromatin loops. First, we trained our energy function using the Hi-C contact map of chromosome 10 from human GM12878 lymphoblastoid cells. Then, we used the model to perform molecular dynamics simulations producing an ensemble of 3D structures for all GM12878 autosomes. Finally, we used these 3D structures to generate contact maps. We found that simulated contact maps closely agree with experimental results for all GM12878 autosomes. The ensemble of structures resulting from these simulations exhibited unknotted chromosomes, phase separation of chromatin types, and a tendency for open chromatin to lie at the periphery of chromosome territories.


Proceedings of the National Academy of Sciences of the United States of America | 2017

De novo prediction of human chromosome structures: Epigenetic marking patterns encode genome architecture

Michele Di Pierro; Ryan R. Cheng; Erez Lieberman Aiden; Peter G. Wolynes; Jos eacute N. Onuchic

Significance In the nucleus of eukaryotic cells, the genome is organized in three dimensions in an architecture that depends on cell type. This organization is a key element of transcriptional regulation, and its disruption often leads to disease. We demonstrate that it is possible to predict how a genome will fold based on the epigenetic marks that decorate chromatin. Epigenetic marking patterns are used to predict the corresponding ensemble of 3D structures by leveraging both energy landscape theory and neural network-based machine learning. These predictions are extensively validated by the results of DNA-DNA ligation assays and fluorescence microscopy, which are found to be in exceptionally good agreement with theory. Inside the cell nucleus, genomes fold into organized structures that are characteristic of cell type. Here, we show that this chromatin architecture can be predicted de novo using epigenetic data derived from chromatin immunoprecipitation-sequencing (ChIP-Seq). We exploit the idea that chromosomes encode a 1D sequence of chromatin structural types. Interactions between these chromatin types determine the 3D structural ensemble of chromosomes through a process similar to phase separation. First, a neural network is used to infer the relation between the epigenetic marks present at a locus, as assayed by ChIP-Seq, and the genomic compartment in which those loci reside, as measured by DNA-DNA proximity ligation (Hi-C). Next, types inferred from this neural network are used as an input to an energy landscape model for chromatin organization [Minimal Chromatin Model (MiChroM)] to generate an ensemble of 3D chromosome conformations at a resolution of 50 kilobases (kb). After training the model, dubbed Maximum Entropy Genomic Annotation from Biomarkers Associated to Structural Ensembles (MEGABASE), on odd-numbered chromosomes, we predict the sequences of chromatin types and the subsequent 3D conformational ensembles for the even chromosomes. We validate these structural ensembles by using ChIP-Seq tracks alone to predict Hi-C maps, as well as distances measured using 3D fluorescence in situ hybridization (FISH) experiments. Both sets of experiments support the hypothesis of phase separation being the driving process behind compartmentalization. These findings strongly suggest that epigenetic marking patterns encode sufficient information to determine the global architecture of chromosomes and that de novo structure prediction for whole genomes may be increasingly possible.


Molecular Cell | 2017

Myc Regulates Chromatin Decompaction and Nuclear Architecture during B Cell Activation

Kyong Rim Kieffer-Kwon; Keisuke Nimura; Suhas S.P. Rao; Jianliang Xu; Seolkyoung Jung; Aleksandra Pekowska; Marei Dose; Evan Stevens; Ewy Mathe; Peng Dong; Su Chen Huang; Maria Aurelia Ricci; Laura Baranello; Ying Zheng; Francesco Tomassoni Ardori; Wolfgang Resch; Diana A. Stavreva; Steevenson Nelson; Michael J. McAndrew; Adriel Casellas; Elizabeth H. Finn; Charles Gregory; Brian Glenn St Hilaire; Steven M. Johnson; Wendy Dubois; Maria Pia Cosma; Eric Batchelor; David Levens; Robert D. Phair; Tom Misteli

50 years ago, Vincent Allfrey and colleagues discovered that lymphocyte activation triggers massive acetylation of chromatin. However, the molecular mechanisms driving epigenetic accessibility are still unknown. We here show that stimulated lymphocytes decondense chromatin by three differentially regulated steps. First, chromatin is repositioned away from the nuclear periphery in response to global acetylation. Second, histone nanodomain clusters decompact into mononucleosome fibers through a mechanism that requires Myc and continual energy input. Single-molecule imaging shows that this step lowers transcription factor residence time and non-specific collisions during sampling for DNA targets. Third, chromatin interactions shift from long range to predominantly short range, and CTCF-mediated loops and contact domains double in numbers. This architectural change facilitates cognate promoter-enhancer contacts and also requires Myc and continual ATP production. Our results thus define the nature and transcriptional impact of chromatin decondensation and reveal an unexpected role for Myc in the establishment of nuclear topology in mammalian cells.


Proceedings of the National Academy of Sciences of the United States of America | 2017

Polycomb-mediated chromatin loops revealed by a subkilobase-resolution chromatin interaction map

Kyle P. Eagen; Erez Lieberman Aiden; Roger D. Kornberg

Significance Chromatin loops, detected by chemical cross-linking and DNA sequencing, are frequently bounded by the polycomb repressive complex 1 in Drosophila. The loops are associated with important developmental genes, often in a repressed state. These results are in contrast with previous studies on mammalian cells, in which chromatin loops are commonly bounded by CTCF protein, and with the generally accepted role of looping in gene activation. The locations of chromatin loops in Drosophila were determined by Hi-C (chemical cross-linking, restriction digestion, ligation, and high-throughput DNA sequencing). Whereas most loop boundaries or “anchors” are associated with CTCF protein in mammals, loop anchors in Drosophila were found most often in association with the polycomb group (PcG) protein Polycomb (Pc), a subunit of polycomb repressive complex 1 (PRC1). Loops were frequently located within domains of PcG-repressed chromatin. Promoters located at PRC1 loop anchors regulate some of the most important developmental genes and are less likely to be expressed than those not at PRC1 loop anchors. Although DNA looping has most commonly been associated with enhancer–promoter communication, our results indicate that loops are also associated with gene repression.

Collaboration


Dive into the Erez Lieberman Aiden's collaboration.

Top Co-Authors

Avatar
Top Co-Authors

Avatar

Neva C. Durand

Baylor College of Medicine

View shared research outputs
Top Co-Authors

Avatar

Ido Machol

Baylor College of Medicine

View shared research outputs
Top Co-Authors

Avatar
Top Co-Authors

Avatar
Top Co-Authors

Avatar

Su-Chen Huang

Baylor College of Medicine

View shared research outputs
Top Co-Authors

Avatar
Top Co-Authors

Avatar

Ivan D. Bochkov

Baylor College of Medicine

View shared research outputs
Top Co-Authors

Avatar

Miriam Huntley

Baylor College of Medicine

View shared research outputs
Top Co-Authors

Avatar

Olga Dudchenko

Baylor College of Medicine

View shared research outputs
Researchain Logo
Decentralizing Knowledge