Network


Latest external collaboration on country level. Dive into details by clicking on the dots.

Hotspot


Dive into the research topics where Roger Pique-Regi is active.

Publication


Featured researches published by Roger Pique-Regi.


Nature | 2012

DNase I sensitivity QTLs are a major determinant of human expression variation

Jacob F. Degner; Athma A. Pai; Roger Pique-Regi; Jean Baptiste Veyrieras; Daniel J. Gaffney; Joseph K. Pickrell; Sherryl De Leon; Katelyn Michelini; Noah Lewellen; Gregory E. Crawford; Matthew Stephens; Yoav Gilad; Jonathan K. Pritchard

The mapping of expression quantitative trait loci (eQTLs) has emerged as an important tool for linking genetic variation to changes in gene regulation. However, it remains difficult to identify the causal variants underlying eQTLs, and little is known about the regulatory mechanisms by which they act. Here we show that genetic variants that modify chromatin accessibility and transcription factor binding are a major mechanism through which genetic variation leads to gene expression differences among humans. We used DNase I sequencing to measure chromatin accessibility in 70 Yoruba lymphoblastoid cell lines, for which genome-wide genotypes and estimates of gene expression levels are also available. We obtained a total of 2.7 billion uniquely mapped DNase I-sequencing (DNase-seq) reads, which allowed us to produce genome-wide maps of chromatin accessibility for each individual. We identified 8,902 locations at which the DNase-seq read depth correlated significantly with genotype at a nearby single nucleotide polymorphism or insertion/deletion (false discovery rate = 10%). We call such variants ‘DNase I sensitivity quantitative trait loci’ (dsQTLs). We found that dsQTLs are strongly enriched within inferred transcription factor binding sites and are frequently associated with allele-specific changes in transcription factor binding. A substantial fraction (16%) of dsQTLs are also associated with variation in the expression levels of nearby genes (that is, these loci are also classified as eQTLs). Conversely, we estimate that as many as 55% of eQTL single nucleotide polymorphisms are also dsQTLs. Our observations indicate that dsQTLs are highly abundant in the human genome and are likely to be important contributors to phenotypic variation.


Genome Research | 2011

Accurate inference of transcription factor binding from DNA sequence and chromatin accessibility data

Roger Pique-Regi; Jacob F. Degner; Athma A. Pai; Daniel J. Gaffney; Yoav Gilad; Jonathan K. Pritchard

Accurate functional annotation of regulatory elements is essential for understanding global gene regulation. Here, we report a genome-wide map of 827,000 transcription factor binding sites in human lymphoblastoid cell lines, which is comprised of sites corresponding to 239 position weight matrices of known transcription factor binding motifs, and 49 novel sequence motifs. To generate this map, we developed a probabilistic framework that integrates cell- or tissue-specific experimental data such as histone modifications and DNase I cleavage patterns with genomic information such as gene annotation and evolutionary conservation. Comparison to empirical ChIP-seq data suggests that our method is highly accurate yet has the advantage of targeting many factors in a single assay. We anticipate that this approach will be a valuable tool for genome-wide studies of gene regulation in a wide variety of cell types or tissues under diverse conditions.


Cell | 2012

DNA Sequence-Dependent Compartmentalization and Silencing of Chromatin at the Nuclear Lamina

Joseph Zullo; Ignacio A. Demarco; Roger Pique-Regi; Daniel J. Gaffney; Charles B. Epstein; Chauncey J. Spooner; Teresa R Luperchio; Bradley E. Bernstein; Jonathan K. Pritchard; Harinder Singh

A large fraction of the mammalian genome is organized into inactive chromosomal domains along the nuclear lamina. The mechanism by which these lamina associated domains (LADs) are established remains to be elucidated. Using genomic repositioning assays, we show that LADs, spanning the developmentally regulated IgH and Cyp3a loci contain discrete DNA regions that associate chromatin with the nuclear lamina and repress gene activity in fibroblasts. Lamina interaction is established during mitosis and likely involves the localized recruitment of Lamin B during late anaphase. Fine-scale mapping of LADs reveals numerous lamina-associating sequences (LASs), which are enriched for a GAGA motif. This repeated motif directs lamina association and is bound by the transcriptional repressor cKrox, in a complex with HDAC3 and Lap2β. Knockdown of cKrox or HDAC3 results in dissociation of LASs/LADs from the nuclear lamina. These results reveal a mechanism that couples nuclear compartmentalization of chromatin domains with the control of gene activity.


Genome Biology | 2012

Dissecting the regulatory architecture of gene expression QTLs

Daniel J. Gaffney; Jean-Baptiste Veyrieras; Jacob F. Degner; Roger Pique-Regi; Athma A. Pai; Gregory E. Crawford; Matthew Stephens; Yoav Gilad; Jonathan K. Pritchard

BackgroundExpression quantitative trait loci (eQTLs) are likely to play an important role in the genetics of complex traits; however, their functional basis remains poorly understood. Using the HapMap lymphoblastoid cell lines, we combine 1000 Genomes genotypes and an extensive catalogue of human functional elements to investigate the biological mechanisms that eQTLs perturb.ResultsWe use a Bayesian hierarchical model to estimate the enrichment of eQTLs in a wide variety of regulatory annotations. We find that approximately 40% of eQTLs occur in open chromatin, and that they are particularly enriched in transcription factor binding sites, suggesting that many directly impact protein-DNA interactions. Analysis of core promoter regions shows that eQTLs also frequently disrupt some known core promoter motifs but, surprisingly, are not enriched in other well-known motifs such as the TATA box. We also show that information from regulatory annotations alone, when weighted by the hierarchical model, can provide a meaningful ranking of the SNPs that are most likely to drive gene expression variation.ConclusionsOur study demonstrates how regulatory annotation and the association signal derived from eQTL-mapping can be combined into a single framework. We used this approach to further our understanding of the biology that drives human gene expression variation, and of the putatively causal SNPs that underlie it.


Bioinformatics | 2008

Sparse representation and Bayesian detection of genome copy number alterations from microarray data

Roger Pique-Regi; Jordi Monso-Varona; Antonio Ortega; Robert C. Seeger; Timothy J. Triche; Shahab Asgharzadeh

MOTIVATION Genomic instability in cancer leads to abnormal genome copy number alterations (CNA) that are associated with the development and behavior of tumors. Advances in microarray technology have allowed for greater resolution in detection of DNA copy number changes (amplifications or deletions) across the genome. However, the increase in number of measured signals and accompanying noise from the array probes present a challenge in accurate and fast identification of breakpoints that define CNA. This article proposes a novel detection technique that exploits the use of piece wise constant (PWC) vectors to represent genome copy number and sparse Bayesian learning (SBL) to detect CNA breakpoints. METHODS First, a compact linear algebra representation for the genome copy number is developed from normalized probe intensities. Second, SBL is applied and optimized to infer locations where copy number changes occur. Third, a backward elimination (BE) procedure is used to rank the inferred breakpoints; and a cut-off point can be efficiently adjusted in this procedure to control for the false discovery rate (FDR). RESULTS The performance of our algorithm is evaluated using simulated and real genome datasets and compared to other existing techniques. Our approach achieves the highest accuracy and lowest FDR while improving computational speed by several orders of magnitude. The proposed algorithm has been developed into a free standing software application (GADA, Genome Alteration Detection Algorithm). AVAILABILITY http://biron.usc.edu/~piquereg/GADA


Journal of Clinical Oncology | 2012

Clinical Significance of Tumor-Associated Inflammatory Cells in Metastatic Neuroblastoma

Shahab Asgharzadeh; Jill Salo; Lingyun Ji; André Oberthuer; Matthias Fischer; Frank Berthold; Michael Hadjidaniel; Cathy Wei-Yao Liu; Leonid S. Metelitsa; Roger Pique-Regi; Peter Wakamatsu; Judith G. Villablanca; Susan G. Kreissman; Katherine K. Matthay; Hiroyuki Shimada; Wendy B. London; Richard Sposto; Robert C. Seeger

PURPOSE Children diagnosed at age ≥ 18 months with metastatic MYCN-nonamplified neuroblastoma (NBL-NA) are at high risk for disease relapse, whereas those diagnosed at age < 18 months are nearly always cured. In this study, we investigated the hypothesis that expression of genes related to tumor-associated inflammatory cells correlates with the observed differences in survival by age at diagnosis and contributes to a prognostic signature. METHODS Tumor-associated macrophages (TAMs) in localized and metastatic neuroblastomas (n = 71) were assessed by immunohistochemistry. Expression of 44 genes representing tumor and inflammatory cells was quantified in 133 metastatic NBL-NAs to assess age-dependent expression and to develop a logistic regression model to provide low- and high-risk scores for predicting progression-free survival (PFS). Tumors from high-risk patients enrolled onto two additional studies (n = 91) served as independent validation cohorts. RESULTS Metastatic neuroblastomas had higher infiltration of TAMs than locoregional tumors, and metastatic tumors diagnosed in patients at age ≥ 18 months had higher expression of inflammation-related genes than those in patients diagnosed at age < 18 months. Expression of genes representing TAMs (CD33/CD16/IL6R/IL10/FCGR3) contributed to 25% of the accuracy of a novel 14-gene tumor classification score. PFS at 5 years for children diagnosed at age ≥ 18 months with NBL-NA with a low- versus high-risk score was 47% versus 12%, 57% versus 8%, and 50% versus 20% in three independent clinical trials, respectively. CONCLUSION These data suggest that interactions between tumor and inflammatory cells may contribute to the clinical metastatic neuroblastoma phenotype, improve prognostication, and reveal novel therapeutic targets.


Genome Research | 2015

Bacterial infection remodels the DNA methylation landscape of human dendritic cells

Alain Pacis; Ludovic Tailleux; Alexander M. Morin; John J. Lambourne; Julia L. MacIsaac; Vania Yotova; Anne Dumaine; Anne Danckaert; Francesca Luca; Jean Christophe Grenier; Kasper D. Hansen; Brigitte Gicquel; Miao Yu; Athma A. Pai; Chuan He; Jenny Tung; Tomi Pastinen; Michael S. Kobor; Roger Pique-Regi; Yoav Gilad; Luis B. Barreiro

DNA methylation is an epigenetic mark thought to be robust to environmental perturbations on a short time scale. Here, we challenge that view by demonstrating that the infection of human dendritic cells (DCs) with a live pathogenic bacteria is associated with rapid and active demethylation at thousands of loci, independent of cell division. We performed an integrated analysis of data on genome-wide DNA methylation, histone mark patterns, chromatin accessibility, and gene expression, before and after infection. We found that infection-induced demethylation rarely occurs at promoter regions and instead localizes to distal enhancer elements, including those that regulate the activation of key immune transcription factors. Active demethylation is associated with extensive epigenetic remodeling, including the gain of histone activation marks and increased chromatin accessibility, and is strongly predictive of changes in the expression levels of nearby genes. Collectively, our observations show that active, rapid changes in DNA methylation in enhancers play a previously unappreciated role in regulating the transcriptional response to infection, even in nonproliferating cells.


BMC Bioinformatics | 2010

R-Gada: a fast and flexible pipeline for copy number analysis in association studies

Roger Pique-Regi; Alejandro Cáceres; Juan R. González

BackgroundGenome-wide association studies (GWAS) using Copy Number Variation (CNV) are becoming a central focus of genetic research. CNVs have successfully provided target genome regions for some disease conditions where simple genetic variation (i.e., SNPs) has previously failed to provide a clear association.ResultsHere we present a new R package, that integrates: (i) data import from most common formats of Affymetrix, Illumina and aCGH arrays; (ii) a fast and accurate segmentation algorithm to call CNVs based on Genome Alteration Detection Analysis (GADA); and (iii) functions for displaying and exporting the Copy Number calls, identification of recurrent CNVs, multivariate analysis of population structure, and tools for performing association studies. Using a large dataset containing 270 HapMap individuals (Affymetrix Human SNP Array 6.0 Sample Dataset) we demonstrate a flexible pipeline implemented with the package. It requires less than one minute per sample (3 million probe arrays) on a single core computer, and provides a flexible parallelization for very large datasets. Case-control data were generated from the HapMap dataset to demonstrate a GWAS analysis.ConclusionsThe package provides the tools for creating a complete integrated pipeline from data normalization to statistical association. It can effciently handle a massive volume of data consisting of millions of genetic markers and hundreds or thousands of samples with very accurate results.


Bioinformatics | 2009

Joint estimation of copy number variation and reference intensities on multiple DNA arrays using GADA

Roger Pique-Regi; Antonio Ortega; Shahab Asgharzadeh

MOTIVATION The complexity of a large number of recently discovered copy number polymorphisms is much higher than initially thought, thus making it more difficult to detect them in the presence of significant measurement noise. In this scenario, separate normalization and segmentation is prone to lead to many false detections of changes in copy number. New approaches capable of jointly modeling the copy number and the non-copy number (noise) hybridization effects across multiple samples will potentially lead to more accurate results. METHODS In this article, the genome alteration detection analysis (GADA) approach introduced in our previous work is extended to a multiple sample model. The copy number component is independent for each sample and uses a sparse Bayesian prior, while the reference hybridization level is not necessarily sparse but identical on all samples. The expectation maximization (EM) algorithm used to fit the model iteratively determines whether the observed hybridization levels are more likely due to a copy number variation or to a shared hybridization bias. RESULTS The new proposed approach is compared with the currently used strategy of separate normalization followed by independent segmentation of each array. Real microarray data obtained from HapMap samples are randomly partitioned to create different reference sets. Using the new approach, copy number and reference intensity estimates are significantly less variable if the reference set changes; and a higher consistency on copy numbers detected within HapMap family trios is obtained. Finally, the running time to fit the model grows linearly in the number samples and probes. AVAILABILITY http://biron.usc.edu/~piquereg/GADA.


BMC Bioinformatics | 2011

A fast and accurate method to detect allelic genomic imbalances underlying mosaic rearrangements using SNP array data

Juan R. González; Benjamín Rodríguez-Santiago; Alejandro Cáceres; Roger Pique-Regi; Nathaniel Rothman; Stephen J. Chanock; Lluís Armengol; Luis A. Pérez-Jurado

BackgroundMosaicism for copy number and copy neutral chromosomal rearrangements has been recently identified as a relatively common source of genetic variation in the normal population. However its prevalence is poorly defined since it has been only studied systematically in one large-scale study and by using non optimal ad-hoc SNP array data analysis tools, uncovering rather large alterations (> 1 Mb) and affecting a high proportion of cells. Here we propose a novel methodology, Mosaic Alteration Detection-MAD, by providing a software tool that is effective for capturing previously described alterations as wells as new variants that are smaller in size and/or affecting a low percentage of cells.ResultsThe developed method identified all previously known mosaic abnormalities reported in SNP array data obtained from controls, bladder cancer and HapMap individuals. In addition MAD tool was able to detect new mosaic variants not reported before that were smaller in size and with lower percentage of cells affected. The performance of the tool was analysed by studying simulated data for different scenarios. Our method showed high sensitivity and specificity for all assessed scenarios.ConclusionsThe tool presented here has the ability to identify mosaic abnormalities with high sensitivity and specificity. Our results confirm the lack of sensitivity of former methods by identifying new mosaic variants not reported in previously utilised datasets. Our work suggests that the prevalence of mosaic alterations could be higher than initially thought. The use of appropriate SNP array data analysis methods would help in defining the human genome mosaic map.

Collaboration


Dive into the Roger Pique-Regi's collaboration.

Top Co-Authors

Avatar
Top Co-Authors

Avatar
Top Co-Authors

Avatar

Shahab Asgharzadeh

Children's Hospital Los Angeles

View shared research outputs
Top Co-Authors

Avatar

Antonio Ortega

University of Southern California

View shared research outputs
Top Co-Authors

Avatar
Top Co-Authors

Avatar

Athma A. Pai

Massachusetts Institute of Technology

View shared research outputs
Top Co-Authors

Avatar
Top Co-Authors

Avatar
Top Co-Authors

Avatar
Top Co-Authors

Avatar
Researchain Logo
Decentralizing Knowledge