Ines de Santiago
University of Cambridge
Network
Latest external collaboration on country level. Dive into details by clicking on the dots.
Publication
Featured researches published by Ines de Santiago.
Nature Communications | 2013
Michael N. C. Fletcher; Mauro A. A. Castro; Xin Wang; Ines de Santiago; Martin O’Reilly; Suet-Feung Chin; Oscar M. Rueda; Carlos Caldas; Bruce A.J. Ponder; Florian Markowetz; Kerstin B. Meyer
The fibroblast growth factor receptor 2 (FGFR2) locus has been consistently identified as a breast cancer risk locus in independent genome-wide association studies. However, the molecular mechanisms underlying FGFR2-mediated risk are still unknown. Using model systems we show that FGFR2-regulated genes are preferentially linked to breast cancer risk loci in expression quantitative trait loci analysis, supporting the concept that risk genes cluster in pathways. Using a network derived from 2,000 transcriptional profiles we identify SPDEF, ERα, FOXA1, GATA3 and PTTG1 as master regulators of fibroblast growth factor receptor 2 signalling, and show that ERα occupancy responds to fibroblast growth factor receptor 2 signalling. Our results indicate that ERα, FOXA1 and GATA3 contribute to the regulation of breast cancer susceptibility genes, which is consistent with the effects of anti-oestrogen treatment in breast cancer prevention, and suggest that fibroblast growth factor receptor 2 signalling has an important role in mediating breast cancer risk.
Frontiers in Genetics | 2014
Thomas Carroll; Ziwei Liang; Rafik Salama; Rory Stark; Ines de Santiago
With the advent of ChIP-seq multiplexing technologies and the subsequent increase in ChIP-seq throughput, the development of working standards for the quality assessment of ChIP-seq studies has received significant attention. The ENCODE consortiums large scale analysis of transcription factor binding and epigenetic marks as well as concordant work on ChIP-seq by other laboratories has established a new generation of ChIP-seq quality control measures. The use of these metrics alongside common processing steps has however not been evaluated. In this study, we investigate the effects of blacklisting and removal of duplicated reads on established metrics of ChIP-seq quality and show that the interpretation of these metrics is highly dependent on the ChIP-seq preprocessing steps applied. Further to this we perform the first investigation of the use of these metrics for ChIP-exo data and make recommendations for the adaptation of the NSC statistic to allow for the assessment of ChIP-exo efficiency.
Genome Biology | 2014
Filipe Correia Martins; Ines de Santiago; Anne Trinh; Jian Xian; Anne Guo; Karen Sayal; Mercedes Jimenez-Linan; Suha Deen; Kristy Driver; Marie Mack; Jennifer Aslop; Paul Pharoah; Florian Markowetz; James D. Brenton
BackgroundTP53 and BRCA1/2 mutations are the main drivers in high-grade serous ovarian carcinoma (HGSOC). We hypothesise that combining tissue phenotypes from image analysis of tumour sections with genomic profiles could reveal other significant driver events.ResultsAutomatic estimates of stromal content combined with genomic analysis of TCGA HGSOC tumours show that stroma strongly biases estimates of PTEN expression. Tumour-specific PTEN expression was tested in two independent cohorts using tissue microarrays containing 521 cases of HGSOC. PTEN loss or downregulation occurred in 77% of the first cohort by immunofluorescence and 52% of the validation group by immunohistochemistry, and is associated with worse survival in a multivariate Cox-regression model adjusted for study site, age, stage and grade. Reanalysis of TCGA data shows that hemizygous loss of PTEN is common (36%) and expression of PTEN and expression of androgen receptor are positively associated. Low androgen receptor expression was associated with reduced survival in data from TCGA and immunohistochemical analysis of the first cohort.ConclusionPTEN loss is a common event in HGSOC and defines a subgroup with significantly worse prognosis, suggesting the rational use of drugs to target PI3K and androgen receptor pathways for HGSOC. This work shows that integrative approaches combining tissue phenotypes from images with genomic analysis can resolve confounding effects of tissue heterogeneity and should be used to identify new drivers in other cancers.
Nature Genetics | 2016
Mauro A. A. Castro; Ines de Santiago; Tom Campbell; Courtney Vaughn; Theresa E. Hickey; Edith M. Ross; Wayne D. Tilley; Florian Markowetz; Bruce A.J. Ponder; Kerstin B. Meyer
Genetic risk for breast cancer is conferred by a combination of multiple variants of small effect. To better understand how risk loci might combine, we examined whether risk-associated genes share regulatory mechanisms. We created a breast cancer gene regulatory network comprising transcription factors and groups of putative target genes (regulons) and asked whether specific regulons are enriched for genes associated with risk loci via expression quantitative trait loci (eQTLs). We identified 36 overlapping regulons that were enriched for risk loci and formed a distinct cluster within the network, suggesting shared biology. The risk transcription factors driving these regulons are frequently mutated in cancer and lie in two opposing subgroups, which relate to estrogen receptor (ER)+ luminal A or luminal B and ER− basal-like cancers and to different luminal epithelial cell populations in the adult mammary gland. Our network approach provides a foundation for determining the regulatory circuits governing breast cancer, to identify targets for intervention, and is transferable to other disease settings.
Cancer Research | 2016
Mika Hilvo; Ines de Santiago; Peddinti Gopalacharyulu; Wolfgang D. Schmitt; Jan Budczies; Marc Kuhberg; Manfred Dietel; Tero Aittokallio; Florian Markowetz; Carsten Denkert; Jalid Sehouli; Christian Frezza; Silvia Darb-Esfahani; Elena Ioana Braicu
Ovarian cancer is a heterogeneous disease of low prevalence, but poor survival. Early diagnosis is critical for survival, but it is often challenging because the symptoms of ovarian cancer are subtle and become apparent only during advanced stages of the disease. Therefore, the identification of robust biomarkers of early disease is a clinical priority. Metabolomic profiling is an emerging diagnostic tool enabling the detection of biomarkers reflecting alterations in tumor metabolism, a hallmark of cancer. In this study, we performed metabolomic profiling of serum and tumor tissue from 158 patients with high-grade serous ovarian cancer (HGSOC) and 100 control patients with benign or non-neoplastic lesions. We report metabolites of hydroxybutyric acid (HBA) as novel diagnostic and prognostic biomarkers associated with tumor burden and patient survival. The accumulation of HBA metabolites caused by HGSOC was also associated with reduced expression of succinic semialdehyde dehydrogenase (encoded by ALDH5A1), and with the presence of an epithelial-to-mesenchymal transition gene signature, implying a role for these metabolic alterations in cancer cell migration and invasion. In conclusion, our findings represent the first comprehensive metabolomics analysis in HGSOC and propose a new set of metabolites as biomarkers of disease with diagnostic and prognostic capabilities.
Carcinogenesis | 2016
Tom Campbell; Mauro A. A. Castro; Ines de Santiago; Michael N. C. Fletcher; Silvia Halim; Radhika Prathalingam; Bruce A.J. Ponder; Kerstin B. Meyer
Summary The fibroblast growth factor receptor 2 (FGFR2) locus is the ‘top hit’ in genome-wide association studies for breast cancer. Here, we examine the effect of FGFR2 signalling on transcriptional networks in breast cancer and propose a mechanism for FGFR2 risk single-nucleotide polymorphism function.
PLOS Medicine | 2017
Shivan Sivakumar; Ines de Santiago; Leon Chlon; Florian Markowetz
Background KRAS is the most frequently mutated gene in pancreatic ductal adenocarcinoma (PDAC), but the mechanisms underlying the transcriptional response to oncogenic KRAS are still not fully understood. We aimed to uncover transcription factors that regulate the transcriptional response of oncogenic KRAS in pancreatic cancer and to understand their clinical relevance. Methods and Findings We applied a well-established network biology approach (master regulator analysis) to combine a transcriptional signature for oncogenic KRAS derived from a murine isogenic cell line with a coexpression network derived by integrating 560 human pancreatic cancer cases across seven studies. The datasets included the ICGC cohort (n = 242), the TCGA cohort (n = 178), and five smaller studies (n = 17, 25, 26, 36, and 36). 55 transcription factors were coexpressed with a significant number of genes in the transcriptional signature (gene set enrichment analysis [GSEA] p < 0.01). Community detection in the coexpression network identified 27 of the 55 transcription factors contributing to three major biological processes: Notch pathway, down-regulated Hedgehog/Wnt pathway, and cell cycle. The activities of these processes define three distinct subtypes of PDAC, which demonstrate differences in survival and mutational load as well as stromal and immune cell composition. The Hedgehog subgroup showed worst survival (hazard ratio 1.73, 95% CI 1.1 to 2.72, coxPH test p = 0.018) and the Notch subgroup the best (hazard ratio 0.62, 95% CI 0.42 to 0.93, coxPH test p = 0.019). The cell cycle subtype showed highest mutational burden (ANOVA p < 0.01) and the smallest amount of stromal admixture (ANOVA p < 2.2e–16). This study is limited by the information provided in published datasets, not all of which provide mutational profiles, survival data, or the specifics of treatment history. Conclusions Our results characterize the regulatory mechanisms underlying the transcriptional response to oncogenic KRAS and provide a framework to develop strategies for specific subtypes of this disease using current therapeutics and by identifying targets for new groups.
Genome Biology | 2017
Ines de Santiago; Wei Liu; Ke Yuan; Martin O’Reilly; Chandra Sekhar Reddy Chilamakuri; Bruce A.J. Ponder; Kerstin B. Meyer; Florian Markowetz
Allele-specific measurements of transcription factor binding from ChIP-seq data are key to dissecting the allelic effects of non-coding variants and their contribution to phenotypic diversity. However, most methods of detecting an allelic imbalance assume diploid genomes. This assumption severely limits their applicability to cancer samples with frequent DNA copy-number changes. Here we present a Bayesian statistical approach called BaalChIP to correct for the effect of background allele frequency on the observed ChIP-seq read counts. BaalChIP allows the joint analysis of multiple ChIP-seq samples across a single variant and outperforms competing approaches in simulations. Using 548 ENCODE ChIP-seq and six targeted FAIRE-seq samples, we show that BaalChIP effectively corrects allele-specific analysis for copy-number variation and increases the power to detect putative cis-acting regulatory variants in cancer genomes.
bioRxiv | 2017
Ines de Santiago; Christopher Yau; Mark R. Middleton; Michael L. Dustin; Florian Markowetz; Shivan Sivakumar
Pancreatic ductal adenocarcinoma (PDAC) is the most common malignancy of the pancreas and has one of the highest mortality rates of any cancer type with a 5-year survival rate of < 5% and median overall survival of typically six months from diagnosis. Recent transcriptional studies of PDAC have provided several competing stratifications of the disease. However, the development of therapeutic strategies will depend on a unique and coherent classification of PDAC. Here, we use an integrative meta-analysis of four different PDAC gene expression studies to derive the consensus PDAC classification. Despite the fact that immunotherapies have yet to have an impact in treatment of PDAC, the gene expression signatures that stratify PDAC across studies are immunologic. We define these as “adaptive”, “innate” and “immune-exclusion” immunologic signatures, which are prognostic across independent cohorts. An appreciation of the immune composition of PDAC with prognostic significance is an opportunity to understand distinct immune escape mechanisms in development of the disease and design novel immune-oncology therapeutic strategies to overcome current barriers.
Archive | 2018
Ines de Santiago; Thomas Carroll
The development of novel high-throughput sequencing methods for ChIP (chromatin immunoprecipitation) has provided a very powerful tool to study gene regulation in multiple conditions at unprecedented resolution and scale. Proactive quality-control and appropriate data analysis techniques are of critical importance to extract the most meaningful results from the data. Over the last years, an array of R/Bioconductor tools has been developed allowing researchers to process and analyze ChIP-seq data. This chapter provides an overview of the methods available to analyze ChIP-seq data based primarily on software packages from the open-source Bioconductor project. Protocols described in this chapter cover basic steps including data alignment, peak calling, quality control and data visualization, as well as more complex methods such as the identification of differentially bound regions and functional analyses to annotate regulatory regions. The steps in the data analysis process were demonstrated on publicly available data sets and will serve as a demonstration of the computational procedures routinely used for the analysis of ChIP-seq data in R/Bioconductor, from which readers can construct their own analysis pipelines.