Irene Gallego Romero | Researchain

Archive Network Publication Hotspot Collaboration

Network

Latest external collaboration on country level. Dive into details by clicking on the dots.

Explore More

Hotspot

Dive into the research topics where Irene Gallego Romero is active.

Explore More

Publication

Featured researches published by Irene Gallego Romero.

Science | 2012

A Systematic Survey of Loss-of-Function Variants in Human Protein-Coding Genes

Daniel G. MacArthur; Suganthi Balasubramanian; Adam Frankish; Ni Huang; James A. Morris; Klaudia Walter; Luke Jostins; Lukas Habegger; Joseph K. Pickrell; Stephen B. Montgomery; Cornelis A. Albers; Zhengdong D. Zhang; Donald F. Conrad; Gerton Lunter; Hancheng Zheng; Qasim Ayub; Mark A. DePristo; Eric Banks; Min Hu; Robert E. Handsaker; Jeffrey A. Rosenfeld; Menachem Fromer; Mike Jin; Xinmeng Jasmine Mu; Ekta Khurana; Kai Ye; Mike Kay; Gary Saunders; Marie-Marthe Suner; Toby Hunt

Defective Gene Detective Identifying genes that give rise to diseases is one of the major goals of sequencing human genomes. However, putative loss-of-function genes, which are often some of the first identified targets of genome and exome sequencing, have often turned out to be sequencing errors rather than true genetic variants. In order to identify the true scope of loss-of-function genes within the human genome, MacArthur et al. (p. 823; see the Perspective by Quintana-Murci) extensively validated the genomes from the 1000 Genomes Project, as well as an additional European individual, and found that the average person has about 100 true loss-of-function alleles of which approximately 20 have two copies within an individual. Because many known disease-causing genes were identified in “normal” individuals, the process of clinical sequencing needs to reassess how to identify likely causative alleles. Validation of predicted nonfunctional alleles in the human genome affects the medical interpretation of genomic analyses. Genome-sequencing studies indicate that all humans carry many genetic variants predicted to cause loss of function (LoF) of protein-coding genes, suggesting unexpected redundancy in the human genome. Here we apply stringent filters to 2951 putative LoF variants obtained from 185 human genomes to determine their true prevalence and properties. We estimate that human genomes typically contain ~100 genuine LoF variants with ~20 genes completely inactivated. We identify rare and likely deleterious LoF alleles, including 26 known and 21 predicted severe disease–causing variants, as well as common LoF variants in nonessential genes. We describe functional and evolutionary differences between LoF-tolerant and recessive disease genes and a method for using these differences to prioritize candidate genes found in clinical sequencing studies.

Science | 2011

An Aboriginal Australian Genome Reveals Separate Human Dispersals into Asia

Morten Rasmussen; Xiaosen Guo; Yong Wang; Kirk E. Lohmueller; Simon Rasmussen; Anders Albrechtsen; Line Skotte; Stinus Lindgreen; Mait Metspalu; Thibaut Jombart; Toomas Kivisild; Weiwei Zhai; Anders Eriksson; Andrea Manica; Ludovic Orlando; Francisco M. De La Vega; Silvana R. Tridico; Ene Metspalu; Kasper Nielsen; María C. Ávila-Arcos; J. Víctor Moreno-Mayar; Craig Muller; Joe Dortch; M. Thomas P. Gilbert; Ole Lund; Agata Wesolowska; Monika Karmin; Lucy A. Weinert; Bo Wang; Jun Li

Whole-genome data indicate that early modern humans expanded into Australia 62,000 to 75,000 years ago. We present an Aboriginal Australian genomic sequence obtained from a 100-year-old lock of hair donated by an Aboriginal man from southern Western Australia in the early 20th century. We detect no evidence of European admixture and estimate contamination levels to be below 0.5%. We show that Aboriginal Australians are descendants of an early human dispersal into eastern Asia, possibly 62,000 to 75,000 years ago. This dispersal is separate from the one that gave rise to modern Asians 25,000 to 38,000 years ago. We also find evidence of gene flow between populations of the two dispersal waves prior to the divergence of Native Americans from modern Asian ancestors. Our findings support the hypothesis that present-day Aboriginal Australians descend from the earliest humans to occupy Australia, likely representing one of the oldest continuous populations outside Africa.

Nature | 2016

The Simons Genome Diversity Project: 300 genomes from 142 diverse populations

Swapan Mallick; Heng Li; Mark Lipson; Iain Mathieson; Melissa Gymrek; Fernando Racimo; Mengyao Zhao; Niru Chennagiri; Arti Tandon; Pontus Skoglund; Iosif Lazaridis; Sriram Sankararaman; Qiaomei Fu; Nadin Rohland; Gabriel Renaud; Yaniv Erlich; Thomas Willems; Carla Gallo; Jeffrey P. Spence; Yun S. Song; Giovanni Poletti; Francois Balloux; George van Driem; Peter de Knijff; Irene Gallego Romero; Aashish R. Jha; Doron M. Behar; Claudio M. Bravi; Cristian Capelli; Tor Hervig

Here we report the Simons Genome Diversity Project data set: high quality genomes from 300 individuals from 142 diverse populations. These genomes include at least 5.8 million base pairs that are not present in the human reference genome. Our analysis reveals key features of the landscape of human genome variation, including that the rate of accumulation of mutations has accelerated by about 5% in non-Africans compared to Africans since divergence. We show that the ancestors of some pairs of present-day human populations were substantially separated by 100,000 years ago, well before the archaeologically attested onset of behavioural modernity. We also demonstrate that indigenous Australians, New Guineans and Andamanese do not derive substantial ancestry from an early dispersal of modern humans; instead, their modern human ancestry is consistent with coming from the same source as that of other non-Africans.

Nature Reviews Genetics | 2012

Comparative studies of gene expression and the evolution of gene regulation

Irene Gallego Romero; Ilya Ruvinsky; Yoav Gilad

The hypothesis that differences in gene regulation have an important role in speciation and adaptation is more than 40 years old. With the advent of new sequencing technologies, we are able to characterize and study gene expression levels and associated regulatory mechanisms in a large number of individuals and species at an unprecedented resolution and scale. We have thus gained new insights into the evolutionary pressures that shape gene expression levels and have developed an appreciation for the relative importance of evolutionary changes in different regulatory genetic and epigenetic mechanisms. The current challenge is to link gene regulatory changes to adaptive evolution of complex phenotypes. Here we mainly focus on comparative studies in primates and how they are complemented by studies in model organisms.

Science | 2015

Global diversity, population stratification, and selection of human copy-number variation.

Peter H. Sudmant; Swapan Mallick; Bradley J. Nelson; Fereydoun Hormozdiari; Niklas Krumm; John Huddleston; Bradley P. Coe; Carl Baker; Michael J. Bamshad; Lynn B. Jorde; Olga L. Posukh; Hovhannes Sahakyan; W. Scott Watkins; Levon Yepiskoposyan; M. Syafiq Abdullah; Claudio M. Bravi; Cristian Capelli; Tor Hervig; Joseph Wee; Chris Tyler-Smith; George van Driem; Irene Gallego Romero; Aashish R. Jha; Sena Karachanak-Yankova; Draga Toncheva; David Comas; Brenna M. Henn; Toomas Kivisild; Andres Ruiz-Linares; Antti Sajantila

Duplications and deletions in the human genome Duplications and deletions can lead to variation in copy number for genes and genomic loci among humans. Such variants can reveal evolutionary patterns and have implications for human health. Sudmant et al. examined copy-number variation across 236 individual genomes from 125 human populations. Deletions were under more selection, whereas duplications showed more population-specific structure. Interestingly, Oceanic populations retain large duplications postulated to have originated in an ancient Denisovan lineage. Science, this issue 10.1126/science.aab3761 Copy-number variation reveals how selection affects the human genome across the globe. INTRODUCTION Most studies of human genetic variation have focused on single-nucleotide variants (SNVs). However, copy-number variants (CNVs) affect more base pairs of DNA among humans, and yet our understanding of CNV diversity among human populations is limited. RATIONALE We aimed to understand the pattern, selection, and diversity of copy-number variation by analyzing deeply sequenced genomes representing the diversity of all humans. We compared the selective constraints of deletions versus duplications to understand population stratification in the context of the ancestral human genome and to assess differences in CNV load between African and non-African populations. RESULTS We sequenced 236 individual genomes from 125 distinct human populations and identified 14,467 autosomal CNVs and 545 X-linked CNVs with a sequence read-depth approach. Deletions exhibit stronger selective pressure and are better phylogenetic markers of population relationships than duplication polymorphisms. We identified 1036 population-stratified copy-number–variable regions, 295 of which intersect coding regions and 199 of which exhibit extreme signatures of differentiation. Duplicated loci were 1.8-fold more likely to be stratified than deletions but were poorly correlated with flanking genetic diversity. Among these, we highlight a duplication polymorphism restricted to modern Oceanic populations yet also present in the genome of the archaic Denisova hominin. This 225–kilo–base pair (kbp) duplication includes two microRNA genes and is almost fixed among human Papuan-Bougainville genomes. The data allowed us to reconstruct the ancestral human genome and create a more accurate evolutionary framework for the gain and loss of sequences during human evolution. We identified 571 loci that segregate in the human population and another 2026 loci of fixed-copy 2 in all human genomes but absent from the reference genome. The total deletion and duplication load between African and non-African population groups showed no difference after we account for ancestral sequences missing from the human reference. However, we did observe that the relative number of base pairs affected by CNVs compared to single-nucleotide polymorphisms is higher among non-Africans than Africans. CONCLUSION Deletions, duplications, and CNVs have shaped, to different extents, the genetic diversity of human populations by the combined forces of mutation, selection, and demography. Figure Global human CNV diversity and archaic introgression of a chromosome 16 duplication. (Left) The geographic coordinates of populations sampled are indicated on a world map (colored dots). The pie charts show the continental population allele frequency of a single ~225-kbp duplication polymorphism found exclusively among Oceanic populations and an archaic Denisova. (Right) The ancestral structure of this duplication locus (1) and the Denisova duplication structure (2) are shown in relation to their position on chromosome 16. We estimate that the duplication emerged ~440 thousand years ago (ka) in the Denisova and then introgressed into ancestral Papuan populations ~40 ka. In order to explore the diversity and selective signatures of duplication and deletion human copy-number variants (CNVs), we sequenced 236 individuals from 125 distinct human populations. We observed that duplications exhibit fundamentally different population genetic and selective signatures than deletions and are more likely to be stratified between human populations. Through reconstruction of the ancestral human genome, we identify megabases of DNA lost in different human lineages and pinpoint large duplications that introgressed from the extinct Denisova lineage now found at high frequency exclusively in Oceanic populations. We find that the proportion of CNV base pairs to single-nucleotide–variant base pairs is greater among non-Africans than it is among African populations, but we conclude that this difference is likely due to unique aspects of non-African population history as opposed to differences in CNV load.

BMC Biology | 2014

RNA-seq: impact of RNA degradation on transcript quantification

Irene Gallego Romero; Athma A. Pai; Jenny Tung; Yoav Gilad

BackgroundThe use of low quality RNA samples in whole-genome gene expression profiling remains controversial. It is unclear if transcript degradation in low quality RNA samples occurs uniformly, in which case the effects of degradation can be corrected via data normalization, or whether different transcripts are degraded at different rates, potentially biasing measurements of expression levels. This concern has rendered the use of low quality RNA samples in whole-genome expression profiling problematic. Yet, low quality samples (for example, samples collected in the course of fieldwork) are at times the sole means of addressing specific questions.ResultsWe sought to quantify the impact of variation in RNA quality on estimates of gene expression levels based on RNA-seq data. To do so, we collected expression data from tissue samples that were allowed to decay for varying amounts of time prior to RNA extraction. The RNA samples we collected spanned the entire range of RNA Integrity Number (RIN) values (a metric commonly used to assess RNA quality). We observed widespread effects of RNA quality on measurements of gene expression levels, as well as a slight but significant loss of library complexity in more degraded samples.ConclusionsWhile standard normalizations failed to account for the effects of degradation, we found that by explicitly controlling for the effects of RIN using a linear model framework we can correct for the majority of these effects. We conclude that in instances in which RIN and the effect of interest are not associated, this approach can help recover biologically meaningful signals in data from degraded RNA samples.

American Journal of Human Genetics | 2011

Shared and Unique Components of Human Population Structure and Genome-Wide Signals of Positive Selection in South Asia

Mait Metspalu; Irene Gallego Romero; Bayazit Yunusbayev; Gyaneshwer Chaubey; Chandana Basu Mallick; Georgi Hudjashov; Mari Nelis; Reedik Mägi; Ene Metspalu; Maido Remm; Ramasamy Pitchappan; Lalji Singh; Kumarasamy Thangaraj; Richard Villems; Toomas Kivisild

South Asia harbors one of the highest levels genetic diversity in Eurasia, which could be interpreted as a result of its long-term large effective population size and of admixture during its complex demographic history. In contrast to Pakistani populations, populations of Indian origin have been underrepresented in previous genomic scans of positive selection and population structure. Here we report data for more than 600,000 SNP markers genotyped in 142 samples from 30 ethnic groups in India. Combining our results with other available genome-wide data, we show that Indian populations are characterized by two major ancestry components, one of which is spread at comparable frequency and haplotype diversity in populations of South and West Asia and the Caucasus. The second component is more restricted to South Asia and accounts for more than 50% of the ancestry in Indian populations. Haplotype diversity associated with these South Asian ancestry components is significantly higher than that of the components dominating the West Eurasian ancestry palette. Modeling of the observed haplotype diversities suggests that both Indian ancestry components are older than the purported Indo-Aryan invasion 3,500 YBP. Consistent with the results of pairwise genetic distances among world regions, Indians share more ancestry signals with West than with East Eurasians. However, compared to Pakistani populations, a higher proportion of their genes show regionally specific signals of high haplotype homozygosity. Among such candidates of positive selection in India are MSTN and DOK5, both of which have potential implications in lipid metabolism and the etiology of type 2 diabetes.

Molecular Biology and Evolution | 2012

Herders of Indian and European Cattle Share Their Predominant Allele for Lactase Persistence

Irene Gallego Romero; Chandana Basu Mallick; Anke Liebert; Federica Crivellaro; Gyaneshwer Chaubey; Yuval Itan; Mait Metspalu; Muthukrishnan Eaaswarkhanth; Ramasamy Pitchappan; Richard Villems; David Reich; Lalji Singh; Kumarasamy Thangaraj; Mark G. Thomas; Dallas M. Swallow; Marta Mirazón Lahr; Toomas Kivisild

Milk consumption and lactose digestion after weaning are exclusively human traits made possible by the continued production of the enzyme lactase in adulthood. Multiple independent mutations in a 100-bp region--part of an enhancer--approximately 14-kb upstream of the LCT gene are associated with this trait in Europeans and pastoralists from Saudi Arabia and Africa. However, a single mutation of purported western Eurasian origin accounts for much of observed lactase persistence outside Africa. Given the high levels of present-day milk consumption in India, together with archaeological and genetic evidence for the independent domestication of cattle in the Indus valley roughly 7,000 years ago, we sought to determine whether lactase persistence has evolved independently in the subcontinent. Here, we present the results of the first comprehensive survey of the LCT enhancer region in south Asia. Having genotyped 2,284 DNA samples from across the Indian subcontinent, we find that the previously described west Eurasian -13910 C>T mutation accounts for nearly all the genetic variation we observed in the 400- to 700-bp LCT regulatory region that we sequenced. Geography is a significant predictor of -13910*T allele frequency, and consistent with other genomic loci, its distribution in India follows a general northwest to southeast declining pattern, although frequencies among certain neighboring populations vary substantially. We confirm that the mutation is identical by descent to the European allele and is associated with the same>1 Mb extended haplotype in both populations.

eLife | 2015

A panel of induced pluripotent stem cells from chimpanzees: a resource for comparative functional genomics

Irene Gallego Romero; Bryan J Pavlovic; Irene Hernando-Herraez; Xiang Zhou; Michelle C. Ward; Nicholas E. Banovich; Courtney L Kagan; Jonathan E. Burnett; Constance H Huang; Amy Mitrano; Claudia Chavarria; Inbar Friedrich Ben-Nun; Yingchun Li; Karen Sabatini; Trevor R. Leonardo; Mana M. Parast; Tomas Marques-Bonet; Louise C. Laurent; Jeanne F. Loring; Yoav Gilad

Comparative genomics studies in primates are restricted due to our limited access to samples. In order to gain better insight into the genetic processes that underlie variation in complex phenotypes in primates, we must have access to faithful model systems for a wide range of cell types. To facilitate this, we generated a panel of 7 fully characterized chimpanzee induced pluripotent stem cell (iPSC) lines derived from healthy donors. To demonstrate the utility of comparative iPSC panels, we collected RNA-sequencing and DNA methylation data from the chimpanzee iPSCs and the corresponding fibroblast lines, as well as from 7 human iPSCs and their source lines, which encompass multiple populations and cell types. We observe much less within-species variation in iPSCs than in somatic cells, indicating the reprogramming process erases many inter-individual differences. The low within-species regulatory variation in iPSCs allowed us to identify many novel inter-species regulatory differences of small magnitude. DOI: http://dx.doi.org/10.7554/eLife.07103.001

PLOS ONE | 2015

Increased Risk of Genetic and Epigenetic Instability in Human Embryonic Stem Cells Associated with Specific Culture Conditions

Ibon Garitaonandia; Hadar Amir; Francesca S. Boscolo; Gerald Wambua; Heather L. Schultheisz; Karen Sabatini; Robert Morey; Shannon Waltz; Yu-Chieh Wang; Ha Tran; Trevor R. Leonardo; Kristopher L. Nazor; Ileana Slavin; Candace L. Lynch; Yingchun Li; Ronald Coleman; Irene Gallego Romero; Gulsah Altun; David Reynolds; Stephen Dalton; Mana M. Parast; Jeanne F. Loring; Louise C. Laurent

The self-renewal and differentiation capacities of human pluripotent stem cells (hPSCs) make them a promising source of material for cell transplantation therapy, drug development, and studies of cellular differentiation and development. However, the large numbers of cells necessary for many of these applications require extensive expansion of hPSC cultures, a process that has been associated with genetic and epigenetic alterations. We have performed a combinatorial study on both hESCs and hiPSCs to compare the effects of enzymatic vs. mechanical passaging, and feeder-free vs. mouse embryonic fibroblast feeder substrate, on the genetic and epigenetic stability and the phenotypic characteristics of hPSCs. In extensive experiments involving over 100 continuous passages, we observed that both enzymatic passaging and feeder-free culture were associated with genetic instability, higher rates of cell proliferation, and persistence of OCT4/POU5F1-positive cells in teratomas, with enzymatic passaging having the stronger effect. In all combinations of culture conditions except for mechanical passaging on feeder layers, we noted recurrent deletions in the genomic region containing the tumor suppressor gene TP53, which was associated with decreased mRNA expression of TP53, as well as alterations in the expression of several downstream genes consistent with a decrease in the activity of the TP53 pathway. Among the hESC cultures, we also observed culture-associated variations in global gene expression and DNA methylation. The effects of enzymatic passaging and feeder-free conditions were also observed in hiPSC cultures. Our results highlight the need for careful assessment of the effects of culture conditions on cells intended for clinical therapies.

Explore More