Network


Latest external collaboration on country level. Dive into details by clicking on the dots.

Hotspot


Dive into the research topics where Jorge Amigo is active.

Publication


Featured researches published by Jorge Amigo.


Nucleic Acids Research | 2010

Genome-wide identification of hypoxia-inducible factor binding sites and target genes by a probabilistic model integrating transcription-profiling data and in silico binding site prediction

Amaya Ortiz-Barahona; Diego Villar; Nuria Pescador; Jorge Amigo; Luis del Peso

The transcriptional response driven by Hypoxia-inducible factor (HIF) is central to the adaptation to oxygen restriction. Hence, the complete identification of HIF targets is essential for understanding the cellular responses to hypoxia. Herein we describe a computational strategy based on the combination of phylogenetic footprinting and transcription profiling meta-analysis for the identification of HIF-target genes. Comparison of the resulting candidates with published HIF1a genome-wide chromatin immunoprecipitation indicates a high sensitivity (78%) and specificity (97.8%). To validate our strategy, we performed HIF1a chromatin immunoprecipitation on a set of putative targets. Our results confirm the robustness of the computational strategy in predicting HIF-binding sites and reveal several novel HIF targets, including RE1-silencing transcription factor co-repressor (RCOR2). In addition, mapping of described polymorphisms to the predicted HIF-binding sites identified several single-nucleotide polymorphisms (SNPs) that could alter HIF binding. As a proof of principle, we demonstrate that SNP rs17004038, mapping to a functional hypoxia response element in the macrophage migration inhibitory factor (MIF) locus, prevents induction of this gene by hypoxia. Altogether, our results show that the proposed strategy is a powerful tool for the identification of HIF direct targets that expands our knowledge of the cellular adaptation to hypoxia and provides cues on the inter-individual variation in this response.


BMC Bioinformatics | 2008

SPSmart: adapting population based SNP genotype databases for fast and comprehensive web access

Jorge Amigo; Antonio Salas; C. Phillips; Angel Carracedo

BackgroundIn the last five years large online resources of human variability have appeared, notably HapMap, Perlegen and the CEPH foundation. These databases of genotypes with population information act as catalogues of human diversity, and are widely used as reference sources for population genetics studies. Although many useful conclusions may be extracted by querying databases individually, the lack of flexibility for combining data from within and between each database does not allow the calculation of key population variability statistics.ResultsWe have developed a novel tool for accessing and combining large-scale genomic databases of single nucleotide polymorphisms (SNPs) in widespread use in human population genetics: SPSmart (SNPs for Population Studies). A fast pipeline creates and maintains a data mart from the most commonly accessed databases of genotypes containing population information: data is mined, summarized into the standard statistical reference indices, and stored into a relational database that currently handles as many as 4 × 109 genotypes and that can be easily extended to new database initiatives. We have also built a web interface to the data mart that allows the browsing of underlying data indexed by population and the combining of populations, allowing intuitive and straightforward comparison of population groups. All the information served is optimized for web display, and most of the computations are already pre-processed in the data mart to speed up the data browsing and any computational treatment requested.ConclusionIn practice, SPSmart allows populations to be combined into user-defined groups, while multiple databases can be accessed and compared in a few simple steps from a single query. It performs the queries rapidly and gives straightforward graphical summaries of SNP population variability through visual inspection of allele frequencies outlined in standard pie-chart format. In addition, full numerical description of the data is output in statistical results panels that include common population genetics metrics such as heterozygosity, Fst and In.


Forensic Science International-genetics | 2011

Analysis of global variability in 15 established and 5 new European Standard Set (ESS) STRs using the CEPH human genome diversity panel

C. Phillips; L. Fernandez-Formoso; Manuel García-Magariños; L. Porras; Torben Tvedebrink; Jorge Amigo; M. Fondevila; Antonio Gómez-Tato; José Antonio Álvarez-Dios; Ana Freire-Aradas; Alberto Gómez-Carballa; Ana Mosquera-Miguel; Angel Carracedo; M.V. Lareu

The CEPH human genome diversity cell line panel (CEPH-HGDP) of 51 globally distributed populations was used to analyze patterns of variability in 20 core human identification STRs. The markers typed comprised the 15 STRs of Identifiler, one of the most widely used forensic STR multiplexes, plus five recently introduced European Standard Set (ESS) STRs: D1S1656, D2S441, D10S1248, D12S391 and D22S1045. From the genotypes obtained for the ESS STRs we identified rare, intermediate or off-ladder alleles that had not been previously reported for these loci. Examples of novel ESS STR alleles found were characterized by sequence analysis. This revealed extensive repeat structure variation in three ESS STRs, with D12S391 showing particularly high variability for tandem runs of AGAT and AGAC repeat units. The global geographic distribution of the CEPH panel samples gave an opportunity to study in detail the extent of substructure shown by the 20 STRs amongst populations and between their parent population groups. An assessment was made of the forensic informativeness of the new ESS STRs compared to the loci they will replace: CSF1PO, D5S818, D7S820, D13S317 and TPOX, with results showing a clear enhancement of discrimination power using multiplexes that genotype the new ESS loci. We also measured the ability of Identifiler and ESS STRs to infer the ancestry of the CEPH-HGDP samples and demonstrate that forensic STRs in large multiplexes have the potential to differentiate the major population groups but only with sufficient reliability when used with other ancestry-informative markers such as single nucleotide polymorphisms. Finally we checked for possible association by linkage between the two ESS multiplex STRs closely positioned on chromosome-12: vWA and D12S391 by examining paired genotypes from the complete CEPH data set.


BMC Bioinformatics | 2009

Viability of in-house datamarting approaches for population genetics analysis of SNP genotypes

Jorge Amigo; C. Phillips; Antonio Salas; Angel Carracedo

BackgroundDatabases containing very large amounts of SNP (Single Nucleotide Polymorphism) data are now freely available for researchers interested in medical and/or population genetics applications. While many of these SNP repositories have implemented data retrieval tools for general-purpose mining, these alone cannot cover the broad spectrum of needs of most medical and population genetics studies.ResultsTo address this limitation, we have built in-house customized data marts from the raw data provided by the largest public databases. In particular, for population genetics analysis based on genotypes we have built a set of data processing scripts that deal with raw data coming from the major SNP variation databases (e.g. HapMap, Perlegen), stripping them into single genotypes and then grouping them into populations, then merged with additional complementary descriptive information extracted from dbSNP. This allows not only in-house standardization and normalization of the genotyping data retrieved from different repositories, but also the calculation of statistical indices from simple allele frequency estimates to more elaborate genetic differentiation tests within populations, together with the ability to combine population samples from different databases.ConclusionThe present study demonstrates the viability of implementing scripts for handling extensive datasets of SNP genotypes with low computational costs, dealing with certain complex issues that arise from the divergent nature and configuration of the most popular SNP repositories. The information contained in these databases can also be enriched with additional information obtained from other complementary databases, in order to build a dedicated data mart. Updating the data structure is straightforward, as well as permitting easy implementation of new external data and the computation of supplementary statistical indices of interest.


International Journal of Legal Medicine | 2008

The SNPforID browser: an online tool for query and display of frequency data from the SNPforID project

Jorge Amigo; C. Phillips; M.V. Lareu; Angel Carracedo

The SNPforID browser is a web-based tool for the query and visualization of the SNP allele frequency data generated by the SNPforID consortium (http://www.snpforid.org/). From this project, validated panels of single nucleotide polymorphisms (SNPs) for a variety of forensic applications have been generated with the browser concentrating on the single-tube identification SNP set comprising 52 markers. A web interface allows the visitor to review the allele frequencies of the studied markers from all the available populations used by SNPforID to validate global SNP variability. The interface has been designed to offer the useful facility of combining populations into appropriate geographic groups for visual comparison of populations individually or amongst user-defined groupings and with equivalent HapMap data.


BMC Bioinformatics | 2011

ENGINES: exploring single nucleotide variation in entire human genomes

Jorge Amigo; Antonio Salas; C. Phillips

BackgroundNext generation ultra-sequencing technologies are starting to produce extensive quantities of data from entire human genome or exome sequences, and therefore new software is needed to present and analyse this vast amount of information. The 1000 Genomes project has recently released raw data for 629 complete genomes representing several human populations through their Phase I interim analysis and, although there are certain public tools available that allow exploration of these genomes, to date there is no tool that permits comprehensive population analysis of the variation catalogued by such data.DescriptionWe have developed a genetic variant site explorer able to retrieve data for Single Nucleotide Variation (SNVs), population by population, from entire genomes without compromising future scalability and agility. ENGINES (ENtire Genome INterface for Exploring SNVs) uses data from the 1000 Genomes Phase I to demonstrate its capacity to handle large amounts of genetic variation (>7.3 billion genotypes and 28 million SNVs), as well as deriving summary statistics of interest for medical and population genetics applications. The whole dataset is pre-processed and summarized into a data mart accessible through a web interface. The query system allows the combination and comparison of each available population sample, while searching by rs-number list, chromosome region, or genes of interest. Frequency and FST filters are available to further refine queries, while results can be visually compared with other large-scale Single Nucleotide Polymorphism (SNP) repositories such as HapMap or Perlegen.ConclusionsENGINES is capable of accessing large-scale variation data repositories in a fast and comprehensive manner. It allows quick browsing of whole genome variation, while providing statistical information for each variant site such as allele frequency, heterozygosity or FST values for genetic differentiation. Access to the data mart generating scripts and to the web interface is granted from http://spsmart.cesga.es/engines.php


Bioinformatics | 2015

BigBWA: approaching the Burrows–Wheeler aligner to Big Data technologies

José Manuel Abuín; Juan Carlos Pichel; Tomás F. Pena; Jorge Amigo

UNLABELLED BigBWA is a new tool that uses the Big Data technology Hadoop to boost the performance of the Burrows-Wheeler aligner (BWA). Important reductions in the execution times were observed when using this tool. In addition, BigBWA is fault tolerant and it does not require any modification of the original BWA source code. AVAILABILITY AND IMPLEMENTATION BigBWA is available at the project GitHub repository: https://github.com/citiususc/BigBWA.


Journal of Medical Genetics | 2014

A novel stop mutation in the vascular endothelial growth factor-C gene (VEGFC) results in Milroy-like disease

Emilia Balboa-Beltrán; María J Fernández-Seara; Alejandro Pérez-Muñuzuri; Ramón Lago; Carlos García-Magán; María L. Couce; Beatriz Sobrino; Jorge Amigo; Angel Carracedo; Francisco Barros

Background Milroy and Milroy-like disease are rare disorders characterised by congenital lymphoedema caused by dysfunctional lymphatic vessel formation. Loss of extracellular response mediated by vascular endothelial growth factor receptor 3 (VEGFR-3) is associated with Milroy disease, and VEGFR-3 gene is mutated in around 70% of the cases diagnosed. The only genetic alteration known to be associated with Milroy-like disease was recently identified in a family with a frameshift mutation in vascular endothelial growth factor C (VEGFC) gene, which encodes a VEGFR3 ligand. Methods and results We report a newborn patient with an external phenotype consistent with Milroy disease and a truncating mutation (p.R210X) in the VEGFC gene detected by exome sequence analysis. Subsequent analysis, by lymphoscintigraphic scan, performed for research purposes, allowed us to correct the diagnosis, confirming patient’s disease as Milroy-like. The mutation segregates with the phenotype in the family according to a dominant model with full penetrance. Conclusions The clinical presentation, similar to Milroy disease, indicates an overlapping of the external phenotype of both diseases, suggesting that genetic analysis of VEGFC would be useful in diagnosing patients that present with Milroy features but have no mutation in VEGFR-3. Establishing a well-defined genetic pattern would help with differential diagnosis.


PLOS ONE | 2016

SparkBWA: Speeding Up the Alignment of High-Throughput DNA Sequencing Data.

José Manuel Abuín; Juan Carlos Pichel; Tomás F. Pena; Jorge Amigo

Next-generation sequencing (NGS) technologies have led to a huge amount of genomic data that need to be analyzed and interpreted. This fact has a huge impact on the DNA sequence alignment process, which nowadays requires the mapping of billions of small DNA sequences onto a reference genome. In this way, sequence alignment remains the most time-consuming stage in the sequence analysis workflow. To deal with this issue, state of the art aligners take advantage of parallelization strategies. However, the existent solutions show limited scalability and have a complex implementation. In this work we introduce SparkBWA, a new tool that exploits the capabilities of a big data technology as Spark to boost the performance of one of the most widely adopted aligner, the Burrows-Wheeler Aligner (BWA). The design of SparkBWA uses two independent software layers in such a way that no modifications to the original BWA source code are required, which assures its compatibility with any BWA version (future or legacy). SparkBWA is evaluated in different scenarios showing noticeable results in terms of performance and scalability. A comparison to other parallel BWA-based aligners validates the benefits of our approach. Finally, an intuitive and flexible API is provided to NGS professionals in order to facilitate the acceptance and adoption of the new tool. The source code of the software described in this paper is publicly available at https://github.com/citiususc/SparkBWA, with a GPL3 license.


PLOS ONE | 2015

Genetic Analysis of Arrhythmogenic Diseases in the Era of NGS: The Complexity of Clinical Decision-Making in Brugada Syndrome

Catarina Allegue; Monica Coll; Jesus Mates; Oscar Campuzano; Anna Iglesias; Beatriz Sobrino; Maria Brion; Jorge Amigo; Angel Carracedo; Pedro Brugada; Josep Brugada; Ramon Brugada

Background The use of next-generation sequencing enables a rapid analysis of many genes associated with sudden cardiac death in diseases like Brugada Syndrome. Genetic variation is identified and associated with 30–35% of cases of Brugada Syndrome, with nearly 20–25% attributable to variants in SCN5A, meaning many cases remain undiagnosed genetically. To evaluate the role of genetic variants in arrhythmogenic diseases and the utility of next-generation sequencing, we applied this technology to resequence 28 main genes associated with arrhythmogenic disorders. Materials and Methods A cohort of 45 clinically diagnosed Brugada Syndrome patients classified as SCN5A-negative was analyzed using next generation sequencing. Twenty-eight genes were resequenced: AKAP9, ANK2, CACNA1C, CACNB2, CASQ2, CAV3, DSC2, DSG2, DSP, GPD1L, HCN4, JUP, KCNE1, KCNE2, KCNE3, KCNH2, KCNJ2, KCNJ5, KCNQ1, NOS1AP, PKP2, RYR2, SCN1B, SCN3B, SCN4B, SCN5A, SNTA1, and TMEM43. A total of 85 clinically evaluated relatives were also genetically analyzed to ascertain familial segregation. Results and Discussion Twenty-two patients carried 30 rare genetic variants in 12 genes, only 4 of which were previously associated with Brugada Syndrome. Neither insertion/deletion nor copy number variation were detected. We identified genetic variants in novel candidate genes potentially associated to Brugada Syndrome. These include: 4 genetic variations in AKAP9 including a de novo genetic variation in 3 positive cases; 5 genetic variations in ANK2 detected in 4 cases; variations in KCNJ2 together with CASQ2 in 1 case; genetic variations in RYR2, including a de novo genetic variation and desmosomal proteins encoding genes including DSG2, DSP and JUP, detected in 3 of the cases. Larger gene panels or whole exome sequencing should be considered to identify novel genes associated to Brugada Syndrome. However, application of approaches such as whole exome sequencing would difficult the interpretation for clinical purposes due to the large amount of data generated. The identification of these genetic variants opens new perspectives on the implications of genetic background in the arrhythmogenic substrate for research purposes. Conclusions As a paradigm for other arrhythmogenic diseases and for unexplained sudden death, our data show that clinical genetic diagnosis is justified in a family perspective for confirmation of genetic causality. In the era of personalized medicine using high-throughput tools, clinical decision-making is increasingly complex.

Collaboration


Dive into the Jorge Amigo's collaboration.

Top Co-Authors

Avatar

Angel Carracedo

University of Santiago de Compostela

View shared research outputs
Top Co-Authors

Avatar

Antonio Salas

University of Santiago de Compostela

View shared research outputs
Top Co-Authors

Avatar

C. Phillips

University of Santiago de Compostela

View shared research outputs
Top Co-Authors

Avatar

Beatriz Sobrino

University of Santiago de Compostela

View shared research outputs
Top Co-Authors

Avatar

Alberto Gómez-Carballa

University of Santiago de Compostela

View shared research outputs
Top Co-Authors

Avatar

Jacobo Pardo-Seco

University of Santiago de Compostela

View shared research outputs
Top Co-Authors

Avatar

Federico Martinón-Torres

University of Santiago de Compostela

View shared research outputs
Top Co-Authors

Avatar

Javier Costas

University of Santiago de Compostela

View shared research outputs
Top Co-Authors

Avatar

M.V. Lareu

University of Santiago de Compostela

View shared research outputs
Top Co-Authors

Avatar
Researchain Logo
Decentralizing Knowledge