Network


Latest external collaboration on country level. Dive into details by clicking on the dots.

Hotspot


Dive into the research topics where Darío Guerrero-Fernández is active.

Publication


Featured researches published by Darío Guerrero-Fernández.


Biology | 2012

Why Assembling Plant Genome Sequences Is So Challenging

Manuel G. Claros; Rocío Bautista; Darío Guerrero-Fernández; Hicham Benzerki; Pedro Seoane; Noe Fernandez-Pozo

In spite of the biological and economic importance of plants, relatively few plant species have been sequenced. Only the genome sequence of plants with relatively small genomes, most of them angiosperms, in particular eudicots, has been determined. The arrival of next-generation sequencing technologies has allowed the rapid and efficient development of new genomic resources for non-model or orphan plant species. But the sequencing pace of plants is far from that of animals and microorganisms. This review focuses on the typical challenges of plant genomes that can explain why plant genomics is less developed than animal genomics. Explanations about the impact of some confounding factors emerging from the nature of plant genomes are given. As a result of these challenges and confounding factors, the correct assembly and annotation of plant genomes is hindered, genome drafts are produced, and advances in plant genomics are delayed.


BMC Genomics | 2014

De novo assembly, characterization and functional annotation of Senegalese sole (Solea senegalensis) and common sole (Solea solea) transcriptomes: integration in a database and design of a microarray

Hicham Benzekri; Paula Armesto; Xavier Cousin; Mireia Rovira; Diego Crespo; Manuel Alejandro Merlo; David Mazurais; Rocío Bautista; Darío Guerrero-Fernández; Noe Fernandez-Pozo; Marian Ponce; Carlos Infante; José Zambonino; Sabine Nidelet; Marta Gut; Laureana Rebordinos; Josep V. Planas; Marie-Laure Bégout; M. Gonzalo Claros; Manuel Manchado

BackgroundSenegalese sole (Solea senegalensis) and common sole (S. solea) are two economically and evolutionary important flatfish species both in fisheries and aquaculture. Although some genomic resources and tools were recently described in these species, further sequencing efforts are required to establish a complete transcriptome, and to identify new molecular markers. Moreover, the comparative analysis of transcriptomes will be useful to understand flatfish evolution.ResultsA comprehensive characterization of the transcriptome for each species was carried out using a large set of Illumina data (more than 1,800 millions reads for each sole species) and 454 reads (more than 5 millions reads only in S. senegalensis), providing coverages ranging from 1,384x to 2,543x. After a de novo assembly, 45,063 and 38,402 different transcripts were obtained, comprising 18,738 and 22,683 full-length cDNAs in S. senegalensis and S. solea, respectively. A reference transcriptome with the longest unique transcripts and putative non-redundant new transcripts was established for each species. A subset of 11,953 reference transcripts was qualified as highly reliable orthologs (>97% identity) between both species. A small subset of putative species-specific, lineage-specific and flatfish-specific transcripts were also identified. Furthermore, transcriptome data permitted the identification of single nucleotide polymorphisms and simple-sequence repeats confirmed by FISH to be used in further genetic and expression studies. Moreover, evidences on the retention of crystallins crybb1, crybb1-like and crybb3 in the two species of soles are also presented. Transcriptome information was applied to the design of a microarray tool in S. senegalensis that was successfully tested and validated by qPCR. Finally, transcriptomic data were hosted and structured at SoleaDB.ConclusionsTranscriptomes and molecular markers identified in this study represent a valuable source for future genomic studies in these economically important species. Orthology analysis provided new clues regarding sole genome evolution indicating a divergent evolution of crystallins in flatfish. The design of a microarray and establishment of a reference transcriptome will be useful for large-scale gene expression studies. Moreover, the integration of transcriptomic data in the SoleaDB will facilitate the management of genomic information in these important species.


BMC Genomics | 2011

EuroPineDB: a high-coverage web database for maritime pine transcriptome

Noe Fernandez-Pozo; Javier Canales; Darío Guerrero-Fernández; David P. Villalobos; Sara M. Díaz-Moreno; Rocío Bautista; Arantxa Flores-Monterroso; M. Ángeles Guevara; Pedro Perdiguero; Carmen Collada; M. Teresa Cervera; Álvaro Soto; Ricardo J. Ordás; Francisco R. Cantón; Concepción Ávila; Francisco M. Cánovas; M. Gonzalo Claros

BackgroundPinus pinaster is an economically and ecologically important species that is becoming a woody gymnosperm model. Its enormous genome size makes whole-genome sequencing approaches are hard to apply. Therefore, the expressed portion of the genome has to be characterised and the results and annotations have to be stored in dedicated databases.DescriptionEuroPineDB is the largest sequence collection available for a single pine species, Pinus pinaster (maritime pine), since it comprises 951 641 raw sequence reads obtained from non-normalised cDNA libraries and high-throughput sequencing from adult (xylem, phloem, roots, stem, needles, cones, strobili) and embryonic (germinated embryos, buds, callus) maritime pine tissues. Using open-source tools, sequences were optimally pre-processed, assembled, and extensively annotated (GO, EC and KEGG terms, descriptions, SNPs, SSRs, ORFs and InterPro codes). As a result, a 10.5× P. pinaster genome was covered and assembled in 55 322 UniGenes. A total of 32 919 (59.5%) of P. pinaster UniGenes were annotated with at least one description, revealing at least 18 466 different genes. The complete database, which is designed to be scalable, maintainable, and expandable, is freely available at: http://www.scbi.uma.es/pindb/. It can be retrieved by gene libraries, pine species, annotations, UniGenes and microarrays (i.e., the sequences are distributed in two-colour microarrays; this is the only conifer database that provides this information) and will be periodically updated. Small assemblies can be viewed using a dedicated visualisation tool that connects them with SNPs. Any sequence or annotation set shown on-screen can be downloaded. Retrieval mechanisms for sequences and gene annotations are provided.ConclusionsThe EuroPineDB with its integrated information can be used to reveal new knowledge, offers an easy-to-use collection of information to directly support experimental work (including microarray hybridisation), and provides deeper knowledge on the maritime pine transcriptome.


Frontiers in Plant Science | 2015

ReprOlive: a database with linked data for the olive tree (Olea europaea L.) reproductive transcriptome.

Rosario Carmona; Adoración Zafra; Pedro Seoane; Antonio Jesús Castro; Darío Guerrero-Fernández; Trinidad Castillo-Castillo; Ana Medina-García; Francisco M. Cánovas; José F. Aldana-Montes; Ismael Navas-Delgado; Juan de Dios Alché; M. Gonzalo Claros

Plant reproductive transcriptomes have been analyzed in different species due to the agronomical and biotechnological importance of plant reproduction. Here we presented an olive tree reproductive transcriptome database with samples from pollen and pistil at different developmental stages, and leaf and root as control vegetative tissues http://reprolive.eez.csic.es). It was developed from 2,077,309 raw reads to 1,549 Sanger sequences. Using a pre-defined workflow based on open-source tools, sequences were pre-processed, assembled, mapped, and annotated with expression data, descriptions, GO terms, InterPro signatures, EC numbers, KEGG pathways, ORFs, and SSRs. Tentative transcripts (TTs) were also annotated with the corresponding orthologs in Arabidopsis thaliana from TAIR and RefSeq databases to enable Linked Data integration. It results in a reproductive transcriptome comprising 72,846 contigs with average length of 686 bp, of which 63,965 (87.8%) included at least one functional annotation, and 55,356 (75.9%) had an ortholog. A minimum of 23,568 different TTs was identified and 5,835 of them contain a complete ORF. The representative reproductive transcriptome can be reduced to 28,972 TTs for further gene expression studies. Partial transcriptomes from pollen, pistil, and vegetative tissues as control were also constructed. ReprOlive provides free access and download capability to these results. Retrieval mechanisms for sequences and transcript annotations are provided. Graphical localization of annotated enzymes into KEGG pathways is also possible. Finally, ReprOlive has included a semantic conceptualisation by means of a Resource Description Framework (RDF) allowing a Linked Data search for extracting the most updated information related to enzymes, interactions, allergens, structures, and reactive oxygen species.


Computational Biology Journal | 2013

SCBI_MapReduce, a New Ruby Task-Farm Skeleton for Automated Parallelisation and Distribution in Chunks of Sequences: The Implementation of a Boosted Blast

Darío Guerrero-Fernández; Juan Falgueras; M. Gonzalo Claros

Current genomic analyses often require the managing and comparison of big data using desktop bioinformatic software that was not developed regarding multicore distribution. The task-farm SCBI_MAPREDUCE is intended to simplify the trivial parallelisation and distribution of new and legacy software and scripts for biologists who are interested in using computers but are not skilled programmers. In the case of legacy applications, there is no need of modification or rewriting the source code. It can be used from multicore workstations to heterogeneous grids. Tests have demonstrated that speed-up scales almost linearly and that distribution in small chunks increases it. It is also shown that SCBI_MAPREDUCE takes advantage of shared storage when necessary, is fault-tolerant, allows for resuming aborted jobs, does not need special hardware or virtual machine support, and provides the same results than a parallelised, legacy software. The same is true for interrupted and relaunched jobs. As proof-of-concept, distribution of a compiled version of BLAST


Journal of Biomedical Informatics | 2010

GENote v. β : a web tool prototype for annotation of unfinished sequences in non-model eukaryotes

Noe Fernandez-Pozo; Darío Guerrero-Fernández; Rocío Bautista; Josefa Gómez-Maldonado; Concepción Ávila; Francisco M. Cánovas; M. Gonzalo Claros

De novo identification of genes in newly-sequenced eukaryotic genomes is based on sensors, which are not available in non-model organisms. Many annotation tools have been developed and most of them require sequence training, computer skills and accessibility to sufficient computational power. The main need of non-model organisms is finding genes, transposable elements, repetitions, etc., in reliable assemblies. GENote v.β is intended to cope with these aspects as a web tool for researchers without bioinformatics skills. It facilitates the annotation of new, unfinished sequences with descriptions, GO terms, EC numbers and KEEG pathways. It currently localises genes and transposons, which enable the sorting of contigs or scaffolds from a BAC clone, and reveals some putative assembly inconsistencies. Results are provided in GFF3 format and in tab-delimited text readable in viewers; a summary of findings is provided also as a PNG file.


Plant Biotechnology Journal | 2014

De novo assembly of maritime pine transcriptome: implications for forest breeding and biotechnology

Javier Canales; Rocío Bautista; Philippe Label; Josefa Gómez-Maldonado; Isabelle Lesur; Noe Fernandez-Pozo; Marina Rueda-López; Darío Guerrero-Fernández; Vanessa Castro-Rodríguez; Hicham Benzekri; Rafael A. Cañas; M. A. Guevara; Andreia Rodrigues; Pedro Seoane; Caroline Teyssier; Alexandre Morel; François Ehrenmann; Grégoire Le Provost; Céline Lalanne; Céline Noirot; Christophe Klopp; Isabelle Reymond; Angel García-Gutiérrez; Jean-François Trontin; Marie-Anne Lelu-Walter; Célia Miguel; María Teresa Cervera; Francisco R. Cantón; Christophe Plomion; Luc Harvengt


ScienceOpen Posters | 2015

AUTOMATED CONSTRUCTION OF TRANSCRIPTOME DATABASES FOR UNCHARACTERISED-GENOME PLANTS, SUCH AS OLIVE TREE

M. Gonzalo Claros; Pedro Seoane; Rosario Carmona; Adoración Zafra; Antonio J. Castro Gómez; Darío Guerrero-Fernández; Trinidad Castillo-Castillo; Ana Medina-García; José F. Aldana-Montes; Ismael Navas-Delgado; Juan de Dios Alché


IWBBIO | 2014

AutoFlow: an easy way to build workflows

Pedro Seoane; Rosario Carmona; Rocío Bautista; Darío Guerrero-Fernández; Manuel G. Claros


IWBBIO | 2013

FQbin a compatible and optimized format for storing and managing sequence data.

Darío Guerrero-Fernández; Rafael Larrosa; M. Gonzalo Claros

Collaboration


Dive into the Darío Guerrero-Fernández's collaboration.

Top Co-Authors

Avatar
Top Co-Authors

Avatar
Top Co-Authors

Avatar

Noe Fernandez-Pozo

Boyce Thompson Institute for Plant Research

View shared research outputs
Top Co-Authors

Avatar
Top Co-Authors

Avatar
Top Co-Authors

Avatar
Top Co-Authors

Avatar
Top Co-Authors

Avatar

Adoración Zafra

Spanish National Research Council

View shared research outputs
Top Co-Authors

Avatar
Top Co-Authors

Avatar
Researchain Logo
Decentralizing Knowledge