François Serra
Pompeu Fabra University
Network
Latest external collaboration on country level. Dive into details by clicking on the dots.
Publication
Featured researches published by François Serra.
Molecular Biology and Evolution | 2016
Jaime Huerta-Cepas; François Serra; Peer Bork
The Environment for Tree Exploration (ETE) is a computational framework that simplifies the reconstruction, analysis, and visualization of phylogenetic trees and multiple sequence alignments. Here, we present ETE v3, featuring numerous improvements in the underlying library of methods, and providing a novel set of standalone tools to perform common tasks in comparative genomics and phylogenetics. The new features include (i) building gene-based and supermatrix-based phylogenies using a single command, (ii) testing and visualizing evolutionary models, (iii) calculating distances between trees of different size or including duplications, and (iv) providing seamless integration with the NCBI taxonomy database. ETE is freely available at http://etetoolkit.org
Nucleic Acids Research | 2011
Rubén Sánchez; François Serra; Joaquín Tárraga; Ignacio Medina; José Carbonell; Luis Pulido; Alejandro de María; Salvador Capella-Gutiérrez; Jaime Huerta-Cepas; Toni Gabaldón; Joaquín Dopazo; Hernán Dopazo
Phylemon 2.0 is a new release of the suite of web tools for molecular evolution, phylogenetics, phylogenomics and hypotheses testing. It has been designed as a response to the increasing demand of molecular sequence analyses for experts and non-expert users. Phylemon 2.0 has several unique features that differentiates it from other similar web resources: (i) it offers an integrated environment that enables evolutionary analyses, format conversion, file storage and edition of results; (ii) it suggests further analyses, thereby guiding the users through the web server; and (iii) it allows users to design and save phylogenetic pipelines to be used over multiple genes (phylogenomics). Altogether, Phylemon 2.0 integrates a suite of 30 tools covering sequence alignment reconstruction and trimming; tree reconstruction, visualization and manipulation; and evolutionary hypotheses testing.
FEBS Letters | 2015
François Serra; Marco Di Stefano; Yannick G. Spill; Yasmina Cuartero; Michael Goodstadt; Davide Baù; Marc A. Marti-Renom
Chromosomes are large polymer molecules composed of nucleotides. In some species, such as humans, this polymer can sum up to meters long and still be properly folded within the nuclear space of few microns in size. The exact mechanisms of how the meters long DNA is folded into the nucleus, as well as how the regulatory machinery can access it, is to a large extend still a mystery. However, and thanks to newly developed molecular, genomic and computational approaches based on the Chromosome Conformation Capture (3C) technology, we are now obtaining insight on how genomes are spatially organized. Here we review a new family of computational approaches that aim at using 3C‐based data to obtain spatial restraints for modeling genomes and genomic domains.
Nucleic Acids Research | 2015
Marie Trussart; François Serra; Davide Baù; Ivan Junier; Luis Serrano; Marc A. Marti-Renom
Restraint-based modeling of genomes has been recently explored with the advent of Chromosome Conformation Capture (3C-based) experiments. We previously developed a reconstruction method to resolve the 3D architecture of both prokaryotic and eukaryotic genomes using 3C-based data. These models were congruent with fluorescent imaging validation. However, the limits of such methods have not systematically been assessed. Here we propose the first evaluation of a mean-field restraint-based reconstruction of genomes by considering diverse chromosome architectures and different levels of data noise and structural variability. The results show that: first, current scoring functions for 3D reconstruction correlate with the accuracy of the models; second, reconstructed models are robust to noise but sensitive to structural variability; third, the local structure organization of genomes, such as Topologically Associating Domains, results in more accurate models; fourth, to a certain extent, the models capture the intrinsic structural variability in the input matrices and fifth, the accuracy of the models can be a priori predicted by analyzing the properties of the interaction matrices. In summary, our work provides a systematic analysis of the limitations of a mean-field restrain-based method, which could be taken into consideration in further development of methods as well as their applications.
Environmental Microbiology | 2012
Luís G. Gonçalves; Nuno Borges; François Serra; Pedro L. Fernandes; Hernán Dopazo; Helena Santos
The synthesis of di-myo-inositol phosphate (DIP), a common compatible solute in hyperthermophiles, involves the consecutive actions of inositol-1-phosphate cytidylyltransferase (IPCT) and di-myo-inositol phosphate phosphate synthase (DIPPS). In most cases, both activities are present in a single gene product, but separate genes are also found in a few organisms. Genes for IPCT and DIPPS were found in the genomes of 33 organisms, all with thermophilic/hyperthermophilic lifestyles. Phylogeny of IPCT/DIPPS revealed an incongruent topology with 16S RNA phylogeny, thus suggesting horizontal gene transfer. The phylogenetic tree of the DIPPS domain was rooted by using phosphatidylinositol phosphate synthase sequences as out-group. The root locates at the separation of genomes with fused and split genes. We propose that the gene encoding DIPPS was recruited from the biosynthesis of phosphatidylinositol. The last DIP-synthesizing ancestor harboured separated genes for IPCT and DIPPS and this architecture was maintained in a crenarchaeal lineage, and transferred by horizontal gene transfer to hyperthermophilic marine Thermotoga species. It is plausible that the driving force for the assembly of those two genes in the early ancestor is related to the acquired advantage of DIP producers to cope with high temperature. This work corroborates the view that Archaea were the first hyperthermophilic organisms.
PLOS Computational Biology | 2017
François Serra; Davide Baù; Mike Goodstadt; David Castillo; Guillaume J. Filion; Marc A. Marti-Renom
The sequence of a genome is insufficient to understand all genomic processes carried out in the cell nucleus. To achieve this, the knowledge of its three-dimensional architecture is necessary. Advances in genomic technologies and the development of new analytical methods, such as Chromosome Conformation Capture (3C) and its derivatives, provide unprecedented insights in the spatial organization of genomes. Here we present TADbit, a computational framework to analyze and model the chromatin fiber in three dimensions. Our package takes as input the sequencing reads of 3C-based experiments and performs the following main tasks: (i) pre-process the reads, (ii) map the reads to a reference genome, (iii) filter and normalize the interaction data, (iv) analyze the resulting interaction matrices, (v) build 3D models of selected genomic domains, and (vi) analyze the resulting models to characterize their structural properties. To illustrate the use of TADbit, we automatically modeled 50 genomic domains from the fly genome revealing differential structural features of the previously defined chromatin colors, establishing a link between the conformation of the genome and the local chromatin composition. TADbit provides three-dimensional models built from 3C-based experiments, which are ready for visualization and for characterizing their relation to gene expression and epigenetic states. TADbit is an open-source Python library available for download from https://github.com/3DGenomes/tadbit.
Evolutionary Bioinformatics | 2012
Nicolás Lavagnino; François Serra; Leonardo Arbiza; Hernán Dopazo; Esteban Hasson
Previous comparative genomic studies of genes involved in olfactory behavior in Drosophila focused only on particular gene families such as odorant receptor and/or odorant binding proteins. However, olfactory behavior has a complex genetic architecture that is orchestrated by many interacting genes. In this paper, we present a comparative genomic study of olfactory behavior in Drosophila including an extended set of genes known to affect olfactory behavior. We took advantage of the recent burst of whole genome sequences and the development of powerful statistical tools to analyze genomic data and test evolutionary and functional hypotheses of olfactory genes in the six species of the Drosophila melanogaster species group for which whole genome sequences are available. Our study reveals widespread purifying selection and limited incidence of positive selection on olfactory genes. We show that the pace of evolution of olfactory genes is mostly independent of the life cycle stage, and of the number of life cycle stages, in which they participate in olfaction. However, we detected a relationship between evolutionary rates and the position that the gene products occupy in the olfactory system, genes occupying central positions tend to be more constrained than peripheral genes. Finally, we demonstrate that specialization to one host does not seem to be associated with bursts of adaptive evolution in olfactory genes in D. sechellia and D. erecta, the two specialists species analyzed, but rather different lineages have idiosyncratic evolutionary histories in which both historical and ecological factors have been involved.
BMC Genomics | 2011
Urko M. Marigorta; Oscar Lao; Ferran Casals; Francesc Calafell; Carlos Morcillo-Suarez; Rui Faria; Elena Bosch; François Serra; Jaume Bertranpetit; Hernán Dopazo; Arcadi Navarro
BackgroundSearching for associations between genetic variants and complex diseases has been a very active area of research for over two decades. More than 51,000 potential associations have been studied and published, a figure that keeps increasing, especially with the recent explosion of array-based Genome-Wide Association Studies. Even if the number of true associations described so far is high, many of the putative risk variants detected so far have failed to be consistently replicated and are widely considered false positives. Here, we focus on the world-wide patterns of replicability of published association studies.ResultsWe report three main findings. First, contrary to previous results, genes associated to complex diseases present lower degrees of genetic differentiation among human populations than average genome-wide levels. Second, also contrary to previous results, the differences in replicability of disease associated-loci between Europeans and East Asians are highly correlated with genetic differentiation between these populations. Finally, highly replicated genes present increased levels of high-frequency derived alleles in European and Asian populations when compared to African populations.ConclusionsOur findings highlight the heterogeneous nature of the genetic etiology of complex disease, confirm the importance of the recent evolutionary history of our species in current patterns of disease susceptibility and could cast doubts on the status as false positives of some associations that have failed to replicate across populations.
bioRxiv | 2016
François Serra; Davide Baù; Guillaume J. Filion; Marc A. Marti-Renom
Background The sequence of a genome is insufficient to understand all genomic processes carried out in the cell nucleus. To achieve this, the knowledge of its threedimensional architecture is necessary. Advances in genomic technologies and the development of new analytical methods, such as Chromosome Conformation Capture (3C) and its derivatives, provide unprecedented insights on the spatial organization of genomes. However, inferring structures from raw contact data is a tedious process for shortage of available tools. Results Here we present TADbit, a computational framework to analyze and model the chromatin fiber in three dimensions. To illustrate the use of TADbit, we automatically modeled 50 genomic domains from the fly genome revealing differential structural features of the previously defined chromatin colors, establishing a link between the conformation of the genome and the local chromatin composition. Conclusions TADbit provides three-dimensional built from 3C-based experiments, which are ready for visualization and for characterizing their relation to gene expression and epigenetic states. TADbit is open-source and available for download from http://www.3DGenomes.org.
PLOS ONE | 2011
Lena Lüke; Alberto Vicens; François Serra; Juan José Luque-Larena; Hernán Dopazo; Eduardo R. S. Roldan; Montserrat Gomendio
Sexual selection has been proposed as the driving force promoting the rapid evolutionary changes observed in some reproductive genes including protamines. We test this hypothesis in a group of rodents which show marked differences in the intensity of sexual selection. Levels of sperm competition were not associated with the evolutionary rates of protamine 1 but, contrary to expectations, were negatively related to the evolutionary rate of cleaved- and mature-protamine 2. Since both domains were found to be under relaxation, our findings reveal an unforeseen role of sexual selection: to halt the degree of degeneration that proteins within families may experience due to functional redundancy. The degree of relaxation of protamine 2 in this group of rodents is such that in some species it has become dysfunctional and it is not expressed in mature spermatozoa. In contrast, protamine 1 is functionally conserved but shows directed positive selection on specific sites which are functionally relevant such as DNA-anchoring domains and phosphorylation sites. We conclude that in rodents protamine 2 is under relaxation and that sexual selection removes deleterious mutations among species with high levels of sperm competition to maintain the protein functional and the spermatozoa competitive.