Network


Latest external collaboration on country level. Dive into details by clicking on the dots.

Hotspot


Dive into the research topics where Xavier Messeguer is active.

Publication


Featured researches published by Xavier Messeguer.


Bioinformatics | 2003

DnaSP, DNA polymorphism analyses by the coalescent and other methods

Julio Rozas; Juan Carlos Sánchez-DelBarrio; Xavier Messeguer; Ricardo Rozas

SUMMARY DnaSP is a software package for the analysis of DNA polymorphism data. Present version introduces several new modules and features which, among other options allow: (1) handling big data sets (approximately 5 Mb per sequence); (2) conducting a large number of coalescent-based tests by Monte Carlo computer simulations; (3) extensive analyses of the genetic differentiation and gene flow among populations; (4) analysing the evolutionary pattern of preferred and unpreferred codons; (5) generating graphical outputs for an easy visualization of results. AVAILABILITY The software package, including complete documentation and examples, is freely available to academic users from: http://www.ub.es/dnasp


Bioinformatics | 2002

PROMO: detection of known transcription regulatory elements using species-tailored searches

Xavier Messeguer; Ruth Escudero; Domènec Farré; Oscar Núñez; Javier Martı́nez; M. Mar Albà

We have developed a set of tools to construct positional weight matrices from known transcription factor binding sites in a species or taxon-specific manner, and to search for matches in DNA sequences.


Nucleic Acids Research | 2003

Identification of patterns in biological sequences at the ALGGEN server: PROMO and MALGEN

Domènec Farré; Roman Roset; Mario Huerta; José E. Adsuara; Llorenç Roselló; M. Mar Albà; Xavier Messeguer

In this paper we present several web-based tools to identify conserved patterns in sequences. In particular we present details on the functionality of PROMO version 2.0, a program for the prediction of transcription factor binding site in a single sequence or in a group of related sequences and, of MALGEN, a tool to visualize sequence correspondences among long DNA sequences. The web tools and associated documentation can be accessed at http://www.lsi.upc.es/~alggen (RESEARCH link).


eLife | 2014

Long non-coding RNAs as a source of new peptides

Jorge Ruiz-Orera; Xavier Messeguer; Juan Antonio Subirana; M. Mar Albà

Deep transcriptome sequencing has revealed the existence of many transcripts that lack long or conserved open reading frames (ORFs) and which have been termed long non-coding RNAs (lncRNAs). The vast majority of lncRNAs are lineage-specific and do not yet have a known function. In this study, we test the hypothesis that they may act as a repository for the synthesis of new peptides. We find that a large fraction of the lncRNAs expressed in cells from six different species is associated with ribosomes. The patterns of ribosome protection are consistent with the translation of short peptides. lncRNAs show similar coding potential and sequence constraints than evolutionary young protein coding sequences, indicating that they play an important role in de novo protein evolution. DOI: http://dx.doi.org/10.7554/eLife.03523.001


Nucleic Acids Research | 2006

ABS: a database of Annotated regulatory Binding Sites from orthologous promoters.

Enrique Blanco; Domènec Farré; M. Mar Albà; Xavier Messeguer; Roderic Guigó

Information about the genomic coordinates and the sequence of experimentally identified transcription factor binding sites is found scattered under a variety of diverse formats. The availability of standard collections of such high-quality data is important to design, evaluate and improve novel computational approaches to identify binding motifs on promoter sequences from related genes. ABS () is a public database of known binding sites identified in promoters of orthologous vertebrate genes that have been manually curated from bibliography. We have annotated 650 experimental binding sites from 68 transcription factors and 100 orthologous target genes in human, mouse, rat or chicken genome sequences. Computational predictions and promoter alignment information are also provided for each entry. A simple and easy-to-use web interface facilitates data retrieval allowing different views of the information. In addition, the release 1.0 of ABS includes a customizable generator of artificial datasets based on the known sites contained in the collection and an evaluation tool to aid during the training and the assessment of motif-finding programs.


BMC Bioinformatics | 2006

M-GCAT: interactively and efficiently constructing large-scale multiple genome comparison frameworks in closely related species

Todd J. Treangen; Xavier Messeguer

BackgroundDue to recent advances in whole genome shotgun sequencing and assembly technologies, the financial cost of decoding an organisms DNA has been drastically reduced, resulting in a recent explosion of genomic sequencing projects. This increase in related genomic data will allow for in depth studies of evolution in closely related species through multiple whole genome comparisons.ResultsTo facilitate such comparisons, we present an interactive multiple genome comparison and alignment tool, M-GCAT, that can efficiently construct multiple genome comparison frameworks in closely related species. M-GCAT is able to compare and identify highly conserved regions in up to 20 closely related bacterial species in minutes on a standard computer, and as many as 90 (containing 75 cloned genomes from a set of 15 published enterobacterial genomes) in an hour. M-GCAT also incorporates a novel comparative genomics data visualization interface allowing the user to globally and locally examine and inspect the conserved regions and gene annotations.ConclusionM-GCAT is an interactive comparative genomics tool well suited for quickly generating multiple genome comparisons frameworks and alignments among closely related species. M-GCAT is freely available for download for academic and non-commercial use at: http://alggen.lsi.upc.es/recerca/align/mgcat/intro-mgcat.html.


Genome Biology | 2007

Housekeeping genes tend to show reduced upstream sequence conservation

Domènec Farré; Nicolás Bellora; Loris Mularoni; Xavier Messeguer; M. Mar Albà

BackgroundUnderstanding the constraints that operate in mammalian gene promoter sequences is of key importance to understand the evolution of gene regulatory networks. The level of promoter conservation varies greatly across orthologous genes, denoting differences in the strength of the evolutionary constraints. Here we test the hypothesis that the number of tissues in which a gene is expressed is related in a significant manner to the extent of promoter sequence conservation.ResultsWe show that mammalian housekeeping genes, expressed in all or nearly all tissues, show significantly lower promoter sequence conservation, especially upstream of position -500 with respect to the transcription start site, than genes expressed in a subset of tissues. In addition, we evaluate the effect of gene function, CpG island content and protein evolutionary rate on promoter sequence conservation. Finally, we identify a subset of transcription factors that bind to motifs that are specifically over-represented in housekeeping gene promoters.ConclusionThis is the first report that shows that the promoters of housekeeping genes show reduced sequence conservation with respect to genes expressed in a more tissue-restricted manner. This is likely to be related to simpler gene expression, requiring a smaller number of functional cis-regulatory motifs.


PLOS Computational Biology | 2006

Transcription Factor Map Alignment of Promoter Regions

Enrique Blanco; Xavier Messeguer; Temple F. Smith; Roderic Guigó

We address the problem of comparing and characterizing the promoter regions of genes with similar expression patterns. This remains a challenging problem in sequence analysis, because often the promoter regions of co-expressed genes do not show discernible sequence conservation. In our approach, thus, we have not directly compared the nucleotide sequence of promoters. Instead, we have obtained predictions of transcription factor binding sites, annotated the predicted sites with the labels of the corresponding binding factors, and aligned the resulting sequences of labels—to which we refer here as transcription factor maps (TF-maps). To obtain the global pairwise alignment of two TF-maps, we have adapted an algorithm initially developed to align restriction enzyme maps. We have optimized the parameters of the algorithm in a small, but well-curated, collection of human–mouse orthologous gene pairs. Results in this dataset, as well as in an independent much larger dataset from the CISRED database, indicate that TF-map alignments are able to uncover conserved regulatory elements, which cannot be detected by the typical sequence alignments.


Methods of Molecular Biology | 2007

Analyzing Patterns of Microbial Evolution Using the Mauve Genome Alignment System

Aaron E. Darling; Todd J. Treangen; Xavier Messeguer; Nicole T. Perna

During the course of evolution, genomes can undergo large-scale mutation events such as rearrangement and lateral transfer. Such mutations can result in significant variations in gene order and gene content among otherwise closely related organisms. The Mauve genome alignment system can successfully identify such rearrangement and lateral transfer events in comparisons of multiple microbial genomes even under high levels of recombination. This chapter outlines the main features of Mauve and provides examples that describe how to use Mauve to conduct a rigorous multiple genome comparison and study evolutionary patterns.


IEEE/ACM Transactions on Computational Biology and Bioinformatics | 2009

A Novel Heuristic for Local Multiple Alignment of Interspersed DNA Repeats

Todd J. Treangen; Aaron E. Darling; Guillaume Achaz; Mark A. Ragan; Xavier Messeguer; Eduardo P. C. Rocha

Pairwise local sequence alignment methods have been the prevailing technique to identify homologous nucleotides between related species. However, existing methods that identify and align all homologous nucleotides in one or more genomes have suffered from poor scalability and limited accuracy. We propose a novel method that couples a gapped extension heuristic with an efficient filtration method for identifying interspersed repeats in genome sequences. During gapped extension, we use the MUSCLE implementation of progressive global multiple alignment with iterative refinement. The resulting gapped extensions potentially contain alignments of unrelated sequence. We detect and remove such undesirable alignments using a hidden Markov model (HMM) to predict the posterior probability of homology. The HMM emission frequencies for nucleotide substitutions can be derived from any time-reversible nucleotide substitution matrix. We evaluate the performance of our method and previous approaches on a hybrid data set of real genomic DNA with simulated interspersed repeats. Our method outperforms a related method in terms of sensitivity, positive predictive value, and localizing boundaries of homology. The described methods have been implemented in freely available software, Repeatoire, available from: http://wwwabi.snv.jussieu.fr/public/Repeatoire.

Collaboration


Dive into the Xavier Messeguer's collaboration.

Top Co-Authors

Avatar

Joaquim Gabarró

Polytechnic University of Catalonia

View shared research outputs
Top Co-Authors

Avatar

Juan A. Subirana

Polytechnic University of Catalonia

View shared research outputs
Top Co-Authors

Avatar

M. Mar Albà

Pompeu Fabra University

View shared research outputs
Top Co-Authors

Avatar

Todd J. Treangen

Polytechnic University of Catalonia

View shared research outputs
Top Co-Authors

Avatar
Top Co-Authors

Avatar
Top Co-Authors

Avatar
Top Co-Authors

Avatar

Conrado Martínez

Polytechnic University of Catalonia

View shared research outputs
Top Co-Authors

Avatar

Josepa Ribes

University of Barcelona

View shared research outputs
Top Co-Authors

Avatar
Researchain Logo
Decentralizing Knowledge