Network


Latest external collaboration on country level. Dive into details by clicking on the dots.

Hotspot


Dive into the research topics where Manuel Gil is active.

Publication


Featured researches published by Manuel Gil.


Systematic Biology | 2011

Survey of Branch Support Methods Demonstrates Accuracy, Power, and Robustness of Fast Likelihood-based Approximation Schemes

Maria Anisimova; Manuel Gil; Jean-François Dufayard; Christophe Dessimoz

Abstract Phylogenetic inference and evaluating support for inferred relationships is at the core of many studies testing evolutionary hypotheses. Despite the popularity of nonparametric bootstrap frequencies and Bayesian posterior probabilities, the interpretation of these measures of tree branch support remains a source of discussion. Furthermore, both methods are computationally expensive and become prohibitive for large data sets. Recent fast approximate likelihood-based measures of branch supports (approximate likelihood ratio test [aLRT] and Shimodaira–Hasegawa [SH]-aLRT) provide a compelling alternative to these slower conventional methods, offering not only speed advantages but also excellent levels of accuracy and power. Here we propose an additional method: a Bayesian-like transformation of aLRT (aBayes). Considering both probabilistic and frequentist frameworks, we compare the performance of the three fast likelihood-based methods with the standard bootstrap (SBS), the Bayesian approach, and the recently introduced rapid bootstrap. Our simulations and real data analyses show that with moderate model violations, all tests are sufficiently accurate, but aLRT and aBayes offer the highest statistical power and are very fast. With severe model violations aLRT, aBayes and Bayesian posteriors can produce elevated false-positive rates. With data sets for which such violation can be detected, we recommend using SH-aLRT, the nonparametric version of aLRT based on a procedure similar to the Shimodaira–Hasegawa tree selection. In general, the SBS seems to be excessively conservative and is much slower than our approximate likelihood-based methods.


The Plant Cell | 2008

β-AMYLASE4, a Noncatalytic Protein Required for Starch Breakdown, Acts Upstream of Three Active β-Amylases in Arabidopsis Chloroplasts

Daniel C. Fulton; Michaela Stettler; Tabea Mettler; Cara K. Vaughan; Jing Li; Perigio Francisco; Manuel Gil; Heike Reinhold; Simona Eicke; Gaëlle Messerli; Gary Dorken; Karen J. Halliday; Alison M. Smith; Steven M. Smith; Samuel C. Zeeman

This work investigated the roles of β-amylases in the breakdown of leaf starch. Of the nine β-amylase (BAM)–like proteins encoded in the Arabidopsis thaliana genome, at least four (BAM1, -2, -3, and -4) are chloroplastic. When expressed as recombinant proteins in Escherichia coli, BAM1, BAM2, and BAM3 had measurable β-amylase activity but BAM4 did not. BAM4 has multiple amino acid substitutions relative to characterized β-amylases, including one of the two catalytic residues. Modeling predicts major differences between the glucan binding site of BAM4 and those of active β-amylases. Thus, BAM4 probably lost its catalytic capacity during evolution. Total β-amylase activity was reduced in leaves of bam1 and bam3 mutants but not in bam2 and bam4 mutants. The bam3 mutant had elevated starch levels and lower nighttime maltose levels than the wild type, whereas bam1 did not. However, the bam1 bam3 double mutant had a more severe phenotype than bam3, suggesting functional overlap between the two proteins. Surprisingly, bam4 mutants had elevated starch levels. Introduction of the bam4 mutation into the bam3 and bam1 bam3 backgrounds further elevated the starch levels in both cases. These data suggest that BAM4 facilitates or regulates starch breakdown and operates independently of BAM1 and BAM3. Together, our findings are consistent with the proposal that β-amylase is a major enzyme of starch breakdown in leaves, but they reveal unexpected complexity in terms of the specialization of protein function.


Genome Biology | 2010

Phylogenetic assessment of alignments reveals neglected tree signal in gaps

Christophe Dessimoz; Manuel Gil

BackgroundThe alignment of biological sequences is of chief importance to most evolutionary and comparative genomics studies, yet the two main approaches used to assess alignment accuracy have flaws: reference alignments are derived from the biased sample of proteins with known structure, and simulated data lack realism.ResultsHere, we introduce tree-based tests of alignment accuracy, which not only use large and representative samples of real biological data, but also enable the evaluation of the effect of gap placement on phylogenetic inference. We show that (i) the current belief that consistency-based alignments outperform scoring matrix-based alignments is misguided; (ii) gaps carry substantial phylogenetic signal, but are poorly exploited by most alignment and tree building programs; (iii) even so, excluding gaps and variable regions is detrimental; (iv) disagreement among alignment programs says little about the accuracy of resulting trees.ConclusionsThis study provides the broad community relying on sequence alignment with important practical recommendations, sets superior standards for assessing alignment accuracy, and paves the way for the development of phylogenetic inference methods of significantly higher resolution.


PLOS ONE | 2013

Inferring hierarchical orthologous groups from orthologous gene pairs.

Adrian M. Altenhoff; Manuel Gil; Gaston H. Gonnet; Christophe Dessimoz

Hierarchical orthologous groups are defined as sets of genes that have descended from a single common ancestor within a taxonomic range of interest. Identifying such groups is useful in a wide range of contexts, including inference of gene function, study of gene evolution dynamics and comparative genomics. Hierarchical orthologous groups can be derived from reconciled gene/species trees but, this being a computationally costly procedure, many phylogenomic databases work on the basis of pairwise gene comparisons instead (“graph-based” approach). To our knowledge, there is only one published algorithm for graph-based hierarchical group inference, but both its theoretical justification and performance in practice are as of yet largely uncharacterised. We establish a formal correspondence between the orthology graph and hierarchical orthologous groups. Based on that, we devise GETHOGs (“Graph-based Efficient Technique for Hierarchical Orthologous Groups”), a novel algorithm to infer hierarchical groups directly from the orthology graph, thus without needing gene tree inference nor gene/species tree reconciliation. GETHOGs is shown to correctly reconstruct hierarchical orthologous groups when applied to perfect input, and several extensions with stringency parameters are provided to deal with imperfect input data. We demonstrate its competitiveness using both simulated and empirical data. GETHOGs is implemented as a part of the freely-available OMA standalone package (http://omabrowser.org/standalone). Furthermore, hierarchical groups inferred by GETHOGs (“OMA HOGs”) on >1,000 genomes can be interactively queried via the OMA browser (http://omabrowser.org).


Molecular Biology and Evolution | 2013

CodonPhyML: Fast Maximum Likelihood Phylogeny Estimation under Codon Substitution Models

Manuel Gil; Marcelo Serrano Zanetti; Stefan Zoller; Maria Anisimova

Markov models of codon substitution naturally incorporate the structure of the genetic code and the selection intensity at the protein level, providing a more realistic representation of protein-coding sequences compared with nucleotide or amino acid models. Thus, for protein-coding genes, phylogenetic inference is expected to be more accurate under codon models. So far, phylogeny reconstruction under codon models has been elusive due to computational difficulties of dealing with high dimension matrices. Here, we present a fast maximum likelihood (ML) package for phylogenetic inference, CodonPhyML offering hundreds of different codon models, the largest variety to date, for phylogeny inference by ML. CodonPhyML is tested on simulated and real data and is shown to offer excellent speed and convergence properties. In addition, CodonPhyML includes most recent fast methods for estimating phylogenetic branch supports and provides an integral framework for models selection, including amino acid and DNA models.


research in computational molecular biology | 2005

OMA, a comprehensive, automated project for the identification of orthologs from complete genome data: introduction and first achievements

Christophe Dessimoz; Gina M. Cannarozzi; Manuel Gil; Daniel Margadant; Alexander Roth; Adrian Schneider; Gaston H. Gonnet

The OMA project is a large-scale effort to identify groups of orthologs from complete genome data, currently 150 species. The algorithm relies solely on protein sequence information and does not require any human supervision. It has several original features, in particular a verification step that detects paralogs and prevents them from being clustered together. Consistency checks and verification are performed throughout the process. The resulting groups, whenever a comparison could be made, are highly consistent both with EC assignments, and with assignments from the manually curated database HAMAP. A highly accurate set of orthologous sequences constitutes the basis for several other investigations, including phylogenetic analysis and protein classification.


Systematic Biology | 2015

Current Methods for Automated Filtering of Multiple Sequence Alignments Frequently Worsen Single-Gene Phylogenetic Inference

Ge Tan; Matthieu Muffato; Christian Ledergerber; Javier Herrero; Nick Goldman; Manuel Gil; Christophe Dessimoz

Phylogenetic inference is generally performed on the basis of multiple sequence alignments (MSA). Because errors in an alignment can lead to errors in tree estimation, there is a strong interest in identifying and removing unreliable parts of the alignment. In recent years several automated filtering approaches have been proposed, but despite their popularity, a systematic and comprehensive comparison of different alignment filtering methods on real data has been lacking. Here, we extend and apply recently introduced phylogenetic tests of alignment accuracy on a large number of gene families and contrast the performance of unfiltered versus filtered alignments in the context of single-gene phylogeny reconstruction. Based on multiple genome-wide empirical and simulated data sets, we show that the trees obtained from filtered MSAs are on average worse than those obtained from unfiltered MSAs. Furthermore, alignment filtering often leads to an increase in the proportion of well-supported branches that are actually wrong. We confirm that our findings hold for a wide range of parameters and methods. Although our results suggest that light filtering (up to 20% of alignment positions) has little impact on tree accuracy and may save some computation time, contrary to widespread practice, we do not generally recommend the use of current alignment filtering methods for phylogenetic inference. By providing a way to rigorously and systematically measure the impact of filtering on alignments, the methodology set forth here will guide the development of better filtering algorithms.


Methods of Molecular Biology | 2014

Who watches the watchmen? An appraisal of benchmarks for multiple sequence alignment

Stefano Iantorno; Kevin Gori; Nick Goldman; Manuel Gil; Christophe Dessimoz

Multiple sequence alignment (MSA) is a fundamental and ubiquitous technique in bioinformatics used to infer related residues among biological sequences. Thus alignment accuracy is crucial to a vast range of analyses, often in ways difficult to assess in those analyses. To compare the performance of different aligners and help detect systematic errors in alignments, a number of benchmarking strategies have been pursued. Here we present an overview of the main strategies-based on simulation, consistency, protein structure, and phylogeny-and discuss their different advantages and associated risks. We outline a set of desirable characteristics for effective benchmarking, and evaluate each strategy in light of them. We conclude that there is currently no universally applicable means of benchmarking MSA, and that developers and users of alignment tools should base their choice of benchmark depending on the context of application-with a keen awareness of the assumptions underlying each benchmarking strategy.


PLOS ONE | 2015

Social Support and Health in Diabetes Patients: An Observational Study in Six European Countries in an Era of Austerity

Jan Koetsenruijter; Jan van Lieshout; Christos Lionis; Maria Carmen Portillo; Ivo Vassilev; Elka Todorova; Christina Foss; Manuel Gil; Ingrid Ruud Knutsen; Agapi Angelaki; Agurtzane Mujika; Poli Roukova; Anne Kennedy; Anne Rogers; Michel Wensing

Introduction Support from individual social networks, community organizations and neighborhoods is associated with better self-management and health outcomes. This international study examined the relative impact of different types of support on health and health-related behaviors in patients with type 2 diabetes. Methods Observational study (using interviews and questionnaires) in a sample of 1,692 type 2 diabetes patients with 5,433 connections from Bulgaria, Greece, Netherlands, Norway, Spain, and the United Kingdom. Outcomes were patient-reported health status (SF-12), physical exercise (RAPA), diet and smoking (SDCSCA). Random coefficient regression models were used to examine linkages with individual networks, community organizations, and neighborhood type (deprived rural, deprived urban, or affluent urban). Results Patients had a median of 3 support connections and 34.6% participated in community organizations. Controlled for patients’ age, sex, education, income and comorbidities, large emotional support networks were associated with decrease of non-smoking (OR = 0.87). Large practical support networks were associated with worse physical and mental health (B = -0.46 and -0.27 respectively) and less physical activity (OR = 0.90). Participation in community organizations was associated with better physical and mental health (B = 1.39 and 1.22, respectively) and, in patients with low income, with more physical activity (OR = 1.53). Discussion Participation in community organizations was most consistently related to better health status. Many diabetes patients have individual support networks, but this study did not provide evidence to increase their size as a public health strategy. The consistent association between participation in community organizations and health status provides a clear target for interventions and policies.


Proceedings of the National Academy of Sciences of the United States of America | 2015

Simple chained guide trees give poorer multiple sequence alignments than inferred trees in simulation and phylogenetic benchmarks

Ge Tan; Manuel Gil; Ari Löytynoja; Nick Goldman; Christophe Dessimoz

Multiple sequence aligners typically work by progressively aligning the most closely related sequences or group of sequences according to guide trees. In PNAS, Boyce et al. (1) report that alignments reconstructed using simple chained trees (i.e., comb-like topologies) with random leaf assignment performed better in protein structure-based benchmarks than those reconstructed using phylogenies estimated from the data as guide trees. The authors state that this result could turn decades of research in the field on its head. In light of this statement, it is important to check immediately whether their result holds under evolutionary criteria: recovery of homologous sequence residues and inference of phylogenetic trees from the alignments (2). We have done this and the results are entirely opposed to Boyce et al.’s findings (1).

Collaboration


Dive into the Manuel Gil's collaboration.

Top Co-Authors

Avatar

Nick Goldman

European Bioinformatics Institute

View shared research outputs
Top Co-Authors

Avatar

Anne Rogers

University of Southampton

View shared research outputs
Top Co-Authors

Avatar

Ge Tan

Imperial College London

View shared research outputs
Top Co-Authors

Avatar

Ivo Vassilev

University of Southampton

View shared research outputs
Top Co-Authors

Avatar

Jan Koetsenruijter

Radboud University Nijmegen

View shared research outputs
Top Co-Authors

Avatar

Jan van Lieshout

Radboud University Nijmegen

View shared research outputs
Researchain Logo
Decentralizing Knowledge