Network


Latest external collaboration on country level. Dive into details by clicking on the dots.

Hotspot


Dive into the research topics where David Talavera is active.

Publication


Featured researches published by David Talavera.


PLOS Computational Biology | 2007

The (In)dependence of alternative splicing and gene duplication

David Talavera; Christine Vogel; Modesto Orozco; Sarah A. Teichmann; Xavier de la Cruz

Alternative splicing (AS) and gene duplication (GD) both are processes that diversify the protein repertoire. Recent examples have shown that sequence changes introduced by AS may be comparable to those introduced by GD. In addition, the two processes are inversely correlated at the genomic scale: large gene families are depleted in splice variants and vice versa. All together, these data strongly suggest that both phenomena result in interchangeability between their effects. Here, we tested the extent to which this applies with respect to various protein characteristics. The amounts of AS and GD per gene are anticorrelated even when accounting for different gene functions or degrees of sequence divergence. In contrast, the two processes appear to be independent in their influence on variation in mRNA expression. Further, we conducted a detailed comparison of the effect of sequence changes in both alternative splice variants and gene duplicates on protein structure, in particular the size, location, and types of sequence substitutions and insertions/deletions. We find that, in general, alternative splicing affects protein sequence and structure in a more drastic way than gene duplication and subsequent divergence. Our results reveal an interesting paradox between the anticorrelation of AS and GD at the genomic level, and their impact at the protein level, which shows little or no equivalence in terms of effects on protein sequence, structure, and function. We discuss possible explanations that relate to the order of appearance of AS and GD in a gene family, and to the selection pressure imposed by the environment.


Journal of the Royal Society Interface | 2009

Genome and proteome annotation: organization, interpretation and integration

Gabrielle A. Reeves; David Talavera; Janet M. Thornton

Recent years have seen a huge increase in the generation of genomic and proteomic data. This has been due to improvements in current biological methodologies, the development of new experimental techniques and the use of computers as support tools. All these raw data are useless if they cannot be properly analysed, annotated, stored and displayed. Consequently, a vast number of resources have been created to present the data to the wider community. Annotation tools and databases provide the means to disseminate these data and to comprehend their biological importance. This review examines the various aspects of annotation: type, methodology and availability. Moreover, it puts a special interest on novel annotation fields, such as that of phenotypes, and highlights the recent efforts focused on the integrating annotations.


Journal of Chemical Information and Modeling | 2011

Scoring by Intermolecular Pairwise Propensities of Exposed Residues (SIPPER): A New Efficient Potential for Protein―Protein Docking

Carles Pons; David Talavera; Xavier de la Cruz; Modesto Orozco; Juan Fernández-Recio

A detailed and complete structural knowledge of the interactome is one of the grand challenges in Biology, and a variety of computational docking approaches have been developed to complement experimental efforts and help in the characterization of protein-protein interactions. Among the different docking scoring methods, those based on physicochemical considerations can give the maximum accuracy at the atomic level, but they are usually computationally demanding and necessarily noisy when implemented in rigid-body approaches. Coarser-grained knowledge-based potentials are less sensitive to details of atomic arrangements, thus providing an efficient alternative for scoring of rigid-body docking poses. In this study, we have extracted new statistical potentials from intermolecular pairs of exposed residues in known complex structures, which were then used to score protein-protein docking poses. The new method, called SIPPER (scoring by intermolecular pairwise propensities of exposed residues), combines the value of residue desolvation based on solvent-exposed area with the propensity-based contribution of intermolecular residue pairs. This new scoring function found a near-native orientation within the top 10 predictions in nearly one-third of the cases of a standard docking benchmark and proved to be also useful as a filtering step, drastically reducing the number of docking candidates needed by energy-based methods like pyDock.


Genome Biology | 2015

Global mRNA selection mechanisms for translation initiation

Joseph L. Costello; Lydia M. Castelli; William Rowe; Christopher J. Kershaw; David Talavera; Sarah S. Mohammad-Qureshi; Paul F. G. Sims; Chris M. Grant; Graham D. Pavitt; Simon J. Hubbard; Mark P. Ashe

BackgroundThe selection and regulation of individual mRNAs for translation initiation from a competing pool of mRNA are poorly understood processes. The closed loop complex, comprising eIF4E, eIF4G and PABP, and its regulation by 4E-BPs are perceived to be key players. Using RIP-seq, we aimed to evaluate the role in gene regulation of the closed loop complex and 4E-BP regulation across the entire yeast transcriptome.ResultsWe find that there are distinct populations of mRNAs with coherent properties: one mRNA pool contains many ribosomal protein mRNAs and is enriched specifically with all of the closed loop translation initiation components. This class likely represents mRNAs that rely heavily on the closed loop complex for protein synthesis. Other heavily translated mRNAs are apparently under-represented with most closed loop components except Pab1p. Combined with data showing a close correlation between Pab1p interaction and levels of translation, these data suggest that Pab1p is important for the translation of these mRNAs in a closed loop independent manner. We also identify a translational regulatory mechanism for the 4E-BPs; these appear to self-regulate by inhibiting translation initiation of their own mRNAs.ConclusionsOverall, we show that mRNA selection for translation initiation is not as uniformly regimented as previously anticipated. Components of the closed loop complex are highly relevant for many mRNAs, but some heavily translated mRNAs interact poorly with this machinery. Therefore, alternative, possibly Pab1p-dependent mechanisms likely exist to load ribosomes effectively onto mRNAs. Finally, these studies identify and characterize a complex self-regulatory circuit for the yeast 4E-BPs.


PLOS ONE | 2011

Characterization of Protein-Protein Interaction Interfaces from a Single Species

David Talavera; David Robertson; Simon C. Lovell

Most proteins attain their biological functions through specific interactions with other proteins. Thus, the study of protein-protein interactions and the interfaces that mediate these interactions is of prime importance for the understanding of biological function. In particular the precise determinants of binding specificity and their contributions to binding energy within protein interfaces are not well understood. In order to better understand these determinants an appropriate description of the interaction surface is needed. Available data from the yeast Saccharomyces cerevisiae allow us to focus on a single species and to use all the available structures, correcting for redundancy, instead of using structural representatives. This allows us to control for potentially confounding factors that may affect sequence propensities. We find a significant contribution of main-chain atoms to protein-protein interactions. These include interactions both with other main-chain and side-chain atoms on the interacting chain. We find that the type of interaction depends on both amino acid and secondary structure type involved in the contact. For example, residues in α-helices and large amino acids are the most likely to be involved in interactions through their side-chain atoms. We find an intriguing homogeneity when calculating the average solvation energy of different areas of the protein surface. Unexpectedly, homo- and hetero-complexes have quite similar results for all analyses. Our findings demonstrate that the manner in which protein-protein interactions are formed is determined by the residue type and the secondary structure found in the interface. However the homogeneity of the desolvation energy despite heterogeneity of interface properties suggests a complex relationship between interface composition and binding energy.


Molecular Biology and Evolution | 2015

Covariation Is a Poor Measure of Molecular Coevolution

David Talavera; Simon C. Lovell; Simon Whelan

Recent developments in the analysis of amino acid covariation are leading to breakthroughs in protein structure prediction, protein design, and prediction of the interactome. It is assumed that observed patterns of covariation are caused by molecular coevolution, where substitutions at one site affect the evolutionary forces acting at neighboring sites. Our theoretical and empirical results cast doubt on this assumption. We demonstrate that the strongest coevolutionary signal is a decrease in evolutionary rate and that unfeasibly long times are required to produce coordinated substitutions. We find that covarying substitutions are mostly found on different branches of the phylogenetic tree, indicating that they are independent events that may or may not be attributable to coevolution. These observations undermine the hypothesis that molecular coevolution is the primary cause of the covariation signal. In contrast, we find that the pairs of residues with the strongest covariation signal tend to have low evolutionary rates, and that it is this low rate that gives rise to the covariation signal. Slowly evolving residue pairs are disproportionately located in the protein’s core, which explains covariation methods’ ability to detect pairs of residues that are close in three dimensions. These observations lead us to propose the “coevolution paradox”: The strength of coevolution required to cause coordinated changes means the evolutionary rate is so low that such changes are highly unlikely to occur. As modern covariation methods may lead to breakthroughs in structural genomics, it is critical to recognize their biases and limitations.


Systematic Biology | 2015

ModelOMatic: Fast and Automated Model Selection between RY, Nucleotide, Amino Acid, and Codon Substitution Models

Simon Whelan; James E. Allen; Benjamin P. Blackburne; David Talavera

Molecular phylogenetics is a powerful tool for inferring both the process and pattern of evolution from genomic sequence data. Statistical approaches, such as maximum likelihood and Bayesian inference, are now established as the preferred methods of inference. The choice of models that a researcher uses for inference is of critical importance, and there are established methods for model selection conditioned on a particular type of data, such as nucleotides, amino acids, or codons. A major limitation of existing model selection approaches is that they can only compare models acting upon a single type of data. Here, we extend model selection to allow comparisons between models describing different types of data by introducing the idea of adapter functions, which project aggregated models onto the originally observed sequence data. These projections are implemented in the program ModelOMatic and used to perform model selection on 3722 families from the PANDIT database, 68 genes from an arthropod phylogenomic data set, and 248 genes from a vertebrate phylogenomic data set. For the PANDIT and arthropod data, we find that amino acid models are selected for the overwhelming majority of alignments; with progressively smaller numbers of alignments selecting codon and nucleotide models, and no families selecting RY-based models. In contrast, nearly all alignments from the vertebrate data set select codon-based models. The sequence divergence, the number of sequences, and the degree of selection acting upon the protein sequences may contribute to explaining this variation in model selection. Our ModelOMatic program is fast, with most families from PANDIT taking fewer than 150 s to complete, and should therefore be easily incorporated into existing phylogenetic pipelines. ModelOMatic is available at https://code.google.com/p/modelomatic/.


Bioinformatics | 2009

WSsas: a web service for the annotation of functional residues through structural homologues

David Talavera; Roman A. Laskowski; Janet M. Thornton

MOTIVATION Annotation tools help scientists to traverse the gap between characterized and uncharacterized proteins. Tools for the prediction of protein function include those which predict the function of entire proteins or complexes, those annotating functional domains and those which predict specific residues within the domain. We have developed WSsas, a web service focused on the annotation of essential functional residues. WSsas uses similarity searches and pairwise alignments to transfer functional information about binding, catalytic and protein-protein interaction residues from solved structures to query sequences. In addition, WSsas can supply information about the relevant functional atoms. The web service definition (WSDL) file and a Perl client are freely available at http://www.ebi.ac.uk/thornton-srv/databases/WSsas/.


PLOS Genetics | 2015

The Yeast La Related Protein Slf1p Is a Key Activator of Translation during the Oxidative Stress Response

Christopher J. Kershaw; Joseph L. Costello; Lydia M. Castelli; David Talavera; William Rowe; Paul F. G. Sims; Mark P. Ashe; Simon J. Hubbard; Graham D. Pavitt; Chris M. Grant

The mechanisms by which RNA-binding proteins control the translation of subsets of mRNAs are not yet clear. Slf1p and Sro9p are atypical-La motif containing proteins which are members of a superfamily of RNA-binding proteins conserved in eukaryotes. RIP-Seq analysis of these two yeast proteins identified overlapping and distinct sets of mRNA targets, including highly translated mRNAs such as those encoding ribosomal proteins. In paralell, transcriptome analysis of slf1Δ and sro9Δ mutant strains indicated altered gene expression in similar functional classes of mRNAs following loss of each factor. The loss of SLF1 had a greater impact on the transcriptome, and in particular, revealed changes in genes involved in the oxidative stress response. slf1Δ cells are more sensitive to oxidants and RIP-Seq analysis of oxidatively stressed cells enriched Slf1p targets encoding antioxidants and other proteins required for oxidant tolerance. To quantify these effects at the protein level, we used label-free mass spectrometry to compare the proteomes of wild-type and slf1Δ strains following oxidative stress. This analysis identified several proteins which are normally induced in response to hydrogen peroxide, but where this increase is attenuated in the slf1Δ mutant. Importantly, a significant number of the mRNAs encoding these targets were also identified as Slf1p-mRNA targets. We show that Slf1p remains associated with the few translating ribosomes following hydrogen peroxide stress and that Slf1p co-immunoprecipitates ribosomes and members of the eIF4E/eIF4G/Pab1p ‘closed loop’ complex suggesting that Slf1p interacts with actively translated mRNAs following stress. Finally, mutational analysis of SLF1 revealed a novel ribosome interacting domain in Slf1p, independent of its RNA binding La-motif. Together, our results indicate that Slf1p mediates a translational response to oxidative stress via mRNA-specific translational control.


Scientific Reports | 2015

Integrated multi-omics analyses reveal the pleiotropic nature of the control of gene expression by Puf3p.

Christopher J. Kershaw; Joseph L. Costello; David Talavera; William Rowe; Lydia M. Castelli; Paul F. G. Sims; Chris M. Grant; Mark P. Ashe; Simon J. Hubbard; Graham D. Pavitt

The PUF family of RNA-binding proteins regulate gene expression post-transcriptionally. Saccharomyces cerevisiae Puf3p is characterised as binding nuclear-encoded mRNAs specifying mitochondrial proteins. Extensive studies of its regulation of COX17 demonstrate its role in mRNA decay. Using integrated genome-wide approaches we define an expanded set of Puf3p target mRNAs and quantitatively assessed the global impact of loss of PUF3 on gene expression using mRNA and polysome profiling and quantitative proteomics. In agreement with prior studies, our sequencing of affinity-purified Puf3-TAP associated mRNAs (RIP-seq) identified mRNAs encoding mitochondrially-targeted proteins. Additionally, we also found 720  new mRNA targets that predominantly encode proteins that enter the nucleus. Comparing transcript levels in wild-type and puf3∆ cells revealed that only a small fraction of mRNA levels alter, suggesting Puf3p determines mRNA stability for only a limited subset of its target mRNAs. Finally, proteomic and translatomic studies suggest that loss of Puf3p has widespread, but modest, impact on mRNA translation. Taken together our integrated multi-omics data point to multiple classes of Puf3p targets, which display coherent post-transcriptional regulatory properties and suggest Puf3p plays a broad, but nuanced, role in the fine-tuning of gene expression.

Collaboration


Dive into the David Talavera's collaboration.

Top Co-Authors

Avatar
Top Co-Authors

Avatar
Top Co-Authors

Avatar

Xavier de la Cruz

Autonomous University of Barcelona

View shared research outputs
Top Co-Authors

Avatar
Top Co-Authors

Avatar

Petr Budera

Charles University in Prague

View shared research outputs
Top Co-Authors

Avatar

Chris M. Grant

University of Manchester

View shared research outputs
Top Co-Authors

Avatar
Top Co-Authors

Avatar
Top Co-Authors

Avatar
Top Co-Authors

Avatar
Researchain Logo
Decentralizing Knowledge