Network


Latest external collaboration on country level. Dive into details by clicking on the dots.

Hotspot


Dive into the research topics where Martin Weigt is active.

Publication


Featured researches published by Martin Weigt.


Proceedings of the National Academy of Sciences of the United States of America | 2011

Direct-coupling analysis of residue coevolution captures native contacts across many protein families

Faruck Morcos; Andrea Pagnani; Bryan Lunt; Arianna Bertolino; Debora S. Marks; Chris Sander; Riccardo Zecchina; José N. Onuchic; Terence Hwa; Martin Weigt

The similarity in the three-dimensional structures of homologous proteins imposes strong constraints on their sequence variability. It has long been suggested that the resulting correlations among amino acid compositions at different sequence positions can be exploited to infer spatial contacts within the tertiary protein structure. Crucial to this inference is the ability to disentangle direct and indirect correlations, as accomplished by the recently introduced direct-coupling analysis (DCA). Here we develop a computationally efficient implementation of DCA, which allows us to evaluate the accuracy of contact prediction by DCA for a large number of protein domains, based purely on sequence information. DCA is shown to yield a large number of correctly predicted contacts, recapitulating the global structure of the contact map for the majority of the protein domains examined. Furthermore, our analysis captures clear signals beyond intradomain residue contacts, arising, e.g., from alternative protein conformations, ligand-mediated residue couplings, and interdomain interactions in protein oligomers. Our findings suggest that contacts predicted by DCA can be used as a reliable guide to facilitate computational predictions of alternative protein conformations, protein complex formation, and even the de novo prediction of protein domain structures, contingent on the existence of a large number of homologous sequences which are being rapidly made available due to advances in genome sequencing.


Proceedings of the National Academy of Sciences of the United States of America | 2009

Identification of direct residue contacts in protein–protein interaction by message passing

Martin Weigt; Robert A. White; Hendrik Szurmant; James A. Hoch; Terence Hwa

Understanding the molecular determinants of specificity in protein–protein interaction is an outstanding challenge of postgenome biology. The availability of large protein databases generated from sequences of hundreds of bacterial genomes enables various statistical approaches to this problem. In this context covariance-based methods have been used to identify correlation between amino acid positions in interacting proteins. However, these methods have an important shortcoming, in that they cannot distinguish between directly and indirectly correlated residues. We developed a method that combines covariance analysis with global inference analysis, adopted from use in statistical physics. Applied to a set of >2,500 representatives of the bacterial two-component signal transduction system, the combination of covariance with global inference successfully and robustly identified residue pairs that are proximal in space without resorting to ad hoc tuning parameters, both for heterointeractions between sensor kinase (SK) and response regulator (RR) proteins and for homointeractions between RR proteins. The spectacular success of this approach illustrates the effectiveness of the global inference approach in identifying direct interaction based on sequence information alone. We expect this method to be applicable soon to interaction surfaces between proteins present in only 1 copy per genome as the number of sequenced genomes continues to expand. Use of this method could significantly increase the potential targets for therapeutic intervention, shed light on the mechanism of protein–protein interaction, and establish the foundation for the accurate prediction of interacting protein partners.


Physical Review E | 2013

Improved contact prediction in proteins: Using pseudolikelihoods to infer Potts models

Magnus Ekeberg; Cecilia Lövkvist; Yueheng Lan; Martin Weigt; Erik Aurell

Spatially proximate amino acids in a protein tend to coevolve. A proteins three-dimensional (3D) structure hence leaves an echo of correlations in the evolutionary record. Reverse engineering 3D structures from such correlations is an open problem in structural biology, pursued with increasing vigor as more and more protein sequences continue to fill the data banks. Within this task lies a statistical inference problem, rooted in the following: correlation between two sites in a protein sequence can arise from firsthand interaction but can also be network-propagated via intermediate sites; observed correlation is not enough to guarantee proximity. To separate direct from indirect interactions is an instance of the general problem of inverse statistical mechanics, where the task is to learn model parameters (fields, couplings) from observables (magnetizations, correlations, samples) in large systems. In the context of protein sequences, the approach has been referred to as direct-coupling analysis. Here we show that the pseudolikelihood method, applied to 21-state Potts models describing the statistical properties of families of evolutionarily related proteins, significantly outperforms existing approaches to the direct-coupling analysis, the latter being based on standard mean-field techniques. This improved performance also relies on a modified score for the coupling strength. The results are verified using known crystal structures of specific sequence instances of various protein families. Code implementing the new method can be found at http://plmdca.csc.kth.se/.


Proceedings of the National Academy of Sciences of the United States of America | 2012

Genomics-aided structure prediction

Joanna I. Sulkowska; Faruck Morcos; Martin Weigt; Terence Hwa; José N. Onuchic

We introduce a theoretical framework that exploits the ever-increasing genomic sequence information for protein structure prediction. Structure-based models are modified to incorporate constraints by a large number of non-local contacts estimated from direct coupling analysis (DCA) of co-evolving genomic sequences. A simple hybrid method, called DCA-fold, integrating DCA contacts with an accurate knowledge of local information (e.g., the local secondary structure) is sufficient to fold proteins in the range of 1–3 Å resolution.


Proceedings of the National Academy of Sciences of the United States of America | 2012

Structural basis of histidine kinase autophosphorylation deduced by integrating genomics, molecular dynamics, and mutagenesis

Angel E. Dago; Alexander Schug; Andrea Procaccini; James A. Hoch; Martin Weigt; Hendrik Szurmant

Signal transduction proteins such as bacterial sensor histidine kinases, designed to transition between multiple conformations, are often ruled by unstable transient interactions making structural characterization of all functional states difficult. This study explored the inactive and signal-activated conformational states of the two catalytic domains of sensor histidine kinases, HisKA and HATPase. Direct coupling analyses, a global statistical inference approach, was applied to >13,000 such domains from protein databases to identify residue contacts between the two domains. These contacts guided structural assembly of the domains using MAGMA, an advanced molecular dynamics docking method. The active conformation structure generated by MAGMA simultaneously accommodated the sequence derived residue contacts and the ATP-catalytic histidine contact. The validity of this structure was confirmed biologically by mutation of contact positions in the Bacillus subtilis sensor histidine kinase KinA and by restoration of activity in an inactive KinA(HisKA):KinD(HATPase) hybrid protein. These data indicate that signals binding to sensor domains activate sensor histidine kinases by causing localized strain and unwinding at the end of the C-terminal helix of the HisKA domain. This destabilizes the contact positions of the inactive conformation of the two domains, identified by previous crystal structure analyses and by the sequence analysis described here, inducing the formation of the active conformation. This study reveals that structures of unstable transient complexes of interacting proteins and of protein domains are accessible by applying this combination of cross-validating technologies.


PLOS Computational Biology | 2013

From Principal Component to Direct Coupling Analysis of Coevolution in Proteins: Low-Eigenvalue Modes are Needed for Structure Prediction

Simona Cocco; Rémi Monasson; Martin Weigt

Various approaches have explored the covariation of residues in multiple-sequence alignments of homologous proteins to extract functional and structural information. Among those are principal component analysis (PCA), which identifies the most correlated groups of residues, and direct coupling analysis (DCA), a global inference method based on the maximum entropy principle, which aims at predicting residue-residue contacts. In this paper, inspired by the statistical physics of disordered systems, we introduce the Hopfield-Potts model to naturally interpolate between these two approaches. The Hopfield-Potts model allows us to identify relevant ‘patterns’ of residues from the knowledge of the eigenmodes and eigenvalues of the residue-residue correlation matrix. We show how the computation of such statistical patterns makes it possible to accurately predict residue-residue contacts with a much smaller number of parameters than DCA. This dimensional reduction allows us to avoid overfitting and to extract contact information from multiple-sequence alignments of reduced size. In addition, we show that low-eigenvalue correlation modes, discarded by PCA, are important to recover structural information: the corresponding patterns are highly localized, that is, they are concentrated in few sites, which we find to be in close contact in the three-dimensional protein fold.


PLOS Computational Biology | 2013

Perturbation biology: inferring signaling networks in cellular systems.

Evan Molinelli; Anil Korkut; Weiqing Wang; Martin L. Miller; Nicholas Paul Gauthier; Xiaohong Jing; Poorvi Kaushik; Qin He; Gordon B. Mills; David B. Solit; Christine A. Pratilas; Martin Weigt; Alfredo Braunstein; Andrea Pagnani; Riccardo Zecchina; Chris Sander

We present a powerful experimental-computational technology for inferring network models that predict the response of cells to perturbations, and that may be useful in the design of combinatorial therapy against cancer. The experiments are systematic series of perturbations of cancer cell lines by targeted drugs, singly or in combination. The response to perturbation is quantified in terms of relative changes in the measured levels of proteins, phospho-proteins and cellular phenotypes such as viability. Computational network models are derived de novo, i.e., without prior knowledge of signaling pathways, and are based on simple non-linear differential equations. The prohibitively large solution space of all possible network models is explored efficiently using a probabilistic algorithm, Belief Propagation (BP), which is three orders of magnitude faster than standard Monte Carlo methods. Explicit executable models are derived for a set of perturbation experiments in SKMEL-133 melanoma cell lines, which are resistant to the therapeutically important inhibitor of RAF kinase. The resulting network models reproduce and extend known pathway biology. They empower potential discoveries of new molecular interactions and predict efficacious novel drug perturbations, such as the inhibition of PLK1, which is verified experimentally. This technology is suitable for application to larger systems in diverse areas of molecular biology.


Molecular Biology and Evolution | 2016

Coevolutionary Landscape Inference and the Context-Dependence of Mutations in Beta-Lactamase TEM-1

Matteo Figliuzzi; Hervé Jacquier; Alexander Schug; Olivier Tenaillon; Martin Weigt

The quantitative characterization of mutational landscapes is a task of outstanding importance in evolutionary and medical biology: It is, for example, of central importance for our understanding of the phenotypic effect of mutations related to disease and antibiotic drug resistance. Here we develop a novel inference scheme for mutational landscapes, which is based on the statistical analysis of large alignments of homologs of the protein of interest. Our method is able to capture epistatic couplings between residues, and therefore to assess the dependence of mutational effects on the sequence context where they appear. Compared with recent large-scale mutagenesis data of the beta-lactamase TEM-1, a protein providing resistance against beta-lactam antibiotics, our method leads to an increase of about 40% in explicative power as compared with approaches neglecting epistasis. We find that the informative sequence context extends to residues at native distances of about 20 Å from the mutated site, reaching thus far beyond residues in direct physical contact.


PLOS ONE | 2014

Fast and Accurate Multivariate Gaussian Modeling of Protein Families: Predicting Residue Contacts and Protein-Interaction Partners

Carlo Baldassi; Marco Zamparo; Christoph Feinauer; Andrea Procaccini; Riccardo Zecchina; Martin Weigt; Andrea Pagnani

In the course of evolution, proteins show a remarkable conservation of their three-dimensional structure and their biological function, leading to strong evolutionary constraints on the sequence variability between homologous proteins. Our method aims at extracting such constraints from rapidly accumulating sequence data, and thereby at inferring protein structure and function from sequence information alone. Recently, global statistical inference methods (e.g. direct-coupling analysis, sparse inverse covariance estimation) have achieved a breakthrough towards this aim, and their predictions have been successfully implemented into tertiary and quaternary protein structure prediction methods. However, due to the discrete nature of the underlying variable (amino-acids), exact inference requires exponential time in the protein length, and efficient approximations are needed for practical applicability. Here we propose a very efficient multivariate Gaussian modeling approach as a variant of direct-coupling analysis: the discrete amino-acid variables are replaced by continuous Gaussian random variables. The resulting statistical inference problem is efficiently and exactly solvable. We show that the quality of inference is comparable or superior to the one achieved by mean-field approximations to inference with discrete variables, as done by direct-coupling analysis. This is true for (i) the prediction of residue-residue contacts in proteins, and (ii) the identification of protein-protein interaction partner in bacterial signal transduction. An implementation of our multivariate Gaussian approach is available at the website http://areeweb.polito.it/ricerca/cmp/code.


Nucleic Acids Research | 2015

Direct-Coupling Analysis of nucleotide coevolution facilitates RNA secondary and tertiary structure prediction

Eleonora De Leonardis; Benjamin Lutz; Sebastian Ratz; Simona Cocco; Rémi Monasson; Alexander Schug; Martin Weigt

Despite the biological importance of non-coding RNA, their structural characterization remains challenging. Making use of the rapidly growing sequence databases, we analyze nucleotide coevolution across homologous sequences via Direct-Coupling Analysis to detect nucleotide-nucleotide contacts. For a representative set of riboswitches, we show that the results of Direct-Coupling Analysis in combination with a generalized Nussinov algorithm systematically improve the results of RNA secondary structure prediction beyond traditional covariance approaches based on mutual information. Even more importantly, we show that the results of Direct-Coupling Analysis are enriched in tertiary structure contacts. By integrating these predictions into molecular modeling tools, systematically improved tertiary structure predictions can be obtained, as compared to using secondary structure information alone.

Collaboration


Dive into the Martin Weigt's collaboration.

Top Co-Authors

Avatar

Hendrik Szurmant

Scripps Research Institute

View shared research outputs
Top Co-Authors

Avatar

Alexander Schug

Karlsruhe Institute of Technology

View shared research outputs
Top Co-Authors

Avatar

Rémi Monasson

École Normale Supérieure

View shared research outputs
Top Co-Authors

Avatar

Simona Cocco

École Normale Supérieure

View shared research outputs
Top Co-Authors

Avatar

Terence Hwa

University of California

View shared research outputs
Top Co-Authors

Avatar
Top Co-Authors

Avatar
Top Co-Authors

Avatar

Faruck Morcos

University of Notre Dame

View shared research outputs
Top Co-Authors

Avatar
Top Co-Authors

Avatar

Eleonora De Leonardis

Centre national de la recherche scientifique

View shared research outputs
Researchain Logo
Decentralizing Knowledge