Network


Latest external collaboration on country level. Dive into details by clicking on the dots.

Hotspot


Dive into the research topics where Menachem Fromer is active.

Publication


Featured researches published by Menachem Fromer.


Molecular Systems Biology | 2009

Viral adaptation to host: a proteome-based analysis of codon usage and amino acid preferences

Iris Bahir; Menachem Fromer; Yosef Prat; Michal Linial

Viruses differ markedly in their specificity toward host organisms. Here, we test the level of general sequence adaptation that viruses display toward their hosts. We compiled a representative data set of viruses that infect hosts ranging from bacteria to humans. We consider their respective amino acid and codon usages and compare them among the viruses and their hosts. We show that bacteria‐infecting viruses are strongly adapted to their specific hosts, but that they differ from other unrelated bacterial hosts. Viruses that infect humans, but not those that infect other mammals or aves, show a strong resemblance to most mammalian and avian hosts, in terms of both amino acid and codon preferences. In groups of viruses that infect humans or other mammals, the highest observed level of adaptation of viral proteins to host codon usages is for those proteins that appear abundantly in the virion. In contrast, proteins that are known to participate in host‐specific recognition do not necessarily adapt to their respective hosts. The implication for the potential of viral infectivity is discussed.


intelligent systems in molecular biology | 2008

Efficient algorithms for accurate hierarchical clustering of huge datasets

Yaniv Loewenstein; Elon Portugaly; Menachem Fromer; Michal Linial

Motivation: UPGMA (average linking) is probably the most popular algorithm for hierarchical data clustering, especially in computational biology. However, UPGMA requires the entire dissimilarity matrix in memory. Due to this prohibitive requirement, UPGMA is not scalable to very large datasets. Application: We present a novel class of memory-constrained UPGMA (MC-UPGMA) algorithms. Given any practical memory size constraint, this framework guarantees the correct clustering solution without explicitly requiring all dissimilarities in memory. The algorithms are general and are applicable to any dataset. We present a data-dependent characterization of hardness and clustering efficiency. The presented concepts are applicable to any agglomerative clustering formulation. Results: We apply our algorithm to the entire collection of protein sequences, to automatically build a comprehensive evolutionary-driven hierarchy of proteins from sequence alone. The newly created tree captures protein families better than state-of-the-art large-scale methods such as CluSTr, ProtoNet4 or single-linkage clustering. We demonstrate that leveraging the entire mass embodied in all sequence similarities allows to significantly improve on current protein family clusterings which are unable to directly tackle the sheer mass of this data. Furthermore, we argue that non-metric constraints are an inherent complexity of the sequence space and should not be overlooked. The robustness of UPGMA allows significant improvement, especially for multidomain proteins, and for large or divergent families. Availability: A comprehensive tree built from all UniProt sequence similarities, together with navigation and classification tools will be made available as part of the ProtoNet service. A C++ implementation of the algorithm is available on request. Contact: [email protected]


Nucleic Acids Research | 2004

ProtoNet 4.0: A hierarchical classification of one million protein sequences

Noam Kaplan; Ori Sasson; Uri Inbar; Moriah Friedlich; Menachem Fromer; Hillel Fleischer; Elon Portugaly; Nathan Linial; Michal Linial

ProtoNet is an automatic hierarchical classification of the protein sequence space. In 2004, the ProtoNet (version 4.0) presents the analysis of over one million proteins merged from SwissProt and TrEMBL databases. In addition to rich visualization and analysis tools to navigate the clustering hierarchy, we incorporated several improvements that allow a simplified view of the scaffold of the proteins. An unsupervised, biologically valid method that was developed resulted in a condensation of the ProtoNet hierarchy to only 12% of the clusters. A large portion of these clusters was automatically assigned high confidence biological names according to their correspondence with functional annotations. ProtoNet is available at: http://www.protonet.cs.huji.ac.il.


Mechanisms of Development | 1999

Vg1 RBP intracellular distribution and evolutionarily conserved expression at multiple stages during development.

Qinghong Zhang; Karina Yaniv; Froma Oberman; Uta Wolke; Anna Git; Menachem Fromer; William L. Taylor; Dirk Meyer; Nancy Standart; Erez Raz; Joel K. Yisraeli

We have analyzed the expression and intracellular distribution, during oogenesis and embryogenesis, of Vg1 RBP, a protein implicated in the intracellular localization of Vg1 mRNA to the vegetal cortex of Xenopus oocytes. Vg1 RBP (protein) colocalizes with Vg1 RNA at all stages of oogenesis. Vg1 RBP RNA, however, localizes to the animal pole during late oogenesis, and remains in the animal blastomeres and ectodermal precursors until its zygotic transcription is activated, around stage 12. Vg1 RBP mRNA then becomes expressed throughout the neural epithelium. Vg1 RBP mRNA expression is also detected in what appears to be neural crest cells undergoing delamination and lateral migration. By tailbud stages, Vg1 RBP expression is present in the branchial arches, otic vesicle, pronephros, and along the neural tube. To examine the expression pattern in different species, we cloned the zebrafish homolog of Vg1 RBP by using a highly homologous EST clone to screen an embryonic cDNA library. In situ hybridization reveals that Vg1 RBP RNA localizes early in oogenesis to the animal pole. Although Vg1 RBP RNA is detected in all blastomeres of the early embryo, the expression pattern in the one day old zebrafish embryo is almost identical to that of the equivalent stage Xenopus embryo. These results indicate that the zygotic expression pattern is similar in frogs and fish, and that there is a conserved zygotic expression of Vg1 RBP distinct from its expression in the oocyte.


PLOS Computational Biology | 2009

Tradeoff Between Stability and Multispecificity in the Design of Promiscuous Proteins

Menachem Fromer; Julia M. Shifman

Natural proteins often partake in several highly specific protein-protein interactions. They are thus subject to multiple opposing forces during evolutionary selection. To be functional, such multispecific proteins need to be stable in complex with each interaction partner, and, at the same time, to maintain affinity toward all partners. How is this multispecificity acquired through natural evolution? To answer this compelling question, we study a prototypical multispecific protein, calmodulin (CaM), which has evolved to interact with hundreds of target proteins. Starting from high-resolution structures of sixteen CaM-target complexes, we employ state-of-the-art computational methods to predict a hundred CaM sequences best suited for interaction with each individual CaM target. Then, we design CaM sequences most compatible with each possible combination of two, three, and all sixteen targets simultaneously, producing almost 70,000 low energy CaM sequences. By comparing these sequences and their energies, we gain insight into how nature has managed to find the compromise between the need for favorable interaction energies and the need for multispecificity. We observe that designing for more partners simultaneously yields CaM sequences that better match natural sequence profiles, thus emphasizing the importance of such strategies in nature. Furthermore, we show that the CaM binding interface can be nicely partitioned into positions that are critical for the affinity of all CaM-target complexes and those that are molded to provide interaction specificity. We reveal several basic categories of sequence-level tradeoffs that enable the compromise necessary for the promiscuity of this protein. We also thoroughly quantify the tradeoff between interaction energetics and multispecificity and find that facilitating seemingly competing interactions requires only a small deviation from optimal energies. We conclude that multispecific proteins have been subjected to a rigorous optimization process that has fine-tuned their sequences for interactions with a precise set of targets, thus conferring their multiple cellular functions.


BMC Evolutionary Biology | 2009

Codon usage is associated with the evolutionary age of genes in metazoan genomes

Yosef Prat; Menachem Fromer; Nathan Linial; Michal Linial

BackgroundCodon usage may vary significantly between different organisms and between genes within the same organism. Several evolutionary processes have been postulated to be the predominant determinants of codon usage: selection, mutation, and genetic drift. However, the relative contribution of each of these factors in different species remains debatable. The availability of complete genomes for tens of multicellular organisms provides an opportunity to inspect the relationship between codon usage and the evolutionary age of genes.ResultsWe assign an evolutionary age to a gene based on the relative positions of its identified homologues in a standard phylogenetic tree. This yields a classification of all genes in a genome to several evolutionary age classes. The present study starts from the observation that each age class of genes has a unique codon usage and proceeds to provide a quantitative analysis of the codon usage in these classes. This observation is made for the genomes of Homo sapiens, Mus musculus, and Drosophila melanogaster. It is even more remarkable that the differences between codon usages in different age groups exhibit similar and consistent behavior in various organisms. While we find that GC content and gene length are also associated with the evolutionary age of genes, they can provide only a partial explanation for the observed codon usage.ConclusionWhile factors such as GC content, mutational bias, and selection shape the codon usage in a genome, the evolutionary history of an organism over hundreds of millions of years is an overlooked property that is strongly linked to GC content, protein length, and, even more significantly, to the codon usage of metazoan genomes.


BMC Bioinformatics | 2004

A functional hierarchical organization of the protein sequence space

Noam Kaplan; Moriah Friedlich; Menachem Fromer; Michal Linial

BackgroundIt is a major challenge of computational biology to provide a comprehensive functional classification of all known proteins. Most existing methods seek recurrent patterns in known proteins based on manually-validated alignments of known protein families. Such methods can achieve high sensitivity, but are limited by the necessary manual labor. This makes our current view of the protein world incomplete and biased. This paper concerns ProtoNet, a automatic unsupervised global clustering system that generates a hierarchical tree of over 1,000,000 proteins, based solely on sequence similarity.ResultsIn this paper we show that ProtoNet correctly captures functional and structural aspects of the protein world. Furthermore, a novel feature is an automatic procedure that reduces the tree to 12% its original size. This procedure utilizes only parameters intrinsic to the clustering process. Despite the substantial reduction in size, the systems predictive power concerning biological functions is hardly affected. We then carry out an automatic comparison with existing functional protein annotations. Consequently, 78% of the clusters in the compressed tree (5,300 clusters) get assigned a biological function with a high confidence. The clustering and compression processes are unsupervised, and robust.ConclusionsWe present an automatically generated unbiased method that provides a hierarchical classification of all currently known proteins.


Journal of Computational Chemistry | 2007

Dead‐end elimination for multistate protein design

Chen Yanover; Menachem Fromer; Julia M. Shifman

Multistate protein design is the task of predicting the amino acid sequence that is best suited to selectively and stably fold to one state out of a set of competing structures. Computationally, it entails solving a challenging optimization problem. Therefore, notwithstanding the increased interest in multistate design, the only implementations reported are based on either genetic algorithms or Monte Carlo methods. The dead‐end elimination (DEE) theorem cannot be readily transfered to multistate design problems despite its successful application to single‐state protein design. In this article we propose a variant of the standard DEE, called type‐dependent DEE. Our method reduces the size of the conformational space of the multistate design problem, while provably preserving the minimal energy conformational assignment for any choice of amino acid sequence. Type‐dependent DEE can therefore be used as a preprocessing step in any computational multistate design scheme. We demonstrate the applicability of type‐dependent DEE on a set of multistate design problems and discuss its strength and limitations.


Proteins | 2009

Accurate prediction for atomic-level protein design and its application in diversifying the near-optimal sequence space

Menachem Fromer; Chen Yanover

The task of engineering a protein to assume a target three‐dimensional structure is known as protein design. Computational search algorithms are devised to predict a minimal energy amino acid sequence for a particular structure. In practice, however, an ensemble of low‐energy sequences is often sought. Primarily, this is performed because an individual predicted low‐energy sequence may not necessarily fold to the target structure because of both inaccuracies in modeling protein energetics and the nonoptimal nature of search algorithms employed. Additionally, some low‐energy sequences may be overly stable and thus lack the dynamic flexibility required for biological functionality. Furthermore, the investigation of low‐energy sequence ensembles will provide crucial insights into the pseudo‐physical energy force fields that have been derived to describe structural energetics for protein design. Significantly, numerous studies have predicted low‐energy sequences, which were subsequently synthesized and demonstrated to fold to desired structures. However, the characterization of the sequence space defined by such energy functions as compatible with a target structure has not been performed in full detail. This issue is critical for protein design scientists to successfully continue using these force fields at an ever‐increasing pace and scale. In this paper, we present a conceptually novel algorithm that rapidly predicts the set of lowest energy sequences for a given structure. Based on the theory of probabilistic graphical models, it performs efficient inspection and partitioning of the near‐optimal sequence space, without making any assumptions of positional independence. We benchmark its performance on a diverse set of relevant protein design examples and show that it consistently yields sequences of lower energy than those derived from state‐of‐the‐art techniques. Thus, we find that previously presented search techniques do not fully depict the low‐energy space as precisely. Examination of the predicted ensembles indicates that, for each structure, the amino acid identity at a majority of positions must be chosen extremely selectively so as to not incur significant energetic penalties. We investigate this high degree of similarity and demonstrate how more diverse near‐optimal sequences can be predicted in order to systematically overcome this bottleneck for computational design. Finally, we exploit this in‐depth analysis of a collection of the lowest energy sequences to suggest an explanation for previously observed experimental design results. The novel methodologies introduced here accurately portray the sequence space compatible with a protein structure and further supply a scheme to yield heterogeneous low‐energy sequences, thus providing a powerful instrument for future work on protein design. Proteins 2009.


intelligent systems in molecular biology | 2008

A computational framework to empower probabilistic protein design

Menachem Fromer; Chen Yanover

Motivation: The task of engineering a protein to perform a target biological function is known as protein design. A commonly used paradigm casts this functional design problem as a structural one, assuming a fixed backbone. In probabilistic protein design, positional amino acid probabilities are used to create a random library of sequences to be simultaneously screened for biological activity. Clearly, certain choices of probability distributions will be more successful in yielding functional sequences. However, since the number of sequences is exponential in protein length, computational optimization of the distribution is difficult. Results: In this paper, we develop a computational framework for probabilistic protein design following the structural paradigm. We formulate the distribution of sequences for a structure using the Boltzmann distribution over their free energies. The corresponding probabilistic graphical model is constructed, and we apply belief propagation (BP) to calculate marginal amino acid probabilities. We test this method on a large structural dataset and demonstrate the superiority of BP over previous methods. Nevertheless, since the results obtained by BP are far from optimal, we thoroughly assess the paradigm using high-quality experimental data. We demonstrate that, for small scale sub-problems, BP attains identical results to those produced by exact inference on the paradigmatic model. However, quantitative analysis shows that the distributions predicted significantly differ from the experimental data. These findings, along with the excellent performance we observed using BP on the smaller problems, suggest potential shortcomings of the paradigm. We conclude with a discussion of how it may be improved in the future. Contact: [email protected]

Collaboration


Dive into the Menachem Fromer's collaboration.

Top Co-Authors

Avatar

Michal Linial

Hebrew University of Jerusalem

View shared research outputs
Top Co-Authors

Avatar

Nathan Linial

Hebrew University of Jerusalem

View shared research outputs
Top Co-Authors

Avatar

Yosef Prat

Hebrew University of Jerusalem

View shared research outputs
Top Co-Authors

Avatar

Chen Yanover

Hebrew University of Jerusalem

View shared research outputs
Top Co-Authors

Avatar

Amir Globerson

Hebrew University of Jerusalem

View shared research outputs
Top Co-Authors

Avatar

Chen Yanover

Hebrew University of Jerusalem

View shared research outputs
Top Co-Authors

Avatar

Elon Portugaly

Hebrew University of Jerusalem

View shared research outputs
Top Co-Authors

Avatar

Julia M. Shifman

Hebrew University of Jerusalem

View shared research outputs
Top Co-Authors

Avatar

Moriah Friedlich

Hebrew University of Jerusalem

View shared research outputs
Top Co-Authors

Avatar

Noam Kaplan

Weizmann Institute of Science

View shared research outputs
Researchain Logo
Decentralizing Knowledge