R. Gonzalo Parra
Facultad de Ciencias Exactas y Naturales
Network
Latest external collaboration on country level. Dive into details by clicking on the dots.
Publication
Featured researches published by R. Gonzalo Parra.
Nucleic Acids Research | 2012
Michael Jenik; R. Gonzalo Parra; Leandro G. Radusky; Adrián G. Turjanski; Peter G. Wolynes; Diego U. Ferreiro
The frustratometer is an energy landscape theory-inspired algorithm that aims at quantifying the location of frustration manifested in protein molecules. Frustration is a useful concept for gaining insight to the proteins biological behavior by analyzing how the energy is distributed in protein structures and how mutations or conformational changes shift the energetics. Sites of high local frustration often indicate biologically important regions involved in binding or allostery. In contrast, minimally frustrated linkages comprise a stable folding core of the molecule that is conserved in conformational changes. Here, we describe the implementation of these ideas in a webserver freely available at the National EMBNet node-Argentina, at URL: http://lfp.qb.fcen.uba.ar/embnet/.
Journal of Physical Chemistry B | 2013
R. Gonzalo Parra; Rocío Espada; Ignacio E. Sánchez; Manfred J. Sippl; Diego U. Ferreiro
The notion of energy landscapes provides conceptual tools for understanding the complexities of protein folding and function. Energy landscape theory indicates that it is much easier to find sequences that satisfy the “Principle of Minimal Frustration” when the folded structure is symmetric (Wolynes, P. G. Symmetry and the Energy Landscapes of Biomolecules. Proc. Natl. Acad. Sci. U.S.A.1996, 93, 14249–14255). Similarly, repeats and structural mosaics may be fundamentally related to landscapes with multiple embedded funnels. Here we present analytical tools to detect and compare structural repetitions in protein molecules. By an exhaustive analysis of the distribution of structural repeats using a robust metric, we define those portions of a protein molecule that best describe the overall structure as a tessellation of basic units. The patterns produced by such tessellations provide intuitive representations of the repeating regions and their association toward higher order arrangements. We find that some protein architectures can be described as nearly periodic, while in others clear separations between repetitions exist. Since the method is independent of amino acid sequence information, we can identify structural units that can be encoded by a variety of distinct amino acid sequences.
Nucleic Acids Research | 2016
R. Gonzalo Parra; Nicholas P. Schafer; Leandro G. Radusky; Min-Yeh Tsai; A. Brenda Guzovsky; Peter G. Wolynes; Diego U. Ferreiro
The protein frustratometer is an energy landscape theory-inspired algorithm that aims at localizing and quantifying the energetic frustration present in protein molecules. Frustration is a useful concept for analyzing proteins’ biological behavior. It compares the energy distributions of the native state with respect to structural decoys. The network of minimally frustrated interactions encompasses the folding core of the molecule. Sites of high local frustration often correlate with functional regions such as binding sites and regions involved in allosteric transitions. We present here an upgraded version of a webserver that measures local frustration. The new implementation that allows the inclusion of electrostatic energy terms, important to the interactions with nucleic acids, is significantly faster than the previous version enabling the analysis of large macromolecular complexes within a user-friendly interface. The webserver is freely available at URL: http://frustratometer.qb.fcen.uba.ar.
PLOS Computational Biology | 2015
R. Gonzalo Parra; Rocío Espada; Nina Verstraete; Diego U. Ferreiro
Ankyrin repeat containing proteins are one of the most abundant solenoid folds. Usually implicated in specific protein-protein interactions, these proteins are readily amenable for design, with promising biotechnological and biomedical applications. Studying repeat protein families presents technical challenges due to the high sequence divergence among the repeating units. We developed and applied a systematic method to consistently identify and annotate the structural repetitions over the members of the complete Ankyrin Repeat Protein Family, with increased sensitivity over previous studies. We statistically characterized the number of repeats, the folding of the repeat-arrays, their structural variations, insertions and deletions. An energetic analysis of the local frustration patterns reveal the basic features underlying fold stability and its relation to the functional binding regions. We found a strong linear correlation between the conservation of the energetic features in the repeat arrays and their sequence variations, and discuss new insights into the organization and function of these ubiquitous proteins.
BMC Bioinformatics | 2015
Rocío Espada; R. Gonzalo Parra; Thierry Mora; Aleksandra M. Walczak; Diego U. Ferreiro
BackgroundThe analysis of correlations of amino acid occurrences in globular domains has led to the development of statistical tools that can identify native contacts – portions of the chains that come to close distance in folded structural ensembles. Here we introduce a direct coupling analysis for repeat proteins – natural systems for which the identification of folding domains remains challenging.ResultsWe show that the inherent translational symmetry of repeat protein sequences introduces a strong bias in the pair correlations at precisely the length scale of the repeat-unit. Equalizing for this bias in an objective way reveals true co-evolutionary signals from which local native contacts can be identified. Importantly, parameter values obtained for all other interactions are not significantly affected by the equalization. We quantify the robustness of the procedure and assign confidence levels to the interactions, identifying the minimum number of sequences needed to extract evolutionary information in several repeat protein families.ConclusionsThe overall procedure can be used to reconstruct the interactions at distances larger than repeat-pairs, identifying the characteristics of the strongest couplings in each family, and can be applied to any system that appears translationally symmetric.
Biochemical Society Transactions | 2015
Rocío Espada; R. Gonzalo Parra; Manfred J. Sippl; Thierry Mora; Aleksandra M. Walczak; Diego U. Ferreiro
Structural domains are believed to be modules within proteins that can fold and function independently. Some proteins show tandem repetitions of apparent modular structure that do not fold independently, but rather co-operate in stabilizing structural forms that comprise several repeat-units. For many natural repeat-proteins, it has been shown that weak energetic links between repeats lead to the breakdown of co-operativity and the appearance of folding sub-domains within an apparently regular repeat array. The quasi-1D architecture of repeat-proteins is crucial in detailing how the local energetic balances can modulate the folding dynamics of these proteins, which can be related to the physiological behaviour of these ubiquitous biological systems.
Bioinformatics | 2013
Cristian Oscar Rohr; R. Gonzalo Parra; Patricio Yankilevich; Carolina Perez-Castro
MOTIVATION Transcriptional regulation occurs through the concerted actions of multiple transcription factors (TFs) that bind cooperatively to cis-regulatory modules (CRMs) of genes. These CRMs usually contain a variable number of transcription factor-binding sites (TFBSs) involved in related cellular and physiological processes. Chromatin immunoprecipitation followed by sequencing (ChIP-seq) has been effective in detecting TFBSs and nucleosome location to identify potential CRMs in genome-wide studies. Although several attempts were previously reported to predict the potential binding of TFs at TFBSs within CRMs by comparing different ChIP-seq data, these have been hampered by excessive background, usually emerging as a consequence of experimental conditions. To understand these complex regulatory circuits, it would be helpful to have reliable and updated user-friendly tools to assist in the identification of TFBSs and CRMs for gene(s) of interest. RESULTS Here we present INSECT (IN-silico SEarch for Co-occurring Transcription factors), a novel web server for identifying potential TFBSs and CRMs in gene sequences. By combining several strategies, INSECT provides flexible analysis of multiple co-occurring TFBSs, by applying differing search schemes and restriction parameters. availability and implementation: INSECT is freely available as a web server at http://bioinformatics.ibioba-mpsp-conicet.gov.ar/INSECT .
Scientific Reports | 2016
Pablo Turjanski; R. Gonzalo Parra; Rocío Espada; Verónica Becher; Diego U. Ferreiro
Some natural proteins display recurrent structural patterns. Despite being highly similar at the tertiary structure level, repeating patterns within a single repeat protein can be extremely variable at the sequence level. We use a mathematical definition of a repetition and investigate the occurrences of these in sequences of different protein families. We found that long stretches of perfect repetitions are infrequent in individual natural proteins, even for those which are known to fold into structures of recurrent structural motifs. We found that natural repeat proteins are indeed repetitive in their families, exhibiting abundant stretches of 6 amino acids or longer that are perfect repetitions in the reference family. We provide a systematic quantification for this repetitiveness. We show that this form of repetitiveness is not exclusive of repeat proteins, but also occurs in globular domains. A by-product of this work is a fast quantification of the likelihood of a protein to belong to a family.
bioRxiv | 2018
R. Gonzalo Parra; Nikolaos Papadopoulos; Laura Ahumada-Arranz; Jakob El Kholtei; Barbara Treutlein; Johannes Soeding
Advances in single-cell transcriptomics techniques are revolutionizing studies of cellular differentiation and heterogeneity. Consequently, it becomes possible to track the trajectory of thousands of genes across the cellular lineage trees that represent the temporal emergence of cell types during dynamic processes. However, reconstruction of cellular lineage trees with more than a few cell fates has proved challenging. We present MERLoT (https://github.com/soedinglab/merlot), a flexible and user-friendly tool to reconstruct complex lineage trees from single-cell transcriptomics data and further impute temporal gene expression profiles along the reconstructed tree structures. We demonstrate MERLoT’s capabilities on various real cases and hundreds of simulated datasets.
Bioinformatics | 2016
R. Gonzalo Parra; Cristian Oscar Rohr; Daniel Koile; Carolina Perez-Castro; Patricio Yankilevich
UNLABELLED INSECT is a user-friendly web server to predict the occurrence of Cis-Regulatory Modules (CRMs), which control gene expression. Here, we present a new release of INSECT which includes several new features, such as whole genome analysis, nucleosome occupancy predictions, and which provides additional links to third-party functional tools that complement user capabilities, CRM analysis and hypothesis construction. Improvements in the core implementation have led to a faster and more efficient tool. In addition, this new release introduces a new interface designed for a more integrative and dynamic user experience. AVAILABILITY AND IMPLEMENTATION http://bioinformatics.ibioba-mpsp-conicet.gov.ar/INSECT2 CONTACT: [email protected].