Featured Researches

Biomolecules

BPPart and BPMax: RNA-RNA Interaction Partition Function and Structure Prediction for the Base Pair Counting Model

RNA-RNA interaction (RRI) is ubiquitous and has complex roles in the cellular functions. In human health studies, miRNA-target and lncRNAs are among an elite class of RRIs that have been extensively studied. Bacterial ncRNA-target and RNA interference are other classes of RRIs that have received significant attention. In recent studies, mRNA-mRNA interaction instances have been observed, where both partners appear in the same pathway without any direct link between them, or any prior knowledge about their relationship. Those recently discovered cases suggest that RRI scope is much wider than those aforementioned elite classes. We revisit our RNA-RNA interaction partition function algorithm, piRNA, which computes the partition function, base-pairing probabilities, and structure for the comprehensive Turner energy model using 96 different dynamic programming tables. In this study, we strategically retreat from sophisticated thermodynamic models to the much simpler base pair counting model. That might seem counter-intuitive at the first glance; our idea is to benefit from the advantages of such simple models in terms of running time and memory footprint and compensate for the associated information loss by adding machine learning components in the future. Here, simple weighted base pair counting is considered to obtain BPPart for Base-pair Partition function and BPMax for Base-pair Maximization, which use 9 and 2 tables respectively. They are empirically 225 and 1350 fold faster than piRNA. A correlation of 0.855 and 0.836 was achieved between piRNA and BPPart and between piRNA and BPMax, respectively, in 37 degrees, and 0.920 and 0.904 in -180 degrees. We also discover two partner RNAs, SNORD3D and TRAF3, and hypothesize their potential roles in genetic diseases. We envision fusion of machine learning methods with the proposed algorithms in the future.

Read more
Biomolecules

Backbone chemical shift assignments of human 14-3-3 σ

14-3-3 proteins are a group of seven dimeric adapter proteins that exert their biological function by interacting with hundreds of phosphorylated proteins, thus influencing their sub-cellular localization, activity or stability in the cell. Due to this remarkable interaction network, 14-3-3 proteins have been associated with several pathologies and the protein-protein interactions established with a number of partners are now considered promising drug targets. The activity of 14-3-3 proteins is often isoform specific and to our knowledge only one out of seven isoforms, 14-3-3 ζ , has been assigned. Despite the availability of the crystal structures of all seven isoforms of 14-3-3, the additional NMR assignments of 14-3-3 proteins are important for both biological mechanism studies and chemical biology approaches. Herein, we present a robust backbone assignment of 14-3-3 σ , which will allow advances in the discovery of potential therapeutic compounds. This assignment is now being applied to the discovery of both inhibitors and stabilizers of 14-3-3 protein-protein interactions.

Read more
Biomolecules

Bacterial pathways for the biosynthesis of ubiquinone

Ubiquinone is an important component of the electron transfer chains in proteobacteria and eukaryotes. The biosynthesis of ubiquinone requires multiple steps, most of which are common to bacteria and eukaryotes. Whereas the enzymes of the mitochondrial pathway that produces ubiquinone are highly similar across eukaryotes, recent results point to a rather high diversity of pathways in bacteria. This review focuses on ubiquinone in bacteria, highlighting newly discovered functions and detailing the proteins that are known to participate to its biosynthetic pathways. Novel results showing that ubiquinone can be produced by a pathway independent of dioxygen suggest that ubiquinone may participate to anaerobiosis, in addition to its well established role for aerobiosis. We also discuss the supramolecular organization of ubiquinone biosynthesis proteins and we summarize the current understanding of the evolution of the ubiquinone pathways relative to those of other isoprenoid quinones like menaquinone and plastoquinone.

Read more
Biomolecules

Bayesian Graph Neural Networks for Molecular Property Prediction

Graph neural networks for molecular property prediction are frequently underspecified by data and fail to generalise to new scaffolds at test time. A potential solution is Bayesian learning, which can capture our uncertainty in the model parameters. This study benchmarks a set of Bayesian methods applied to a directed MPNN, using the QM9 regression dataset. We find that capturing uncertainty in both readout and message passing parameters yields enhanced predictive accuracy, calibration, and performance on a downstream molecular search task.

Read more
Biomolecules

Bayesian active learning for optimization and uncertainty quantification in protein docking

Motivation: Ab initio protein docking represents a major challenge for optimizing a noisy and costly "black box"-like function in a high-dimensional space. Despite progress in this field, there is no docking method available for rigorous uncertainty quantification (UQ) of its solution quality (e.g. interface RMSD or iRMSD). Results: We introduce a novel algorithm, Bayesian Active Learning (BAL), for optimization and UQ of such black-box functions and flexible protein docking. BAL directly models the posterior distribution of the global optimum (or native structures for protein docking) with active sampling and posterior estimation iteratively feeding each other. Furthermore, we use complex normal modes to represent a homogeneous Euclidean conformation space suitable for high-dimension optimization and construct funnel-like energy models for encounter complexes. Over a protein docking benchmark set and a CAPRI set including homology docking, we establish that BAL significantly improve against both starting points by rigid docking and refinements by particle swarm optimization, providing for one third targets a top-3 near-native prediction. BAL also generates tight confidence intervals with half range around 25% of iRMSD and confidence level at 85%. Its estimated probability of a prediction being native or not achieves binary classification AUROC at 0.93 and AUPRC over 0.60 (compared to 0.14 by chance); and also found to help ranking predictions. To the best of our knowledge, this study represents the first uncertainty quantification solution for protein docking, with theoretical rigor and comprehensive assessment. Source codes are available at this https URL.

Read more
Biomolecules

Best Practices for Alchemical Free Energy Calculations

Alchemical free energy calculations are a useful tool for predicting free energy differences associated with the transfer of molecules from one environment to another. The hallmark of these methods is the use of "bridging" potential energy functions representing \emph{alchemical} intermediate states that cannot exist as real chemical species. The data collected from these bridging alchemical thermodynamic states allows the efficient computation of transfer free energies (or differences in transfer free energies) with orders of magnitude less simulation time than simulating the transfer process directly. While these methods are highly flexible, care must be taken in avoiding common pitfalls to ensure that computed free energy differences can be robust and reproducible for the chosen force field, and that appropriate corrections are included to permit direct comparison with experimental data. In this paper, we review current best practices for several popular application domains of alchemical free energy calculations, including relative and absolute small molecule binding free energy calculations to biomolecular targets.

Read more
Biomolecules

Bifractal nature of chromosome contact maps

Modern biological techniques such as Hi-C permit to measure probabilities that different chromosomal regions are close in space. These probabilities can be visualised as matrices called contact maps. In this paper, we introduce a multifractal analysis of chromosomal contact maps. Our analysis reveals that Hi-C maps are bifractal, i.e. complex geometrical objects characterized by two distinct fractal dimensions. To rationalize this observation, we introduce a model that describes chromosomes as a hierarchical set of nested domains and we solve it exactly. The predicted multifractal spectrum is in excellent quantitative agreement with experimental data. Moreover, we show that our theory yields to a more robust estimation of the scaling exponent of the contact probability than existing methods. By applying this method to experimental data, we detect subtle conformational changes among chromosomes during differentiation of human stem cells.

Read more
Biomolecules

Binding Pathway of Opiates to μ Opioid Receptors Revealed by Unsupervised Machine Learning

Many important analgesics relieve pain by binding to the μ -Opioid Receptor ( μ OR), which makes the μ OR among the most clinically relevant proteins of the G Protein Coupled Receptor (GPCR) family. Despite previous studies on the activation pathways of the GPCRs, the mechanism of opiate binding and the selectivity of μ OR are largely unknown. We performed extensive molecular dynamics (MD) simulation and analysis to find the selective allosteric binding sites of the μ OR and the path opiates take to bind to the orthosteric site. In this study, we predicted that the allosteric site is responsible for the attraction and selection of opiates. Using Markov state models and machine learning, we traced the pathway of opiates in binding to the orthosteric site, the main binding pocket. Our results have important implications in designing novel analgesics.

Read more
Biomolecules

Bioconjugated oligonucleotides: recent developments and thera-eutic applications

Oligonucleotide-based agents have the potential to treat or cure almost any disease, and are one of the key therapeutic drug classes of the future. Bioconjugated oligonucleotides, a subset of this class, are emerging from basic research and being successfully translated to the clinic. In this review, we first briefly describe two approaches for inhibiting specific genes using oligonucleotides -- antisense DNA (ASO) and RNA interference (RNAi) -- followed by a discussion on delivery to cells. We then summarize and analyze recent developments in bioconjugated oligonucleotides including those possessing GalNAc, cell penetrating peptides, α -tocopherol, aptamers, antibodies, cholesterol, squalene, fatty acids, or nucleolipids. These novel conjugates provide a means to enhance tissue targeting, cell internalization, endosomal escape, target binding specificity, resistance to nucleases, and more. We next describe those bioconjugated oligonucleotides approved for patient use or in clinical trials. Finally, we summarize the state of the field, describe current limitations, and discuss future prospects. Biocon-jugation chemistry is at the centerpiece of this therapeutic oligonucleotide revolution, and significant opportunities exist for development of new modification chemistries, for mechanistic studies at the chemical-biology interface, and for translating such agents to the clinic.

Read more
Biomolecules

Biophysical inference of epistasis and the effects of mutations on protein stability and function

Understanding the relationship between protein sequence, function, and stability is a fundamental problem in biology. While high-throughput methods have produced large numbers of sequence-function pairs, functional assays do not distinguish whether mutations directly affect function or are destabilizing the protein. Here, we introduce a statistical method to infer the underlying biophysics from a high-throughput binding assay by combining information from many mutated variants. We fit a thermodynamic model describing the bound, unbound, and unfolded states to high quality data of protein G domain B1 binding to IgG-Fc. We infer an energy landscape with distinct folding and binding energies for each substitution providing a detailed view of how mutations affect binding and stability across the protein. We accurately infer folding energy of each variant in physical units, validated by independent data, whereas previous high-throughput methods could only measure indirect changes in stability. While we assume an additive sequence-energy relationship, the binding fraction is epistatic due its non-linear relation to energy. Despite having no epistasis in energy, our model explains much of the observed epistasis in binding fraction, with the remaining epistasis identifying conformationally dynamic regions.

Read more

Ready to get started?

Join us today