Featured Researches

Biomolecules

Fodis: software for protein unfolding analysis

The folding dynamics of proteins at the single molecule level has been studied with single-molecule force spectroscopy (SMFS) experiments for twenty years, but a common standardized method for the analysis of the collected data and for the sharing among the scientific community members is still not available. We have developed a new open source tool, Fodis, for the analysis of the Force-distance curves obtained in SMFS experiments, providing an almost automatic processing, analysis and classification of the obtained data. Our method provides also a classification of the possible unfolding pathways and structural heterogeneity, present during the unfolding of proteins.

Read more
Biomolecules

Free Radical Scavenging and Cytotoxic Activities of Substituted Pyrimidines

A library of substituted pyrimidines was synthesized and evaluated for free radical scavenging, and in vitro cytotoxic activity in 3T3 cells. All compounds showed good free radical scavenging activity with IC50 values in the range of 42.9 + 0.31 to 438.3 3.3 {\mu}M as compared to the standard butylated hydroxytoluene having IC50 value of 128.83 2. 1 {\mu}M. The structure activity-relationship was also established. Selected analogues 1, 2, 3, 5, 6, 7, 8, 9, 10, 12, 13, 15, 19, 20, 21, 24, 25, 26 and 28 were tested for cytotoxicity in mouse fibroblast 3T3 cell line using MTT assay, and most of the analogues showed cytotoxicity. This study has identified a number of cytotoxic novel substituted pyrimidines having free radical scavenging activities that can be used as inhibitory compounds for those cancer cells whose growth is mediated by reactive oxygen species.

Read more
Biomolecules

Frictional Effects on RNA Folding: Speed Limit and Kramers Turnover

We investigated frictional effects on the folding rates of a human telomerase hairpin (hTR HP) and H-type pseudoknot from the Beet Western Yellow Virus (BWYV PK) using simulations of the Three Interaction Site (TIS) model for RNA. The heat capacity from TIS model simulations, calculated using temperature replica exchange simulations, reproduces nearly quantitatively the available experimental data for the hTR HP. The corresponding results for BWYV PK serve as predictions. We calculated the folding rates ( k F ) from more than 100 folding trajectories for each value of the solvent viscosity ( η ) at a fixed salt concentration of 200 mM. By using the theoretical estimate ( ∝ N − − √ where N is the number of nucleotides) for folding free energy barrier, k F data for both the RNAs are quantitatively fit using one-dimensional Kramers' theory with two parameters specifying the curvatures in the unfolded basin and the barrier top. In the high-friction regime ( η≳ 10 −5 Pa\ensuremath{\cdot}s ), for both HP and PK, k F s decrease as 1/η whereas in the low friction regime, k F values increase as η increases, leading to a maximum folding rate at a moderate viscosity ( ∼ 10 −6 Pa\ensuremath{\cdot}s ), which is the Kramers turnover. From the fits, we find that the speed limit to RNA folding at water viscosity is between 1 and 4 μs , which is in accord with our previous theoretical prediction as well as results from several single molecule experiments. Both the RNA constructs fold by parallel pathways. Surprisingly, we find that the flux through the pathways could be altered by changing solvent viscosity, a prediction that is more easily testable in RNA than in proteins.

Read more
Biomolecules

GLN -- a method to reveal unique properties of lasso type topology in proteins

Geometry and topology are the main factors that determine the functional properties of proteins. In this work, we show how to use the Gauss linking integral (GLN) in the form of a matrix diagram - for a pair of a loop and a tail - to study both the geometry and topology of proteins with closed loops e.g. lassos. We show that the GLN method is a significantly faster technique to detect entanglement in lasso proteins in comparison with other methods. Based on the GLN technique, we conduct comprehensive analysis of all proteins deposited in the PDB and compare it to the statistical properties of the polymers. We found that there are significantly more lassos with negative crossings than those with positive ones in proteins, the average value of maxGLN (maximal GLN between loop and pieces of tail) depends logarithmically on the length of a tail similarly as in the polymers. Next, we show the how high and low GLN values correlate with the internal exibility of proteins, and how the GLN in the form of a matrix diagram can be used to study folding and unfolding routes. Finally, we discuss how the GLN method can be applied to study entanglement between two structures none of which are closed loops. Since this approach is much faster than other linking invariants, the next step will be evaluation of lassos in much longer molecules such as RNA or loops in a single chromosome.

Read more
Biomolecules

GPU accelerated enumeration and exploration of HP model genotype-phenotype maps for protein folding

Evolution can be broadly described in terms of mutations of the genotype and the subsequent selection of the phenotype. The full enumeration of a given genotype-phenotype (GP) map is therefore a powerful technique in examining evolutionary landscapes. However, because the number of genotypes typically grows exponentially with genome length, such calculations rapidly become intractable. Here I apply graphics processing unit(GPU) techniques to the hydrophobic-polar (HP)model for protein folding. This GP map is a simple and well-studied model for the complex process of protein folding. Prior studies on relatively small 2D and 3D lattices have been exclusively carried out using conventional central processing unit (CPU) approaches. By using GPU techniques, I was able to reproduce the pioneering calculations of Li et al.[1] with a speed up of 580-700 fold over a CPU. I was also able to perform the largest enumeration to date of the 6x6 lattice. These novel calculations provide evidence that a popular "plum-pudding" metaphor that suggests that phenotypes are disconnected in genotype space does not describe the data. Instead a "spaghetti" metaphor of connected genotype networks may be more suitable. Furthermore, the data allows the relationships between designability and complexity within GP space to be explored. GPU approaches appear extremely well suited toGP mapping and the success of this work provides a promising introduction for its wider application in this field.

Read more
Biomolecules

GPU-Accelerated Drug Discovery with Docking on the Summit Supercomputer: Porting, Optimization, and Application to COVID-19 Research

Protein-ligand docking is an in silico tool used to screen potential drug compounds for their ability to bind to a given protein receptor within a drug-discovery campaign. Experimental drug screening is expensive and time consuming, and it is desirable to carry out large scale docking calculations in a high-throughput manner to narrow the experimental search space. Few of the existing computational docking tools were designed with high performance computing in mind. Therefore, optimizations to maximize use of high-performance computational resources available at leadership-class computing facilities enables these facilities to be leveraged for drug discovery. Here we present the porting, optimization, and validation of the AutoDock-GPU program for the Summit supercomputer, and its application to initial compound screening efforts to target proteins of the SARS-CoV-2 virus responsible for the current COVID-19 pandemic.

Read more
Biomolecules

General Principles of Secondary Active Transporter Function

Transport of ions and small molecules across the cell membrane against electrochemical gradients is catalyzed by integral membrane proteins that use a source of free energy to drive the energetically uphill flux of the transported substrate. Secondary active transporters couple the spontaneous influx of a "driving" ion such as Na+ or H+ to the flux of the substrate. The thermodynamics of such cyclical non-equilibrium systems are well understood and recent work has focused on the molecular mechanism of secondary active transport. The fact that these transporters change their conformation between an inward-facing and outward-facing conformation in a cyclical fashion, called the alternating access model, is broadly recognized as the molecular framework in which to describe transporter function. However, only with the advent of high resolution crystal structures and detailed computer simulations has it become possible to recognize common molecular-level principles between disparate transporter families. Inverted repeat symmetry in secondary active transporters has shed light on how protein structures can encode a bi-stable two-state system. More detailed analysis (based on experimental structural data and detailed molecular dynamics simulations) indicates that transporters can be understood as gated pores with at least two coupled gates. These gates are not just a convenient cartoon element to illustrate a putative mechanism but map to distinct parts of the transporter protein. Enumerating all distinct gate states naturally includes occluded states in the alternating access picture and also suggests what kind of protein conformations might be observable. By connecting the possible conformational states and ion/substrate bound states in a kinetic model, a unified picture emerges in which symporter, antiporter, and uniporter function are extremes in a continuum of functionality.

Read more
Biomolecules

Generate Novel Molecules With Target Properties Using Conditional Generative Models

Drug discovery using deep learning has attracted a lot of attention of late as it has obvious advantages like higher efficiency, less manual guessing and faster process time. In this paper, we present a novel neural network for generating small molecules similar to the ones in the training set. Our network consists of an encoder made up of bi-GRU layers for converting the input samples to a latent space, predictor for enhancing the capability of encoder made up of 1D-CNN layers and a decoder comprised of uni-GRU layers for reconstructing the samples from the latent space representation. Condition vector in latent space is used for generating molecules with the desired properties. We present the loss functions used for training our network, experimental details and property prediction metrics. Our network outperforms previous methods using Molecular weight, LogP and Quantitative Estimation of Drug-likeness as the evaluation metrics.

Read more
Biomolecules

Generating Tertiary Protein Structures via an Interpretative Variational Autoencoder

Much scientific enquiry across disciplines is founded upon a mechanistic treatment of dynamic systems that ties form to function. A highly visible instance of this is in molecular biology, where an important goal is to determine functionally-relevant forms/structures that a protein molecule employs to interact with molecular partners in the living cell. This goal is typically pursued under the umbrella of stochastic optimization with algorithms that optimize a scoring function. Research repeatedly shows that current scoring function, though steadily improving, correlate weakly with molecular activity. Inspired by recent momentum in generative deep learning, this paper proposes and evaluates an alternative approach to generating functionally-relevant three-dimensional structures of a protein. Though typically deep generative models struggle with highly-structured data, the work presented here circumvents this challenge via graph-generative models. A comprehensive evaluation of several deep architectures shows the promise of generative models in directly revealing the latent space for sampling novel tertiary structures, as well as in highlighting axes/factors that carry structural meaning and open the black box often associated with deep models. The work presented here is a first step towards interpretative, deep generative models becoming viable and informative complementary approaches to protein structure prediction.

Read more
Biomolecules

Generative chemistry: drug discovery with deep learning generative models

The de novo design of molecular structures using deep learning generative models introduces an encouraging solution to drug discovery in the face of the continuously increased cost of new drug development. From the generation of original texts, images, and videos, to the scratching of novel molecular structures, the incredible creativity of deep learning generative models surprised us about the height machine intelligence can achieve. The purpose of this paper is to review the latest advances in generative chemistry which relies on generative modeling to expedite the drug discovery process. This review starts with a brief history of artificial intelligence in drug discovery to outline this emerging paradigm. Commonly used chemical databases, molecular representations, and tools in cheminformatics and machine learning are covered as the infrastructure for the generative chemistry. The detailed discussions on utilizing cutting-edge generative architectures, including recurrent neural network, variational autoencoder, adversarial autoencoder, and generative adversarial network for compound generation are focused. Challenges and future perspectives follow.

Read more

Ready to get started?

Join us today