Yunhui Peng
Clemson University
Network
Latest external collaboration on country level. Dive into details by clicking on the dots.
Publication
Featured researches published by Yunhui Peng.
Proteins | 2016
Yunhui Peng; Emil Alexov
Single amino acid variations (SAV) occurring in human population result in natural differences between individuals or cause diseases. It is well understood that the molecular effect of SAV can be manifested as changes of the wild type characteristics of the corresponding protein, among which are the protein stability and protein interactions. Typically the effect of SAV on protein stability and interactions was assessed via the changes of the wild type folding and binding free energies. However, in terms of SAV affecting protein functionally and disease susceptibility, one wants to know to what extend the wild type function is perturbed by the SAV. Here it is demonstrated that relative, rather than the absolute, change of the folding and binding free energy serves as a good indicator for SAV association with disease. Using HumVar as a source for disease‐causing SAV and experimentally determined free energy changes from ProTherm and SKEMPI databases, correlation coefficients (CC) between the disease index (Pd) and relative folding (Ppr,f) and binding (Ppr,b) probability indexes, respectively, was achieved. The obtained CCs demonstrated the applicability of the proposed approach and it served as good indicator for SAV association with disease. Proteins 2016; 84:232–239.
Journal of Biological Chemistry | 2017
Krithika Vaidyanathan; Tejasvi Niranjan; Nithya Selvan; Chin Fen Teo; Melanie May; Sneha Patel; Brent Weatherly; Cindy Skinner; John M. Opitz; John C. Carey; David H. Viskochil; Jozef Gecz; Marie Shaw; Yunhui Peng; Emil Alexov; Tao Wang; Charles E. Schwartz; Lance Wells
O-GlcNAc is a regulatory post-translational modification of nucleocytoplasmic proteins that has been implicated in multiple biological processes, including transcription. In humans, single genes encode enzymes for its attachment (O-GlcNAc transferase (OGT)) and removal (O-GlcNAcase (OGA)). An X-chromosome exome screen identified a missense mutation, which encodes an amino acid in the tetratricopeptide repeat, in OGT (759G>T (p.L254F)) that segregates with X-linked intellectual disability (XLID) in an affected family. A decrease in steady-state OGT protein levels was observed in isolated lymphoblastoid cell lines from affected individuals, consistent with molecular modeling experiments. Recombinant expression of L254F-OGT demonstrated that the enzyme is active as both a glycosyltransferase and an HCF-1 protease. Despite the reduction in OGT levels seen in the L254F-OGT individual cells, we observed that steady-state global O-GlcNAc levels remained grossly unaltered. Surprisingly, lymphoblastoids from affected individuals displayed a marked decrease in steady-state OGA protein and mRNA levels. We observed an enrichment of the OGT-containing transcriptional repressor complex mSin3A-HDAC1 at the proximal promoter region of OGA and correspondingly decreased OGA promoter activity in affected cells. Global transcriptome analysis of L254F-OGT lymphoblastoids compared with controls revealed a small subset of genes that are differentially expressed. Thus, we have begun to unravel the molecular consequences of the 759G>T (p.L254F) mutation in OGT that uncovered a compensation mechanism, albeit imperfect, given the phenotype of affected individuals, to maintain steady-state O-GlcNAc levels. Thus, a single amino acid substitution in the regulatory domain (the tetratricopeptide repeat domain) of OGT, which catalyzes the O-GlcNAc post-translational modification of nuclear and cytosolic proteins, appears causal for XLID.
International Journal of Molecular Sciences | 2016
Yunhui Peng; Joy Norris; Charles E. Schwartz; Emil Alexov
Missense mutations in spermine synthase (SpmSyn) protein have been shown to cause the Snyder-Robinson syndrome (SRS). Depending on the location within the structure of SpmSyn and type of amino acid substitution, different mechanisms resulting in SRS were proposed. Here we focus on naturally occurring amino acid substitutions causing SRS, which are situated away from the active center of SpmSyn and thus are not directly involved in the catalysis. Two of the mutations, M35R and P112L, are reported for the first time in this study. It is demonstrated, both experimentally and computationally, that for such mutations the major effect resulting in dysfunctional SpmSyn is the destabilization of the protein. In vitro experiments indicated either no presence or very little amount of the mutant SpmSyn in patient cells. In silico modeling predicted that all studied mutations in this work destabilize SpmSyn and some of them abolish homo-dimer formation. Since dimerization and structural stability are equally important for the wild type function of SpmSyn, it is proposed that the SRS caused by mutations occurring in the N-domain of SpmSyn is a result of dysfunctional mutant proteins being partially unfolded and degraded by the proteomic machinery of the cell or being unable to form a homo-dimer.
International Journal of Molecular Sciences | 2015
Yunhui Peng; Jimmy Suryadi; Ye Yang; Tugba G. Kucukkal; Weiguo Cao; Emil Alexov
Mutations in KDM5C gene are linked to X-linked mental retardation, the syndromic Claes-Jensen-type disease. This study focuses on non-synonymous mutations in the KDM5C ARID domain and evaluates the effects of two disease-associated missense mutations (A77T and D87G) and three not-yet-classified missense mutations (R108W, N142S, and R179H). We predict the ARID domain’s folding and binding free energy changes due to mutations, and also study the effects of mutations on protein dynamics. Our computational results indicate that A77T and D87G mutants have minimal effect on the KDM5C ARID domain stability and DNA binding. In parallel, the change in the free energy unfolding caused by the mutants A77T and D87G were experimentally measured by urea-induced unfolding experiments and were shown to be similar to the in silico predictions. The evolutionary conservation analysis shows that the disease-associated mutations are located in a highly-conserved part of the ARID structure (N-terminal domain), indicating their importance for the KDM5C function. N-terminal residues’ high conservation suggests that either the ARID domain utilizes the N-terminal to interact with other KDM5C domains or the N-terminal is involved in some yet unknown function. The analysis indicates that, among the non-classified mutations, R108W is possibly a disease-associated mutation, while N142S and R179H are probably harmless.
Proteins | 2017
Yunhui Peng; Emil Alexov
Protein–nucleic acid interactions play a crucial role in many biological processes. This work investigates the changes of pKa values and protonation states of ionizable groups (including nucleic acid bases) that may occur at protein–nucleic acid binding. Taking advantage of the recently developed pKa calculation tool DelphiPka, we utilize the large protein–nucleic acid interaction database (NPIDB database) to model pKa shifts caused by binding. It has been found that the proteins interfacial basic residues experience favorable electrostatic interactions while the protein acidic residues undergo proton uptake to reduce the energy cost upon the binding. This is in contrast with observations made for protein–protein complexes. In terms of DNA/RNA, both base groups and phosphate groups of nucleotides are found to participate in binding. Some DNA/RNA bases undergo pKa shifts at complex formation, with the binding process tending to suppress charged states of nucleic acid bases. In addition, a weak correlation is found between the pH‐optimum of protein–DNA/RNA binding free energy and the pH‐optimum of protein folding free energy. Overall, the pH‐dependence of protein–nucleic acid binding is not predicted to be as significant as that of protein–protein association. Proteins 2017; 85:282–295.
Bioinformatics | 2018
Yunhui Peng; Lexuan Sun; Zhe Jia; Lin Li; Emil Alexov
Motivation Protein‐DNA interactions are essential for regulating many cellular processes, such as transcription, replication, recombination and translation. Amino acid mutations occurring in DNA‐binding proteins have profound effects on protein‐DNA binding and are linked with many diseases. Hence, accurate and fast predictions of the effects of mutations on protein‐DNA binding affinity are essential for understanding disease‐causing mechanisms and guiding plausible treatments. Results Here we report a new method Single Amino acid Mutation binding free energy change of Protein‐DNA Interaction (SAMPDI). The method utilizes modified Molecular Mechanics Poisson‐Boltzmann Surface Area (MM/PBSA) approach along with an additional set of knowledge‐based terms delivered from investigations of the physicochemical properties of protein‐DNA complexes. The method is benchmarked against experimentally determined binding free energy changes caused by 105 mutations in 13 proteins (compiled ProNIT database and data from recent references), and results in correlation coefficient of 0.72. Availability and implementation http://compbio.clemson.edu/SAMPDI Contact [email protected] Supplementary information Supplementary data are available at Bioinformatics online.
Journal of Theoretical and Computational Chemistry | 2018
Zhe Jia; Lin Li; Yunhui Peng; Feng Ding; Emil Alexov
Integrins are cellular adhesion proteins located on cell surface. They are known to have open and closed conformations that correspond to high and low binding affinity to ligands, respectively. Int...
Proteins | 2016
Yunhui Peng; Emil Alexov
The KDM5C gene (also known as JARID1C and SMCX) is located on the X chromosome and encodes a ubiquitously expressed 1560‐aa protein, which plays an important role in lysine methylation (specifically reverses tri‐ and di‐methylation of Lys4 of histone H3). Currently, 13 missense mutations in KDM5C have been linked to X‐linked mental retardation. However, the molecular mechanism of disease is currently unknown due to the experimental difficulties in expressing such large protein and the lack of experimental 3D structure. In this work, we utilize homology modeling, docking, and experimental data to predict 3D structures of KDM5C domains and their mutual arrangement. The resulting quaternary structure includes KDM5C JmjN, ARID, PHD1, JmjC, ZF domains, substrate histone peptide, enzymatic cofactors, and DNA. The predicted quaternary structure was investigated with molecular dynamic simulation for its stability, and further analysis was carried out to identify features measured experimentally. The predicted structure of KDM5C was used to investigate the effects of disease‐causing mutations and it was shown that the mutations alter domain stability and inter‐domain interactions. The structural model reported in this work could prompt experimental investigations of KDM5C domain‐domain interaction and exploration of undiscovered functionalities. Proteins 2016; 84:1797–1809.
International Journal of Molecular Sciences | 2018
Yunhui Peng; Rebecca Myers; Wenxing Zhang; Emil Alexov
Smith-Lemli-Opitz syndrome (SLOS) is a cholesterol synthesis disorder characterized by physical, mental, and behavioral symptoms. It is caused by mutations in 7-dehydroxycholesterolreductase gene (DHCR7) encoding DHCR7 protein, which is the rate-limiting enzyme in the cholesterol synthesis pathway. Here we demonstrate that pathogenic mutations in DHCR7 protein are located either within the transmembrane region or are near the ligand-binding site, and are highly conserved among species. In contrast, non-pathogenic mutations observed in the general population are located outside the transmembrane region and have different effects on the conformational dynamics of DHCR7. All together, these observations suggest that the non-classified mutation R228Q is pathogenic. Our analyses indicate that pathogenic effects may affect protein stability and dynamics and alter the binding affinity and flexibility of the binding site.
Scientific Reports | 2017
Lin Li; Zhe Jia; Yunhui Peng; Subash Godar; Ivan Getov; Shaolei Teng; Joshua Alper; Emil Alexov
The ability to predict if a given mutation is disease-causing or not has enormous potential to impact human health. Typically, these predictions are made by assessing the effects of mutation on macromolecular stability and amino acid conservation. Here we report a novel feature: the electrostatic component of the force acting between a kinesin motor domain and tubulin. We demonstrate that changes in the electrostatic component of the binding force are able to discriminate between disease-causing and non-disease-causing mutations found in human kinesin motor domains using the receiver operating characteristic (ROC). Because diseases may originate from multiple effects not related to kinesin-microtubule binding, the prediction rate of 0.843 area under the ROC plot due to the change in magnitude of the electrostatic force alone is remarkable. These results reflect the dependence of kinesin’s function on motility along the microtubule, which suggests a precise balance of microtubule binding forces is required.