Jingkai Yu
Wayne State University
Network
Latest external collaboration on country level. Dive into details by clicking on the dots.
Publication
Featured researches published by Jingkai Yu.
Genome Biology | 2007
Jodi R Parrish; Jingkai Yu; Guozhen Liu; Julie A Hines; Jason E. Chan; Bernie A Mangiola; Huamei Zhang; Svetlana Pacifico; Farshad Fotouhi; Victor J. DiRita; Trey Ideker; Phillip C. Andrews; Russell L. Finley
BackgroundData from large-scale protein interaction screens for humans and model eukaryotes have been invaluable for developing systems-level models of biological processes. Despite this value, only a limited amount of interaction data is available for prokaryotes. Here we report the systematic identification of protein interactions for the bacterium Campylobacter jejuni, a food-borne pathogen and a major cause of gastroenteritis worldwide.ResultsUsing high-throughput yeast two-hybrid screens we detected and reproduced 11,687 interactions. The resulting interaction map includes 80% of the predicted C. jejuni NCTC11168 proteins and places a large number of poorly characterized proteins into networks that provide initial clues about their functions. We used the map to identify a number of conserved subnetworks by comparison to protein networks from Escherichia coli and Saccharomyces cerevisiae. We also demonstrate the value of the interactome data for mapping biological pathways by identifying the C. jejuni chemotaxis pathway. Finally, the interaction map also includes a large subnetwork of putative essential genes that may be used to identify potential new antimicrobial drug targets for C. jejuni and related organisms.ConclusionThe C. jejuni protein interaction map is one of the most comprehensive yet determined for a free-living organism and nearly doubles the binary interactions available for the prokaryotic kingdom. This high level of coverage facilitates pathway mapping and function prediction for a large number of C. jejuni proteins as well as orthologous proteins from other organisms. The broad coverage also facilitates cross-species comparisons for the identification of evolutionarily conserved subnetworks of protein interactions.
Nucleic Acids Research | 2011
Thilakam Murali; Svetlana Pacifico; Jingkai Yu; Stephen Guest; George G. Roberts; Russell L. Finley
DroID (http://droidb.org/), the Drosophila Interactions Database, is a comprehensive public resource for Drosophila gene and protein interactions. DroID contains genetic interactions and experimentally detected protein–protein interactions curated from the literature and from external databases, and predicted protein interactions based on experiments in other species. Protein interactions are annotated with experimental details and periodically updated confidence scores. Data in DroID is accessible through user-friendly, intuitive interfaces that allow simple or advanced searches and graphical visualization of interaction networks. DroID has been expanded to include interaction types that enable more complete analyses of the genetic networks that underlie biological processes. In addition to protein–protein and genetic interactions, the database now includes transcription factor–gene and regulatory RNA–gene interactions. In addition, DroID now has more gene expression data that can be used to search and filter interaction networks. Orthologous gene mappings of Drosophila genes to other organisms are also available to facilitate finding interactions based on gene names and identifiers for a number of common model organisms and humans. Improvements have been made to the web and graphical interfaces to help biologists gain a comprehensive view of the interaction networks relevant to the genes and systems that they study.
BMC Genomics | 2008
Jingkai Yu; Svetlana Pacifico; Guozhen Liu; Russell L. Finley
BackgroundCharting the interactions among genes and among their protein products is essential for understanding biological systems. A flood of interaction data is emerging from high throughput technologies, computational approaches, and literature mining methods. Quick and efficient access to this data has become a critical issue for biologists. Several excellent multi-organism databases for gene and protein interactions are available, yet most of these have understandable difficulty maintaining comprehensive information for any one organism. No single database, for example, includes all available interactions, integrated gene expression data, and comprehensive and searchable gene information for the important model organism, Drosophila melanogaster.DescriptionDroID, the Drosophila Interactions Database, is a comprehensive interactions database designed specifically for Drosophila. DroID houses published physical protein interactions, genetic interactions, and computationally predicted interactions, including interologs based on data for other model organisms and humans. All interactions are annotated with original experimental data and source information. DroID can be searched and filtered based on interaction information or a comprehensive set of gene attributes from Flybase. DroID also contains gene expression and expression correlation data that can be searched and used to filter datasets, for example, to focus a study on sub-networks of co-expressed genes. To address the inherent noise in interaction data, DroID employs an updatable confidence scoring system that assigns a score to each physical interaction based on the likelihood that it represents a biologically significant link.ConclusionDroID is the most comprehensive interactions database available for Drosophila. To facilitate downstream analyses, interactions are annotated with original experimental information, gene expression data, and confidence scores. All data in DroID are freely available and can be searched, explored, and downloaded through three different interfaces, including a text based web site, a Java applet with dynamic graphing capabilities (IM Browser), and a Cytoscape plug-in. DroID is available at http://www.droidb.org.
Nature Methods | 2009
Ariel S. Schwartz; Jingkai Yu; Kyle R. Gardenour; Russell L. Finley; Trey Ideker
Comprehensive protein-interaction mapping projects are underway for many model species and humans. A key step in these projects is estimating the time, cost and personnel required for obtaining an accurate and complete map. Here we modeled the cost of interaction-map completion for various experimental designs. We showed that current efforts may require up to 20 independent tests covering each protein pair to approach completion. We explored designs for reducing this cost substantially, including prioritization of protein pairs, probability thresholding and interaction prediction. The best experimental designs lowered cost by fourfold overall and >100-fold in early stages of mapping. We demonstrate the best strategy in an ongoing project in Drosophila melanogaster, in which we mapped 450 high-confidence interactions using 47 microtiter plates, versus thousands of plates expected using current designs. This study provides a framework for assessing the feasibility of interaction mapping projects and for future efforts to increase their efficiency.
Bioinformatics | 2009
Jingkai Yu; Russell L. Finley
Motivation: High-throughput experimental and computational methods are generating a wealth of protein–protein interaction data for a variety of organisms. However, data produced by current state-of-the-art methods include many false positives, which can hinder the analyses needed to derive biological insights. One way to address this problem is to assign confidence scores that reflect the reliability and biological significance of each interaction. Most previously described scoring methods use a set of likely true positives to train a model to score all interactions in a dataset. A single positive training set, however, may be biased and not representative of true interaction space. Results: We demonstrate a method to score protein interactions by utilizing multiple independent sets of training positives to reduce the potential bias inherent in using a single training set. We used a set of benchmark yeast protein interactions to show that our approach outperforms other scoring methods. Our approach can also score interactions across data types, which makes it more widely applicable than many previously proposed methods. We applied the method to protein interaction data from both Drosophila melanogaster and Homo sapiens. Independent evaluations show that the resulting confidence scores accurately reflect the biological significance of the interactions. Contact: [email protected] Supplementary information: Supplementary data are available at Bioinformatics Online.
Journal of Medical Systems | 2006
Jingkai Yu; Farshad Fotouhi
Discovery of the protein interactions that take place within a cell can provide a starting point for understanding biological regulatory pathways. Global interaction patterns among proteins, for example, can suggest new drug targets and aid the design of new drugs by providing a clearer picture of the biological pathways in the neighborhoods of the drug targets. High-throughput experimental screens have been developed to detect protein–protein interactions, however, they show high rates of errors in terms of false positives and false negatives. Many computational approaches have been proposed to tackle the problem of protein–protein interaction prediction. They range from comparative genomics based methods to data integration based approaches. Challenging properties of protein–protein interaction data have to be addressed appropriately before a higher quality interaction map with better coverage can be achieved. This paper presents a survey of major works in computational prediction of protein–protein interactions, explaining their assumptions, main ideas, and limitations.
PLOS ONE | 2013
Qimeng Mu; Tao Hu; Jingkai Yu
PEGylation is a successful approach to improve potency of a therapeutic protein. The improved therapeutic potency is mainly due to the steric shielding effect of PEG. However, the underlying mechanism of this effect on the protein is not well understood, especially on the protein interaction with its high molecular weight substrate or receptor. Here, experimental study and molecular dynamics simulation were used to provide molecular insight into the interaction between the PEGylated protein and its receptor. Staphylokinase (Sak), a therapeutic protein for coronary thrombolysis, was used as a model protein. Four PEGylated Saks were prepared by site-specific conjugation of 5 kDa/20 kDa PEG to N-terminus and C-terminus of Sak, respectively. Experimental study suggests that the native conformation of Sak is essentially not altered by PEGylation. In contrast, the bioactivity, the hydrodynamic volume and the molecular symmetric shape of the PEGylated Sak are altered and dependent on the PEG chain length and the PEGylation site. Molecular modeling of the PEGylated Saks suggests that the PEG chain remains highly flexible and can form a distinctive hydrated layer, thereby resulting in the steric shielding effect of PEG. Docking analyses indicate that the binding affinity of Sak to its receptor is dependent on the PEG chain length and the PEGylation site. Computational simulation results explain experimental data well. Our present study clarifies molecular details of PEG chain on protein surface and may be essential to the rational design, fabrication and clinical application of PEGylated proteins.
Biomacromolecules | 2013
Xiaoying Xue; Dongxia Li; Jingkai Yu; Guanghui Ma; Zhiguo Su; Tao Hu
PEGylation can improve the protein efficacy by prolonging serum half-life and reducing proteolytic sensitivity and immunogenicity. However, PEGylation may decrease the bioactivity of a protein by interfering with binding of its substrate or receptors. Here, staphylokinase (SAK), a thrombolysis agent for therapy of myocardial infarction, was mono-PEGylated at the C-terminus of SAK far from its bioactive domain. Phenyl, propyl, and amyl moieties were used as linkers between SAK and polyethylene glycol (PEG), respectively. Flexible propyl and amyl linkers lead to loose conformation. In contrast, rigid and hydrophobic phenyl linker induces dense PEG conformation that can extensively shield most domains adjacent to C-terminus (e.g., the antigen epitopes and proteolytic sites) of SAK and inefficiently shield its bioactive domain. As compared with loose PEG conformation, dense PEG conformation is more efficient to maintain the bioactivity, increase the plasma half-life, and decrease the proteolytic sensitivity and immunogenicity of the PEGylated SAK.
ACS Chemical Biology | 2015
Fen Wang; Yanjie Wang; Junjie Ji; Zhan Zhou; Jingkai Yu; Hua Zhu; Zhiguo Su; Lixin Zhang; Jianting Zheng
The loading acyltransferase (AT) domains of modular polyketide synthases (PKSs) control the choice of starter units incorporated into polyketides and are therefore attractive targets for the engineering of modular PKSs. Here, we report the structural and biochemical characterizations of the loading AT from avermectin modular PKS, which accepts more than 40 carboxylic acids as alternative starter units for the biosynthesis of a series of congeners. This first structural analysis of loading ATs from modular PKSs revealed the molecular basis for the relaxed substrate specificity. Residues important for substrate binding and discrimination were predicted by modeling a substrate into the active site. A mutant with altered specificity toward a panel of synthetic substrate mimics was generated by site-directed mutagenesis of the active site residues. The hydrolysis of the N-acetylcysteamine thioesters of racemic 2-methylbutyric acid confirmed the stereospecificity of the avermectin loading AT for an S configuration at the C-2 position of the substrate. Together, these results set the stage for region-specific modification of polyketides through active site engineering of loading AT domains of modular PKSs.
Methods of Molecular Biology | 2012
Jingkai Yu; Thilakam Murali; Russell L. Finley
Screens for protein-protein interactions using assays like the yeast two-hybrid system have generated volumes of useful data. The protein interactions from these screens have been used to develop a better understanding of the functions of individual proteins, regulatory pathways, molecular machines, and entire biological systems. The value of this data, however, is limited by the inherent frequency of false positives that arise in most protein interaction screens. Appreciable numbers of false positives can crop up in both low-throughput and high-throughput screens, and even in screens that employ stringent criteria for defining a positive. A number of classification systems have been used to help distinguish false positives from biologically relevant true positives. This chapter describes a system for assigning a confidence score to each interaction based on the probability that it is a true positive. Such confidence scores can be used to prioritize interactions for validation. The scores are also useful for network analysis methods that take advantage of probabilistic edge weights. The scoring method does not rely on gold standard datasets of reliable true positives and true negatives, and thus circumvents the challenges associated with obtaining such datasets. Moreover, the scoring method uses data features that are largely assay-independent, making it useful for interactions obtained from a variety of different technologies and screening methods.