John L. Markley
University of Wisconsin-Madison
Network
Latest external collaboration on country level. Dive into details by clicking on the dots.
Publication
Featured researches published by John L. Markley.
Proteins | 2005
Wim F. Vranken; Wayne Boucher; Tim J. Stevens; Rasmus H. Fogh; Anne Pajon; Miguel Llinás; Eldon L. Ulrich; John L. Markley; John Ionides; Ernest D. Laue
To address data management and data exchange problems in the nuclear magnetic resonance (NMR) community, the Collaborative Computing Project for the NMR community (CCPN) created a “Data Model” that describes all the different types of information needed in an NMR structural study, from molecular structure and NMR parameters to coordinates. This paper describes the development of a set of software applications that use the Data Model and its associated libraries, thus validating the approach. These applications are freely available and provide a pipeline for high‐throughput analysis of NMR data. Three programs work directly with the Data Model: CcpNmr Analysis, an entirely new analysis and interactive display program, the CcpNmr FormatConverter, which allows transfer of data from programs commonly used in NMR to and from the Data Model, and the CLOUDS software for automated structure calculation and assignment (Carnegie Mellon University), which was rewritten to interact directly with the Data Model. The ARIA 2.0 software for structure calculation (Institut Pasteur) and the QUEEN program for validation of restraints (University of Nijmegen) were extended to provide conversion of their data to the Data Model. During these developments the Data Model has been thoroughly tested and used, demonstrating that applications can successfully exchange data via the Data Model. The software architecture developed by CCPN is now ready for new developments, such as integration with additional software applications and extensions of the Data Model into other areas of research. Proteins 2005.
Journal of Biomolecular NMR | 1995
David S. Wishart; Colin G. Bigam; Jian Yao; Frits Abildgaard; H. Jane Dyson; Eric Oldfield; John L. Markley; Brian D. Sykes
SummaryA considerable degree of variability exists in the way that 1H, 13C and 15N chemical shifts are reported and referenced for biomolecules. In this article we explore some of the reasons for this situation and propose guidelines for future chemical shift referencing and for conversion from many common 1H, 13C and 15N chemical shift standards, now used in biomolecular NMR, to those proposed here.
Nucleic Acids Research | 2007
Helen M. Berman; Kim Henrick; Haruki Nakamura; John L. Markley
The worldwide Protein Data Bank (wwPDB) is the international collaboration that manages the deposition, processing and distribution of the PDB archive. The online PDB archive is a repository for the coordinates and related information for more than 38 000 structures, including proteins, nucleic acids and large macromolecular complexes that have been determined using X-ray crystallography, NMR and electron microscopy techniques. The founding members of the wwPDB are RCSB PDB (USA), MSD-EBI (Europe) and PDBj (Japan) [H.M. Berman, K. Henrick and H. Nakamura (2003) Nature Struct. Biol., 10, 980]. The BMRB group (USA) joined the wwPDB in 2006. The mission of the wwPDB is to maintain a single archive of macromolecular structural data that are freely and publicly available to the global community. Additionally, the wwPDB provides a variety of services to a broad community of users. The wwPDB website at provides information about services provided by the individual member organizations and about projects undertaken by the wwPDB.
Journal of Biomolecular NMR | 1991
Beverly R. Seavey; Elizabeth A. Farr; William M. Westler; John L. Markley
SummaryA protein NMR database has been designed and is being implemented. The database is intended to contain solution NMR results from proteins and peptides (larger than 12 residues). A relational database format has been chosen that indexes data by: primary journal citation, molecular species, sequence-related and atom-specific assignments, and experimental conditions. At present, all data are entered from the primary refereed literature. Examples are given of sample queries to the database. Possible distribution formats are discussed.
Proteins | 2005
Aart J. Nederveen; Jurgen F. Doreleijers; Wim F. Vranken; Zachary Miller; Chris A. E. M. Spronk; Sander B. Nabuurs; Peter Güntert; Miron Livny; John L. Markley; Michael Nilges; Eldon L. Ulrich; Robert Kaptein; Alexandre M. J. J. Bonvin
State‐of‐the‐art methods based on CNS and CYANA were used to recalculate the nuclear magnetic resonance (NMR) solution structures of 500+ proteins for which coordinates and NMR restraints are available from the Protein Data Bank. Curated restraints were obtained from the BioMagResBank FRED database. Although the original NMR structures were determined by various methods, they all were recalculated by CNS and CYANA and refined subsequently by restrained molecular dynamics (CNS) in a hydrated environment. We present an extensive analysis of the results, in terms of various quality indicators generated by PROCHECK and WHAT_CHECK. On average, the quality indicators for packing and Ramachandran appearance moved one standard deviation closer to the mean of the reference database. The structural quality of the recalculated structures is discussed in relation to various parameters, including number of restraints per residue, NOE completeness and positional root mean square deviation (RMSD). Correlations between pairs of these quality indicators were generally low; for example, there is a weak correlation between the number of restraints per residue and the Ramachandran appearance according to WHAT_CHECK (r = 0.31). The set of recalculated coordinates constitutes a unified database of protein structures in which potential user‐ and software‐dependent biases have been kept as small as possible. The database can be used by the structural biology community for further development of calculation protocols, validation tools, structure‐based statistical approaches and modeling. The RECOORD database of recalculated structures is publicly available from http://www.ebi.ac.uk/msd/recoord. Proteins 2005.
FEBS Journal | 1998
John L. Markley; Ad Bax; Yoji Arata; C.W. Hilbers; Robert Kaptein; Brian D. Sykes; Peter E. Wright; Kurt Wüthrich
The recommendations presented here are designed to support easier communication of NMR data and NMR structures of proteins and nucleic acids through unified nomenclature and reporting standards. Much of this document pertains to the reporting of data in journal articles; however, in the interest of the future development of structural biology, it is desirable that the bulk of the reported information be stored in computer-accessible form and be freely accessible to the scientific community in standardized formats for data exchange. These recommendations stem from an IUPAC-IUBMB-IUPAB inter-union venture with the direct involvement of ICSU and CODATA. The Task Group has reviewed previous formal recommendations and has extended them in the light of more recent developments in the field of biomolecular NMR spectroscopy. Drafts of the recommendations presented here have been examined critically by more than 50 specialists in the field and have gone through two rounds of extensive modification to incorporate suggestions and criticisms.
Bioinformatics | 2015
Woonghee Lee; Marco Tonelli; John L. Markley
Summary: SPARKY (Goddard and Kneller, SPARKY 3) remains the most popular software program for NMR data analysis, despite the fact that development of the package by its originators ceased in 2001. We have taken over the development of this package and describe NMRFAM-SPARKY, which implements new functions reflecting advances in the biomolecular NMR field. NMRFAM-SPARKY has been repackaged with current versions of Python and Tcl/Tk, which support new tools for NMR peak simulation and graphical assignment determination. These tools, along with chemical shift predictions from the PACSY database, greatly accelerate protein side chain assignments. NMRFAM-SPARKY supports automated data format interconversion for interfacing with a variety of web servers including, PECAN , PINE, TALOS-N, CS-Rosetta, SHIFTX2 and PONDEROSA-C/S. Availability and implementation: The software package, along with binary and source codes, if desired, can be downloaded freely from http://pine.nmrfam.wisc.edu/download_packages.html. Instruction manuals and video tutorials can be found at http://www.nmrfam.wisc.edu/nmrfam-sparky-distribution.htm. Contact: [email protected] or [email protected] Supplementary information: Supplementary data are available at Bioinformatics online.
PLOS Computational Biology | 2009
Arash Bahrami; Amir H. Assadi; John L. Markley; Hamid R. Eghbalnia
The process of assigning a finite set of tags or labels to a collection of observations, subject to side conditions, is notable for its computational complexity. This labeling paradigm is of theoretical and practical relevance to a wide range of biological applications, including the analysis of data from DNA microarrays, metabolomics experiments, and biomolecular nuclear magnetic resonance (NMR) spectroscopy. We present a novel algorithm, called Probabilistic Interaction Network of Evidence (PINE), that achieves robust, unsupervised probabilistic labeling of data. The computational core of PINE uses estimates of evidence derived from empirical distributions of previously observed data, along with consistency measures, to drive a fictitious system M with Hamiltonian H to a quasi-stationary state that produces probabilistic label assignments for relevant subsets of the data. We demonstrate the successful application of PINE to a key task in protein NMR spectroscopy: that of converting peak lists extracted from various NMR experiments into assignments associated with probabilities for their correctness. This application, called PINE-NMR, is available from a freely accessible computer server (http://pine.nmrfam.wisc.edu). The PINE-NMR server accepts as input the sequence of the protein plus user-specified combinations of data corresponding to an extensive list of NMR experiments; it provides as output a probabilistic assignment of NMR signals (chemical shifts) to sequence-specific backbone and aliphatic side chain atoms plus a probabilistic determination of the protein secondary structure. PINE-NMR can accommodate prior information about assignments or stable isotope labeling schemes. As part of the analysis, PINE-NMR identifies, verifies, and rectifies problems related to chemical shift referencing or erroneous input data. PINE-NMR achieves robust and consistent results that have been shown to be effective in subsequent steps of NMR structure determination.
Nucleic Acids Research | 2007
Kim Henrick; Zukang Feng; Wolfgang F. Bluhm; Dimitris Dimitropoulos; Jurgen F. Doreleijers; Shuchismita Dutta; Judith L. Flippen-Anderson; John Ionides; Chisa Kamada; Eugene Krissinel; Catherine L. Lawson; John L. Markley; Haruki Nakamura; Richard Newman; Yukiko Shimizu; Jawahar Swaminathan; Sameer Velankar; Jeramia Ory; Eldon L. Ulrich; Wim F. Vranken; John D. Westbrook; Reiko Yamashita; Huanwang Yang; Jasmine Young; Muhammed Yousufuddin; Helen M. Berman
The Worldwide Protein Data Bank (wwPDB; wwpdb.org) is the international collaboration that manages the deposition, processing and distribution of the PDB archive. The online PDB archive at ftp://ftp.wwpdb.org is the repository for the coordinates and related information for more than 47 000 structures, including proteins, nucleic acids and large macromolecular complexes that have been determined using X-ray crystallography, NMR and electron microscopy techniques. The members of the wwPDB–RCSB PDB (USA), MSD-EBI (Europe), PDBj (Japan) and BMRB (USA)–have remediated this archive to address inconsistencies that have been introduced over the years. The scope and methods used in this project are presented.
Biochemistry | 1991
Wojciech R. Rypniewski; Deborah R. Breiter; Matthew M. Benning; Gary E. Wesenberg; Byung-Ha Oh; John L. Markley; Ivan Rayment; Hazel M. Holden
The molecular structure of the oxidized form of the [2Fe-2S] ferredoxin isolated from the cyanobacterium Anabaena species strain PCC 7120 has been determined by X-ray diffraction analysis to a nominal resolution of 2.5 A and refined to a crystallographic R factor of 18.7%. Crystals used in this investigation belong to the space group P2(1)2(1)2(1) with unit cell dimensions of a = 37.42 A, b = 38.12 A, and c = 147.12 A and two molecules in the asymmetric unit. The three-dimensional structure of this ferredoxin was solved by a method that combined X-ray data from one isomorphous heavy-atom derivative with noncrystallographic symmetry averaging and solvent flattening. As in other plant-type [2Fe-2S] ferredoxins, the iron-sulfur cluster is located toward the outer edge of the molecule, and the irons are tetrahedrally coordinated by both inorganic sulfurs and sulfurs provided by protein cysteine residues. The main secondary structural elements include four strands of beta-pleated sheet and three alpha-helical regions.