Network


Latest external collaboration on country level. Dive into details by clicking on the dots.

Hotspot


Dive into the research topics where John Wooley is active.

Publication


Featured researches published by John Wooley.


PLOS Computational Biology | 2010

A Primer on Metagenomics

John Wooley; Adam Godzik; Iddo Friedberg

Metagenomics is a discipline that enables the genomic study of uncultured microorganisms. Faster, cheaper sequencing technologies and the ability to sequence uncultured microbes sampled directly from their habitats are expanding and transforming our view of the microbial world. Distilling meaningful information from the millions of new genomic sequences presents a serious challenge to bioinformaticians. In cultured microbes, the genomic data come from a single clone, making sequence assembly and annotation tractable. In metagenomics, the data come from heterogeneous microbial communities, sometimes containing more than 10,000 species, with the sequence data being noisy and partial. From sampling, to assembly, to gene calling and function prediction, bioinformatics faces new demands in interpreting voluminous, noisy, and often partial sequence data. Although metagenomics is a relative newcomer to science, the past few years have seen an explosion in computational methods applied to metagenomic-based research. It is therefore not within the scope of this article to provide an exhaustive review. Rather, we provide here a concise yet comprehensive introduction to the current computational requirements presented by metagenomics, and review the recent progress made. We also note whether there is software that implements any of the methods presented here, and briefly review its utility. Nevertheless, it would be useful if readers of this article would avail themselves of the comment section provided by this journal, and relate their own experiences. Finally, the last section of this article provides a few representative studies illustrating different facets of recent scientific discoveries made using metagenomics.


Proceedings of the National Academy of Sciences of the United States of America | 2002

Structural Genomics of the Thermotoga maritima Proteome Implemented in a High-throughput Structure Determination Pipeline

Scott A. Lesley; Peter Kuhn; Adam Godzik; Ashley M. Deacon; Irimpan I. Mathews; Andreas Kreusch; Glen Spraggon; Heath E. Klock; Daniel McMullan; Tanya Shin; Juli Vincent; Alyssa Robb; Linda S. Brinen; Mitchell D. Miller; Timothy M. McPhillips; Mark A. Miller; Daniel Scheibe; Jaume M. Canaves; Chittibabu Guda; Lukasz Jaroszewski; Thomas L. Selby; Marc André Elsliger; John Wooley; Susan S. Taylor; Keith O. Hodgson; Ian A. Wilson; Peter G. Schultz; Raymond C. Stevens

Structural genomics is emerging as a principal approach to define protein structure–function relationships. To apply this approach on a genomic scale, novel methods and technologies must be developed to determine large numbers of structures. We describe the design and implementation of a high-throughput structural genomics pipeline and its application to the proteome of the thermophilic bacterium Thermotoga maritima. By using this pipeline, we successfully cloned and attempted expression of 1,376 of the predicted 1,877 genes (73%) and have identified crystallization conditions for 432 proteins, comprising 23% of the T. maritima proteome. Representative structures from TM0423 glycerol dehydrogenase and TM0449 thymidylate synthase-complementing protein are presented as examples of final outputs from the pipeline.


Nucleic Acids Research | 2011

Community cyberinfrastructure for Advanced Microbial Ecology Research and Analysis: the CAMERA resource

Shulei Sun; Jing Chen; Weizhong Li; Ilkay Altintas; Abel W. Lin; Steven T. Peltier; Karen I. Stocks; Eric E. Allen; Mark H. Ellisman; Jeffrey S. Grethe; John Wooley

The Community Cyberinfrastructure for Advanced Microbial Ecology Research and Analysis (CAMERA, http://camera.calit2.net/) is a database and associated computational infrastructure that provides a single system for depositing, locating, analyzing, visualizing and sharing data about microbial biology through an advanced web-based analysis portal. CAMERA collects and links metadata relevant to environmental metagenome data sets with annotation in a semantically-aware environment allowing users to write expressive semantic queries against the database. To meet the needs of the research community, users are able to query metadata categories such as habitat, sample type, time, location and other environmental physicochemical parameters. CAMERA is compliant with the standards promulgated by the Genomic Standards Consortium (GSC), and sustains a role within the GSC in extending standards for content and format of the metagenomic data and metadata and its submission to the CAMERA repository. To ensure wide, ready access to data and annotation, CAMERA also provides data submission tools to allow researchers to share and forward data to other metagenomics sites and community data archives such as GenBank. It has multiple interfaces for easy submission of large or complex data sets, and supports pre-registration of samples for sequencing. CAMERA integrates a growing list of tools and viewers for querying, analyzing, annotating and comparing metagenome and genome data.


Science | 2009

Three-Dimensional Structural View of the Central Metabolic Network of Thermotoga maritima

Ying Zhang; Ines Thiele; Dana Weekes; Zhanwen Li; Lukasz Jaroszewski; Krzysztof Ginalski; Ashley M. Deacon; John Wooley; Scott A. Lesley; Ian A. Wilson; Bernhard O. Palsson; Andrei L. Osterman; Adam Godzik

Now Shown in 3D With the advent of systems-wide technologies and the development of analytical methods, data produced by analyzing individual or small groups of molecular components can be integrated to reassemble whole biological systems. Zhang et al. (p. 1544) have undertaken a major technological challenge: to integrate biochemical data with experimentally determined or predicted three-dimensional structures of all proteins involved in the central metabolism of a bacterial cell. This integration of large-scale data sets provides evolutionary and functional insights and furthers our understanding of the molecular assembly of complex biological networks. Protein structure and biochemical data generate a three-dimensional view of the metabolic network of a bacterial cell. Metabolic pathways have traditionally been described in terms of biochemical reactions and metabolites. With the use of structural genomics and systems biology, we generated a three-dimensional reconstruction of the central metabolic network of the bacterium Thermotoga maritima. The network encompassed 478 proteins, of which 120 were determined by experiment and 358 were modeled. Structural analysis revealed that proteins forming the network are dominated by a small number (only 182) of basic shapes (folds) performing diverse but mostly related functions. Most of these folds are already present in the essential core (~30%) of the network, and its expansion by nonessential proteins is achieved with relatively few additional folds. Thus, integration of structural data with networks analysis generates insight into the function, mechanism, and evolution of biological networks.


PLOS Biology | 2009

Exploration of Uncharted Regions of the Protein Universe

Lukasz Jaroszewski; Zhanwen Li; S. Sri Krishna; Constantina Bakolitsa; John Wooley; Ashley M. Deacon; Ian A. Wilson; Adam Godzik

Determination of first protein structures, from hundreds of families of unknown function, have shown that divergence, rather than novelty, is the dominant force that shapes the evolution of the protein universe.


Acta Crystallographica Section F-structural Biology and Crystallization Communications | 2010

The JCSG High-throughput Structural Biology Pipeline

Marc-André Elsliger; Ashley M. Deacon; Adam Godzik; Scott A. Lesley; John Wooley; Kurt Wüthrich; Ian A. Wilson

The Joint Center for Structural Genomics high-throughput structural biology pipeline has delivered more than 1000 structures to the community over the past ten years and has made a significant contribution to the overall goal of the NIH Protein Structure Initiative (PSI) of expanding structural coverage of the protein universe.


PLOS ONE | 2011

CARD8 and NLRP1 Undergo Autoproteolytic Processing through a ZU5-Like Domain

Andrea D'Osualdo; Christian X. Weichenberger; Roland N. Wagner; Adam Godzik; John Wooley; John C. Reed

The “Function to Find Domain” (FIIND)-containing proteins CARD8 (Cardinal; Tucan) and NLRP1 (NALP1; NAC) are well known components of inflammasomes, multiprotein complexes responsible for activation of caspase-1, a regulator of inflammation and innate immunity. Although identified many years ago, the role of the FIIND is unknown. Here, we report that CARD8 and NLRP1 undergo autoproteolytic cleavage at a conserved SF/S motif within the FIIND. Using bioinformatics and computational modeling approaches, we detected striking structural similarity between the FIIND and the ZU5-UPA domain present in the autoproteolytic protein PIDD. This allowed us to generate a three-dimensional model and to gain insights in the molecular mechanism of the cleavage. Site-directed mutagenesis experiments revealed that the second serine of the SF/S motif is required for CARD8 and NLRP1 autoproteolysis. Furthermore, we discovered an important function for conserved glutamic acid and histidine residues, located in proximity of the cleavage site in regulating the autoprocessing efficiency. Altogether, these results identify a function for the FIIND and show that CARD8 and NLRP1 are ZU5-UPA domain-containing autoproteolytic proteins, thus suggesting a novel mechanism for regulating innate immune responses.


PLOS ONE | 2014

Semantics in Support of Biodiversity Knowledge Discovery: An Introduction to the Biological Collections Ontology and Related Ontologies

Ramona L. Walls; John Deck; Robert P. Guralnick; Steve Baskauf; Reed S. Beaman; Stanley Blum; Shawn Bowers; Pier Luigi Buttigieg; Neil Davies; Dag Terje Filip Endresen; Maria A. Gandolfo; Robert Hanner; Alyssa Janning; Leonard Krishtalka; Andréa M. Matsunaga; Peter E. Midford; Norman Morrison; Éamonn Ó Tuama; Mark Schildhauer; Barry Smith; Brian J. Stucky; Andrea K. Thomer; John Wieczorek; Jamie Whitacre; John Wooley

The study of biodiversity spans many disciplines and includes data pertaining to species distributions and abundances, genetic sequences, trait measurements, and ecological niches, complemented by information on collection and measurement protocols. A review of the current landscape of metadata standards and ontologies in biodiversity science suggests that existing standards such as the Darwin Core terminology are inadequate for describing biodiversity data in a semantically meaningful and computationally useful way. Existing ontologies, such as the Gene Ontology and others in the Open Biological and Biomedical Ontologies (OBO) Foundry library, provide a semantic structure but lack many of the necessary terms to describe biodiversity data in all its dimensions. In this paper, we describe the motivation for and ongoing development of a new Biological Collections Ontology, the Environment Ontology, and the Population and Community Ontology. These ontologies share the aim of improving data aggregation and integration across the biodiversity domain and can be used to describe physical samples and sampling processes (for example, collection, extraction, and preservation techniques), as well as biodiversity observations that involve no physical sampling. Together they encompass studies of: 1) individual organisms, including voucher specimens from ecological studies and museum specimens, 2) bulk or environmental samples (e.g., gut contents, soil, water) that include DNA, other molecules, and potentially many organisms, especially microbes, and 3) survey-based ecological observations. We discuss how these ontologies can be applied to biodiversity use cases that span genetic, organismal, and ecosystem levels of organization. We argue that if adopted as a standard and rigorously applied and enriched by the biodiversity community, these ontologies would significantly reduce barriers to data discovery, integration, and exchange among biodiversity resources and researchers.


Proteins | 2007

Crystal structures of two novel dye-decolorizing peroxidases reveal a beta-barrel fold with a conserved heme-binding motif.

Chloe Zubieta; S. Sri Krishna; Mili Kapoor; Piotr Kozbial; Daniel McMullan; Herbert L. Axelrod; Mitchell D. Miller; Polat Abdubek; Eileen Ambing; Tamara Astakhova; Dennis Carlton; Hsiu-Ju Chiu; Thomas Clayton; Marc C. Deller; Lian Duan; Marc-André Elsliger; Julie Feuerhelm; Slawomir K. Grzechnik; Joanna Hale; Eric Hampton; Gye Won Han; Lukasz Jaroszewski; Kevin K. Jin; Heath E. Klock; Mark W. Knuth; Abhinav Kumar; David Marciano; Andrew T. Morse; Edward Nigoghossian; Linda Okach

BtDyP from Bacteroides thetaiotaomicron (strain VPI‐5482) and TyrA from Shewanella oneidensis are dye‐decolorizing peroxidases (DyPs), members of a new family of heme‐dependent peroxidases recently identified in fungi and bacteria. Here, we report the crystal structures of BtDyP and TyrA at 1.6 and 2.7 Å, respectively. BtDyP assembles into a hexamer, while TyrA assembles into a dimer; the dimerization interface is conserved between the two proteins. Each monomer exhibits a two‐domain, α+β ferredoxin‐like fold. A site for heme binding was identified computationally, and modeling of a heme into the proposed active site allowed for identification of residues likely to be functionally important. Structural and sequence comparisons with other DyPs demonstrate a conservation of putative heme‐binding residues, including an absolutely conserved histidine. Isothermal titration calorimetry experiments confirm heme binding, but with a stoichiometry of 0.3:1 (heme:protein). Proteins 2007.


Proteins | 2002

Crystal structure of thy1, a thymidylate synthase complementing protein from Thermotoga maritima at 2.25 Å resolution

Peter Kuhn; Scott A. Lesley; Irimpan I. Mathews; Jaume M. Canaves; Linda S. Brinen; Xiaoping Dai; Ashley M. Deacon; Marc André Elsliger; Said Eshaghi; Ross Floyd; Adam Godzik; Carina Grittini; Slawomir K. Grzechnik; Chittibabu Guda; Keith O. Hodgson; Lukasz Jaroszewski; Cathy Karlak; Heath E. Klock; Eric Koesema; John M. Kovarik; Andreas Kreusch; Daniel McMullan; Timothy M. McPhillips; Mark A. Miller; Mitchell D. Miller; Andrew T. Morse; Kin Moy; Jie Ouyang; Alyssa Robb; Kevin Rodrigues

Peter Kuhn, Scott A. Lesley, Irimpan I. Mathews, Jaume M. Canaves, Linda S. Brinen, Xiaoping Dai, Ashley M. Deacon, Marc A. Elsliger, Said Eshaghi, Ross Floyd, Adam Godzik, Carina Grittini, Slawomir K. Grzechnik, Chittibabu Guda, Keith O. Hodgson, Lukasz Jaroszewski, Cathy Karlak, Heath E. Klock, Eric Koesema, John M. Kovarik, Andreas T. Kreusch, Daniel McMullan, Timothy M. McPhillips, Mark A. Miller, Mitchell Miller, Andrew Morse, Kin Moy, Jie Ouyang, Alyssa Robb, Kevin Rodrigues, Thomas L. Selby, Glen Spraggon, Raymond C. Stevens, Susan S. Taylor, Henry van den Bedem, Jeff Velasquez, Juli Vincent, Xianhong Wang, Bill West, Guenter Wolf, John Wooley, and Ian A. Wilson* The Joint Center for Structural Genomics Stanford Synchrotron Radiation Laboratory, Stanford University, Menlo Park, California The Genomics Institute of Novartis Foundation, San Diego, California The San Diego Supercomputer Center, La Jolla, California The University of California, San Diego, La Jolla, California The Scripps Research Institute, La Jolla, California

Collaboration


Dive into the John Wooley's collaboration.

Top Co-Authors

Avatar

Heath E. Klock

Genomics Institute of the Novartis Research Foundation

View shared research outputs
Top Co-Authors

Avatar

Daniel McMullan

Genomics Institute of the Novartis Research Foundation

View shared research outputs
Top Co-Authors

Avatar

Mitchell D. Miller

SLAC National Accelerator Laboratory

View shared research outputs
Top Co-Authors

Avatar

Polat Abdubek

Genomics Institute of the Novartis Research Foundation

View shared research outputs
Top Co-Authors

Avatar
Top Co-Authors

Avatar
Top Co-Authors

Avatar

Gye Won Han

University of Southern California

View shared research outputs
Top Co-Authors

Avatar

Hsiu-Ju Chiu

SLAC National Accelerator Laboratory

View shared research outputs
Top Co-Authors

Avatar

S. Sri Krishna

University of California

View shared research outputs
Top Co-Authors

Avatar

Herbert L. Axelrod

SLAC National Accelerator Laboratory

View shared research outputs
Researchain Logo
Decentralizing Knowledge