Network


Latest external collaboration on country level. Dive into details by clicking on the dots.

Hotspot


Dive into the research topics where Ron Caspi is active.

Publication


Featured researches published by Ron Caspi.


Briefings in Bioinformatics | 2010

Pathway Tools version 13.0: integrated software for pathway/genome informatics and systems biology

Peter D. Karp; Suzanne M. Paley; Markus Krummenacker; Mario Latendresse; Joseph M. Dale; Thomas J. Lee; Pallavi Kaipa; Fred Gilham; Aaron Spaulding; Liviu Popescu; Tomer Altman; Ian T. Paulsen; Ingrid M. Keseler; Ron Caspi

Pathway Tools is a production-quality software environment for creating a type of model-organism database called a Pathway/Genome Database (PGDB). A PGDB such as EcoCyc integrates the evolving understanding of the genes, proteins, metabolic network and regulatory network of an organism. This article provides an overview of Pathway Tools capabilities. The software performs multiple computational inferences including prediction of metabolic pathways, prediction of metabolic pathway hole fillers and prediction of operons. It enables interactive editing of PGDBs by DB curators. It supports web publishing of PGDBs, and provides a large number of query and visualization tools. The software also supports comparative analyses of PGDBs, and provides several systems biology analyses of PGDBs including reachability analysis of metabolic networks, and interactive tracing of metabolites through a metabolic network. More than 800 PGDBs have been created using Pathway Tools by scientists around the world, many of which are curated DBs for important model organisms. Those PGDBs can be exchanged using a peer-to-peer DB sharing system called the PGDB Registry.


Plant Physiology | 2010

Creation of a Genome-Wide Metabolic Pathway Database for Populus trichocarpa Using a New Approach for Reconstruction and Curation of Metabolic Pathways for Plants

Peifen Zhang; Kate Dreher; A. Karthikeyan; Anjo Chi; Anuradha Pujar; Ron Caspi; Peter D. Karp; Vanessa Kirkup; Mario Latendresse; Cynthia Lee; Lukas A. Mueller; Robert J. Muller; Seung Y. Rhee

Metabolic networks reconstructed from sequenced genomes or transcriptomes can help visualize and analyze large-scale experimental data, predict metabolic phenotypes, discover enzymes, engineer metabolic pathways, and study metabolic pathway evolution. We developed a general approach for reconstructing metabolic pathway complements of plant genomes. Two new reference databases were created and added to the core of the infrastructure: a comprehensive, all-plant reference pathway database, PlantCyc, and a reference enzyme sequence database, RESD, for annotating metabolic functions of protein sequences. PlantCyc (version 3.0) includes 714 metabolic pathways and 2,619 reactions from over 300 species. RESD (version 1.0) contains 14,187 literature-supported enzyme sequences from across all kingdoms. We used RESD, PlantCyc, and MetaCyc (an all-species reference metabolic pathway database), in conjunction with the pathway prediction software Pathway Tools, to reconstruct a metabolic pathway database, PoplarCyc, from the recently sequenced genome of Populus trichocarpa. PoplarCyc (version 1.0) contains 321 pathways with 1,807 assigned enzymes. Comparing PoplarCyc (version 1.0) with AraCyc (version 6.0, Arabidopsis [Arabidopsis thaliana]) showed comparable numbers of pathways distributed across all domains of metabolism in both databases, except for a higher number of AraCyc pathways in secondary metabolism and a 1.5-fold increase in carbohydrate metabolic enzymes in PoplarCyc. Here, we introduce these new resources and demonstrate the feasibility of using them to identify candidate enzymes for specific pathways and to analyze metabolite profiling data through concrete examples. These resources can be searched by text or BLAST, browsed, and downloaded from our project Web site (http://plantcyc.org).


BMC Bioinformatics | 2013

A systematic comparison of the MetaCyc and KEGG pathway databases

Tomer Altman; Michael Travers; Anamika Kothari; Ron Caspi; Peter D. Karp

BackgroundThe MetaCyc and KEGG projects have developed large metabolic pathway databases that are used for a variety of applications including genome analysis and metabolic engineering. We present a comparison of the compound, reaction, and pathway content of MetaCyc version 16.0 and a KEGG version downloaded on Feb-27-2012 to increase understanding of their relative sizes, their degree of overlap, and their scope. To assess their overlap, we must know the correspondences between compounds, reactions, and pathways in MetaCyc, and those in KEGG. We devoted significant effort to computational and manual matching of these entities, and we evaluated the accuracy of the correspondences.ResultsKEGG contains 179 module pathways versus 1,846 base pathways in MetaCyc; KEGG contains 237 map pathways versus 296 super pathways in MetaCyc. KEGG pathways contain 3.3 times as many reactions on average as do MetaCyc pathways, and the databases employ different conceptualizations of metabolic pathways. KEGG contains 8,692 reactions versus 10,262 for MetaCyc. 6,174 KEGG reactions are components of KEGG pathways versus 6,348 for MetaCyc. KEGG contains 16,586 compounds versus 11,991 for MetaCyc. 6,912 KEGG compounds act as substrates in KEGG reactions versus 8,891 for MetaCyc. MetaCyc contains a broader set of database attributes than does KEGG, such as relationships from a compound to enzymes that it regulates, identification of spontaneous reactions, and the expected taxonomic range of metabolic pathways. MetaCyc contains many pathways not found in KEGG, from plants, fungi, metazoa, and actinobacteria; KEGG contains pathways not found in MetaCyc, for xenobiotic degradation, glycan metabolism, and metabolism of terpenoids and polyketides. MetaCyc contains fewer unbalanced reactions, which facilitates metabolic modeling such as using flux-balance analysis. MetaCyc includes generic reactions that may be instantiated computationally.ConclusionsKEGG contains significantly more compounds than does MetaCyc, whereas MetaCyc contains significantly more reactions and pathways than does KEGG, in particular KEGG modules are quite incomplete. The number of reactions occurring in pathways in the two DBs are quite similar.


Standards in Genomic Sciences | 2011

The Pathway Tools Pathway Prediction Algorithm

Peter D. Karp; Mario Latendresse; Ron Caspi

The PathoLogic component of the Pathway Tools software performs prediction of metabolic pathways in sequenced and annotated genomes. This article provides a detailed presentation of the PathoLogic algorithm. The algorithm consists of two phases. The reactome inference phase infers the reactions catalyzed by the organism from the set of enzymes present in the annotated genome. The pathway inference phase infers the metabolic pathways present in the organism from the reactions catalyzed by the organism. Both phases draw on the MetaCyc database of metabolic reactions and pathways. MetaCyc contains two data fields to support pathway inference: the expected taxonomic range of each pathway, and a list of key reactions for pathways. These fields have significantly increased the predictive accuracy of PathoLogic.


Nucleic Acids Research | 2017

The EcoCyc database: reflecting new knowledge about Escherichia coli K-12

Ingrid M. Keseler; Amanda Mackie; Alberto Santos-Zavaleta; Richard Billington; César Bonavides-Martínez; Ron Caspi; Carol A. Fulcher; Socorro Gama-Castro; Anamika Kothari; Markus Krummenacker; Mario Latendresse; Luis Muñiz-Rascado; Quang Ong; Suzanne M. Paley; Martín Peralta-Gil; Pallavi Subhraveti; David A. Velázquez-Ramírez; Daniel Weaver; Julio Collado-Vides; Ian T. Paulsen; Peter D. Karp

EcoCyc (EcoCyc.org) is a freely accessible, comprehensive database that collects and summarizes experimental data for Escherichia coli K-12, the best-studied bacterial model organism. New experimental discoveries about gene products, their function and regulation, new metabolic pathways, enzymes and cofactors are regularly added to EcoCyc. New SmartTable tools allow users to browse collections of related EcoCyc content. SmartTables can also serve as repositories for user- or curator-generated lists. EcoCyc now supports running and modifying E. coli metabolic models directly on the EcoCyc website.


Archives of Toxicology | 2011

A Survey of Metabolic Databases Emphasizing the MetaCyc Family

Peter D. Karp; Ron Caspi

Thanks to the confluence of genome sequencing and bioinformatics, the number of metabolic databases has expanded from a handful in the mid-1990s to several thousand today. These databases lie within distinct families that have common ancestry and common attributes. The main families are the MetaCyc, KEGG, Reactome, Model SEED, and BiGG families. We survey these database families, as well as important individual metabolic databases, including multiple human metabolic databases. The MetaCyc family is described in particular detail. It contains well over 1,000 databases, including highly curated databases for Escherichia coli, Saccharomyces cerevisiae, Mus musculus, and Arabidopsis thaliana. These databases are available through a number of web sites that offer a range of software tools for querying and visualizing metabolic networks. These web sites also provide multiple tools for analysis of gene expression and metabolomics data, including visualization of those datasets on metabolic network diagrams and over-representation analysis of gene sets and metabolite sets.


Tuberculosis | 2013

Updating and curating metabolic pathways of TB

Richard A. Slayden; Mary Jackson; Jeremy Zucker; Melissa V. Ramirez; Clinton C Dawson; Rebecca Crew; Nicole S. Sampson; Suzanne T. Thomas; Neema Jamshidi; Peter Sisk; Ron Caspi; Dean C. Crick; Michael R. McNeil; Martin S. Pavelka; Michael Niederweis; Axel Siroy; Valentina Dona; Johnjoe McFadden; Helena I. Boshoff; Jocelyne M. Lew

The sequencing of complete genomes has accelerated biomedical research by providing information about the overall coding capacity of bacterial chromosomes. The original TB annotation resulted in putative functional assignment of ∼60% of the genes to specific metabolic functions, however, the other 40% of the encoded ORFs where annotated as conserved hypothetical proteins, hypothetical proteins or encoding proteins of unknown function. The TB research community is now at the beginning of the next phases of post-genomics; namely reannotation and functional characterization by targeted experimentation. Arguably, this is the most significant time for basic microbiology in recent history. To foster basic TB research, the Tuberculosis Community Annotation Project (TBCAP) jamboree exercise began the reannotation effort by providing additional information for previous annotations, and refining and substantiating the functional assignment of ORFs and genes within metabolic pathways. The overall goal of the TBCAP 2012 exercise was to gather and compile various data types and use this information with oversight from the scientific community to provide additional information to support the functional annotations of encoding genes. Another objective of this effort was to standardize the publicly accessible Mycobacterium tuberculosis reference sequence and its annotation. The greatest benefit of functional annotation information of genome sequence is that it fuels TB research for drug discovery, diagnostics, vaccine development and epidemiology.


Current protocols in human genetics | 2007

Using the MetaCyc Pathway Database and the BioCyc Database Collection

Ron Caspi; Peter D. Karp

The MetaCyc database (http://metacyc.org) is a collection of more than a thousand metabolic pathways from a wide variety of organisms, collected from the literature by manual curation. BioCyc contains more than 370 Pathway/Genome Databases, each of which describes the genome and predicted metabolic pathways of a single organism, as computationally predicted from the annotated genomes, using MetaCyc data as a reference. The protocols in this unit introduce the user to the extensive data content available in MetaCyc and BioCyc, and to mechanisms for querying, visualizing, and analyzing these data using the Pathway Tools software. Curr. Protoc. Bioinform. 20:1.17.1‐1.17.51.


Fems Microbiology Letters | 2013

The challenge of constructing, classifying, and representing metabolic pathways

Ron Caspi; Kate Dreher; Peter D. Karp

Scientists, educators, and students benefit from having free and centralized access to the wealth of metabolic information that has been gathered over the decades. Curators of the MetaCyc database work to present this information in an easily understandable pathway-based framework. MetaCyc is used not only as an encyclopedic resource for metabolic information but also as a template for the pathway prediction software that generates pathway/genome databases for thousands of organisms with sequenced genomes (available at www.biocyc.org). Curators need to define pathway boundaries and classify pathways within a broader pathway ontology to maximize the utility of the pathways to both users and the pathway prediction software. These seemingly simple tasks pose several challenges. This review describes these challenges as well as the criteria that need to be considered, and the rules that have been developed by MetaCyc curators as they make decisions regarding the representation and classification of metabolic pathway information in MetaCyc. The functional consequences of these decisions in regard to pathway prediction in new species are also discussed.


Nucleic Acids Research | 2018

The MetaCyc database of metabolic pathways and enzymes

Ron Caspi; Richard Billington; Carol A. Fulcher; Ingrid M. Keseler; Anamika Kothari; Markus Krummenacker; Mario Latendresse; Peter E. Midford; Quang Ong; Wai Kit Ong; Suzanne M. Paley; Pallavi Subhraveti; Peter D. Karp

Abstract MetaCyc (https://MetaCyc.org) is a comprehensive reference database of metabolic pathways and enzymes from all domains of life. It contains more than 2570 pathways derived from >54 000 publications, making it the largest curated collection of metabolic pathways. The data in MetaCyc is strictly evidence-based and richly curated, resulting in an encyclopedic reference tool for metabolism. MetaCyc is also used as a knowledge base for generating thousands of organism-specific Pathway/Genome Databases (PGDBs), which are available in the BioCyc (https://BioCyc.org) and other PGDB collections. This article provides an update on the developments in MetaCyc during the past two years, including the expansion of data and addition of new features.

Collaboration


Dive into the Ron Caspi's collaboration.

Top Co-Authors

Avatar
Top Co-Authors

Avatar
Top Co-Authors

Avatar
Top Co-Authors

Avatar
Top Co-Authors

Avatar
Top Co-Authors

Avatar
Top Co-Authors

Avatar
Top Co-Authors

Avatar
Top Co-Authors

Avatar
Top Co-Authors

Avatar

Kate Dreher

Carnegie Institution for Science

View shared research outputs
Researchain Logo
Decentralizing Knowledge