Network


Latest external collaboration on country level. Dive into details by clicking on the dots.

Hotspot


Dive into the research topics where Jacob A. Cram is active.

Publication


Featured researches published by Jacob A. Cram.


The ISME Journal | 2016

Correlation detection strategies in microbial data sets vary widely in sensitivity and precision

Sophie Weiss; Will Van Treuren; Catherine A. Lozupone; Karoline Faust; Jonathan Friedman; Ye Deng; Li Charlie Xia; Zhenjiang Zech Xu; Luke K. Ursell; Eric J. Alm; Amanda Birmingham; Jacob A. Cram; Jed A. Fuhrman; Jeroen Raes; Fengzhu Sun; Jizhong Zhou; Rob Knight

Disruption of healthy microbial communities has been linked to numerous diseases, yet microbial interactions are little understood. This is due in part to the large number of bacteria, and the much larger number of interactions (easily in the millions), making experimental investigation very difficult at best and necessitating the nascent field of computational exploration through microbial correlation networks. We benchmark the performance of eight correlation techniques on simulated and real data in response to challenges specific to microbiome studies: fractional sampling of ribosomal RNA sequences, uneven sampling depths, rare microbes and a high proportion of zero counts. Also tested is the ability to distinguish signals from noise, and detect a range of ecological and time-series relationships. Finally, we provide specific recommendations for correlation technique usage. Although some methods perform better than others, there is still considerable need for improvement in current techniques.


BMC Systems Biology | 2011

Extended local similarity analysis (eLSA) of microbial community and other time series data with replicates

Li Charlie Xia; Joshua A. Steele; Jacob A. Cram; Zoe G. Cardon; Sheri L. Simmons; Joseph J. Vallino; Jed A. Fuhrman; Fengzhu Sun

BackgroundThe increasing availability of time series microbial community data from metagenomics and other molecular biological studies has enabled the analysis of large-scale microbial co-occurrence and association networks. Among the many analytical techniques available, the Local Similarity Analysis (LSA) method is unique in that it captures local and potentially time-delayed co-occurrence and association patterns in time series data that cannot otherwise be identified by ordinary correlation analysis. However LSA, as originally developed, does not consider time series data with replicates, which hinders the full exploitation of available information. With replicates, it is possible to understand the variability of local similarity (LS) score and to obtain its confidence interval.ResultsWe extended our LSA technique to time series data with replicates and termed it extended LSA, or eLSA. Simulations showed the capability of eLSA to capture subinterval and time-delayed associations. We implemented the eLSA technique into an easy-to-use analytic software package. The software pipeline integrates data normalization, statistical correlation calculation, statistical significance evaluation, and association network construction steps. We applied the eLSA technique to microbial community and gene expression datasets, where unique time-dependent associations were identified.ConclusionsThe extended LSA analysis technique was demonstrated to reveal statistically significant local and potentially time-delayed association patterns in replicated time series data beyond that of ordinary correlation analysis. These statistically significant associations can provide insights to the real dynamics of biological systems. The newly designed eLSA software efficiently streamlines the analysis and is freely available from the eLSA homepage, which can be accessed at http://meta.usc.edu/softs/lsa.


The ISME Journal | 2013

Short-term observations of marine bacterial and viral communities: patterns, connections and resilience

David M. Needham; Cheryl-Emiliane T Chow; Jacob A. Cram; Rohan Sachdeva; Alma Parada; Jed A. Fuhrman

Observation of short-term temporal variation in bacterial and viral communities is important for understanding patterns of aquatic microbial diversity. We collected surface seawater once daily for 38 consecutive days with seven more samples interspersed over 40 more days at one location ∼2 km from Santa Catalina Island, California. Bacterial communities were analyzed by automated ribosomal intergenic spacer analysis (ARISA) and viral communities were analyzed by terminal restriction fragment length polymorphism (TRFLP) of the conserved T4-like myoviral gene encoding the major capsid protein (g23). Common bacterial and viral taxa were consistently dominant, and relatively few displayed dramatic increases/decreases or ‘boom/bust’ patterns that might be expected from dynamic predator-prey interactions. Association network analysis showed most significant covariations (associations) occurred among bacterial taxa or among viral taxa and there were several modular (highly-interconnected) associations (P⩽0.005). Associations observed between bacteria and viruses (P⩽0.005) occurred with a median time lag of 2 days. Regression of all pairwise Bray-Curtis similarities between samples indicated a rate of bacterial community change that slows from 2.1%–0.18% per day over a week to 2 months; the rate stays around 0.4% per day for viruses. Our interpretation is that, over the scale of days, individual bacterial and viral OTUs can be dynamic and patterned; resulting in statistical associations regarded as potential ecological interactions. However, over the scale of weeks, average bacterial community variation is slower, suggesting that there is strong community-level ecological resilience, that is, a tendency to converge towards a ‘mean’ microbial community set by longer-term controlling factors.


The ISME Journal | 2013

Temporal variability and coherence of euphotic zone bacterial communities over a decade in the Southern California Bight

Cheryl-Emiliane T Chow; Rohan Sachdeva; Jacob A. Cram; Joshua A. Steele; David M. Needham; Anand Patel; Alma Parada; Jed A. Fuhrman

Time-series are critical to understanding long-term natural variability in the oceans. Bacterial communities in the euphotic zone were investigated for over a decade at the San Pedro Ocean Time-series station (SPOT) off southern California. Community composition was assessed by Automated Ribosomal Intergenic Spacer Analysis (ARISA) and coupled with measurements of oceanographic parameters for the surface ocean (0–5 m) and deep chlorophyll maximum (DCM, average depth ∼30 m). SAR11 and cyanobacterial ecotypes comprised typically more than one-third of the measured community; diversity within both was temporally variable, although a few operational taxonomic units (OTUs) were consistently more abundant. Persistent OTUs, mostly Alphaproteobacteria (SAR11 clade), Actinobacteria and Flavobacteria, tended to be abundant, in contrast to many rarer yet intermittent and ephemeral OTUs. Association networks revealed potential niches for key OTUs from SAR11, cyanobacteria, SAR86 and other common clades on the basis of robust correlations. Resilience was evident by the average communities drifting only slightly as years passed. Average Bray-Curtis similarity between any pair of dates was ∼40%, with a slight decrease over the decade and obvious near-surface seasonality; communities 8–10 years apart were slightly more different than those 1–4 years apart with the highest rate of change at 0–5 m between communities <4 years apart. The surface exhibited more pronounced seasonality than the DCM. Inter-depth Bray-Curtis similarities repeatedly decreased as the water column stratified each summer. Environmental factors were better predictors of shifts in community composition than months or elapsed time alone; yet, the best predictor was community composition at the other depth (that is, 0–5 m versus DCM).


Bioinformatics | 2013

Efficient statistical significance approximation for local similarity analysis of high-throughput time series data

Li Charlie Xia; Dongmei Ai; Jacob A. Cram; Jed A. Fuhrman; Fengzhu Sun

MOTIVATION Local similarity analysis of biological time series data helps elucidate the varying dynamics of biological systems. However, its applications to large scale high-throughput data are limited by slow permutation procedures for statistical significance evaluation. RESULTS We developed a theoretical approach to approximate the statistical significance of local similarity analysis based on the approximate tail distribution of the maximum partial sum of independent identically distributed (i.i.d.) random variables. Simulations show that the derived formula approximates the tail distribution reasonably well (starting at time points > 10 with no delay and > 20 with delay) and provides P-values comparable with those from permutations. The new approach enables efficient calculation of statistical significance for pairwise local similarity analysis, making possible all-to-all local association studies otherwise prohibitive. As a demonstration, local similarity analysis of human microbiome time series shows that core operational taxonomic units (OTUs) are highly synergetic and some of the associations are body-site specific across samples. AVAILABILITY The new approach is implemented in our eLSA package, which now provides pipelines for faster local similarity analysis of time series data. The tool is freely available from eLSAs website: http://meta.usc.edu/softs/lsa. SUPPLEMENTARY INFORMATION Supplementary data are available at Bioinformatics online. CONTACT [email protected].


PLOS ONE | 2011

Accurate genome relative abundance estimation based on shotgun metagenomic reads.

Li Charlie Xia; Jacob A. Cram; Ting Chen; Jed A. Fuhrman; Fengzhu Sun

Accurate estimation of microbial community composition based on metagenomic sequencing data is fundamental for subsequent metagenomics analysis. Prevalent estimation methods are mainly based on directly summarizing alignment results or its variants; often result in biased and/or unstable estimates. We have developed a unified probabilistic framework (named GRAMMy) by explicitly modeling read assignment ambiguities, genome size biases and read distributions along the genomes. Maximum likelihood method is employed to compute Genome Relative Abundance of microbial communities using the Mixture Model theory (GRAMMy). GRAMMy has been demonstrated to give estimates that are accurate and robust across both simulated and real read benchmark datasets. We applied GRAMMy to a collection of 34 metagenomic read sets from four metagenomics projects and identified 99 frequent species (minimally 0.5% abundant in at least 50% of the data- sets) in the human gut samples. Our results show substantial improvements over previous studies, such as adjusting the over-estimated abundance for Bacteroides species for human gut samples, by providing a new reference-based strategy for metagenomic sample comparisons. GRAMMy can be used flexibly with many read assignment tools (mapping, alignment or composition-based) even with low-sensitivity mapping results from huge short-read datasets. It will be increasingly useful as an accurate and robust tool for abundance estimation with the growing size of read sets and the expanding database of reference genomes.


The ISME Journal | 2015

Seasonal and interannual variability of the marine bacterioplankton community throughout the water column over ten years.

Jacob A. Cram; Cheryl-Emiliane T Chow; Rohan Sachdeva; David M. Needham; Alma Parada; Joshua A. Steele; Jed A. Fuhrman

Microbial activities that affect global oceanographic and atmospheric processes happen throughout the water column, yet the long-term ecological dynamics of microbes have been studied largely in the euphotic zone and adjacent seasonally mixed depths. We investigated temporal patterns in the community structure of free-living bacteria, by sampling approximately monthly from 5 m, the deep chlorophyll maximum (∼15–40 m), 150, 500 and 890 m, in San Pedro Channel (maximum depth 900 m, hypoxic below ∼500 m), off the coast of Southern California. Community structure and biodiversity (inverse Simpson index) showed seasonal patterns near the surface and bottom of the water column, but not at intermediate depths. Inverse Simpson’s index was highest in the winter in surface waters and in the spring at 890 m, and varied interannually at all depths. Biodiversity appeared to be driven partially by exchange of microbes between depths and was highest when communities were changing slowly over time. Meanwhile, communities from the surface through 500 m varied interannually. After accounting for seasonality, several environmental parameters co-varied with community structure at the surface and 890 m, but not at the intermediate depths. Abundant and seasonally variable groups included, at 890 m, Nitrospina, Flavobacteria and Marine Group A. Seasonality at 890 m is likely driven by variability in sinking particles, which originate in surface waters, pass transiently through the middle water column and accumulate on the seafloor where they alter the chemical environment. Seasonal subeuphotic groups are likely those whose ecology is strongly influenced by these particles. This surface-to-bottom, decade-long, study identifies seasonality and interannual variability not only of overall community structure, but also of numerous taxonomic groups and near-species level operational taxonomic units.


The ISME Journal | 2015

Cross-depth analysis of marine bacterial networks suggests downward propagation of temporal changes

Jacob A. Cram; Li Charlie Xia; David M. Needham; Rohan Sachdeva; Fengzhu Sun; Jed A. Fuhrman

Interactions among microbes and stratification across depths are both believed to be important drivers of microbial communities, though little is known about how microbial associations differ between and across depths. We have monitored the free-living microbial community at the San Pedro Ocean Time-series station, monthly, for a decade, at five different depths: 5 m, the deep chlorophyll maximum layer, 150 m, 500 m and 890 m (just above the sea floor). Here, we introduce microbial association networks that combine data from multiple ocean depths to investigate both within- and between-depth relationships, sometimes time-lagged, among microbes and environmental parameters. The euphotic zone, deep chlorophyll maximum and 890 m depth each contain two negatively correlated ‘modules’ (groups of many inter-correlated bacteria and environmental conditions) suggesting regular transitions between two contrasting environmental states. Two-thirds of pairwise correlations of bacterial taxa between depths lagged such that changes in the abundance of deeper organisms followed changes in shallower organisms. Taken in conjunction with previous observations of seasonality at 890 m, these trends suggest that planktonic microbial communities throughout the water column are linked to environmental conditions and/or microbial communities in overlying waters. Poorly understood groups including Marine Group A, Nitrospina and AEGEAN-169 clades contained taxa that showed diverse association patterns, suggesting these groups contain multiple ecological species, each shaped by different factors, which we have started to delineate. These observations build upon previous work at this location, lending further credence to the hypothesis that sinking particles and vertically migrating animals transport materials that significantly shape the time-varying patterns of microbial community composition.


BMC Bioinformatics | 2015

Statistical significance approximation in local trend analysis of high-throughput time-series data using the theory of Markov chains

Li Charlie Xia; Dongmei Ai; Jacob A. Cram; Xiaoyi Liang; Jed A. Fuhrman; Fengzhu Sun

BackgroundLocal trend (i.e. shape) analysis of time series data reveals co-changing patterns in dynamics of biological systems. However, slow permutation procedures to evaluate the statistical significance of local trend scores have limited its applications to high-throughput time series data analysis, e.g., data from the next generation sequencing technology based studies.ResultsBy extending the theories for the tail probability of the range of sum of Markovian random variables, we propose formulae for approximating the statistical significance of local trend scores. Using simulations and real data, we show that the approximate p-value is close to that obtained using a large number of permutations (starting at time points >20 with no delay and >30 with delay of at most three time steps) in that the non-zero decimals of the p-values obtained by the approximation and the permutations are mostly the same when the approximate p-value is less than 0.05. In addition, the approximate p-value is slightly larger than that based on permutations making hypothesis testing based on the approximate p-value conservative. The approximation enables efficient calculation of p-values for pairwise local trend analysis, making large scale all-versus-all comparisons possible. We also propose a hybrid approach by integrating the approximation and permutations to obtain accurate p-values for significantly associated pairs. We further demonstrate its use with the analysis of the Polymouth Marine Laboratory (PML) microbial community time series from high-throughput sequencing data and found interesting organism co-occurrence dynamic patterns.AvailabilityThe software tool is integrated into the eLSA software package that now provides accelerated local trend and similarity analysis pipelines for time series data. The package is freely available from the eLSA website: http://bitbucket.org/charade/elsa.


The ISME Journal | 2018

Dynamics and interactions of highly resolved marine plankton via automated high-frequency sampling

David M. Needham; Erin B. Fichot; Ellice Wang; Lyria Berdjeb; Jacob A. Cram; Cédric G. Fichot; Jed A. Fuhrman

Short timescale observations are valuable for understanding microbial ecological processes. We assessed dynamics in relative abundance and potential activities by sequencing the small sub-unit ribosomal RNA gene (rRNA gene) and rRNA molecules (rRNA) of Bacteria, Archaea, and Eukaryota once to twice daily between March 2014 and May 2014 from the surface ocean off Catalina Island, California. Typically Ostreococcus, Braarudosphaera, Teleaulax, and Synechococcus dominated phytoplankton sequences (including chloroplasts) while SAR11, Sulfitobacter, and Fluviicola dominated non-phytoplankton Bacteria and Archaea. We observed short-lived increases of diatoms, mostly Pseudo-nitzschia and Chaetoceros, with quickly responding Bacteria and Archaea including Flavobacteriaceae (Polaribacter & Formosa), Roseovarius, and Euryarchaeota (MGII), notably the exact amplicon sequence variants we observed responding similarly to another diatom bloom nearby, 3 years prior. We observed correlations representing known interactions among abundant phytoplankton rRNA sequences, demonstrating the biogeochemical and ecological relevance of such interactions: (1) The kleptochloroplastidic ciliate Mesodinium 18S rRNA gene sequences and a single Teleaulax taxon (via 16S rRNA gene sequences) were correlated (Spearman r = 0.83) yet uncorrelated to a Teleaulax 18S rRNA gene OTU, or any other taxon (consistent with a kleptochloroplastidic or karyokleptic relationship) and (2) the photosynthetic prymnesiophyte Braarudosphaera bigelowii and two strains of diazotrophic cyanobacterium UCYN-A were correlated and each taxon was also correlated to other taxa, including B. bigelowii to a verrucomicrobium and a dictyochophyte phytoplankter (all r > 0.8). We also report strong correlations (r > 0.7) between various ciliates, bacteria, and phytoplankton, suggesting interactions via currently unknown mechanisms. These data reiterate the utility of high-frequency time series to show rapid microbial reactions to stimuli, and provide new information about in situ dynamics of previously recognized and hypothesized interactions.

Collaboration


Dive into the Jacob A. Cram's collaboration.

Top Co-Authors

Avatar

Jed A. Fuhrman

University of Southern California

View shared research outputs
Top Co-Authors

Avatar

David M. Needham

University of Southern California

View shared research outputs
Top Co-Authors

Avatar

Fengzhu Sun

University of Southern California

View shared research outputs
Top Co-Authors

Avatar
Top Co-Authors

Avatar

Rohan Sachdeva

University of Southern California

View shared research outputs
Top Co-Authors

Avatar

Alma Parada

University of Southern California

View shared research outputs
Top Co-Authors

Avatar

Cheryl-Emiliane T Chow

University of Southern California

View shared research outputs
Top Co-Authors

Avatar

Joshua A. Steele

University of Southern California

View shared research outputs
Top Co-Authors

Avatar
Top Co-Authors

Avatar

Ellice Wang

University of Southern California

View shared research outputs
Researchain Logo
Decentralizing Knowledge