Benjamin J. Callahan
Stanford University
Network
Latest external collaboration on country level. Dive into details by clicking on the dots.
Publication
Featured researches published by Benjamin J. Callahan.
Nature Methods | 2016
Benjamin J. Callahan; Paul J. McMurdie; Michael J Rosen; Andrew W Han; Amy Jo A Johnson; Susan Holmes
We present the open-source software package DADA2 for modeling and correcting Illumina-sequenced amplicon errors (https://github.com/benjjneb/dada2). DADA2 infers sample sequences exactly and resolves differences of as little as 1 nucleotide. In several mock communities, DADA2 identified more real variants and output fewer spurious sequences than other methods. We applied DADA2 to vaginal samples from a cohort of pregnant women, revealing a diversity of previously undetected Lactobacillus crispatus variants.
Proceedings of the National Academy of Sciences of the United States of America | 2015
Daniel B. DiGiulio; Benjamin J. Callahan; Paul J. McMurdie; Elizabeth K. Costello; Deirdre J. Lyell; Anna Robaczewska; Christine L. Sun; Daniela S. Aliaga Goltsman; Ronald J. Wong; Gary M. Shaw; David K. Stevenson; Susan Holmes; David A. Relman
Significance The human indigenous microbial communities (microbiota) play critical roles in health and may be especially important for mother and fetus during pregnancy. Using a case-control cohort of 40 women, we characterized weekly variation in the vaginal, gut, and oral microbiota during and after pregnancy. Microbiota membership remained relatively stable at each body site during pregnancy. An altered vaginal microbial community was associated with preterm birth; this finding was corroborated by an analysis of samples from an additional cohort of nine women. We also discovered an abrupt change in the vaginal microbiota at delivery that persisted in some cases for at least 1 y. Our findings suggest that pregnancy outcomes might be predicted by features of the microbiota early in gestation. Despite the critical role of the human microbiota in health, our understanding of microbiota compositional dynamics during and after pregnancy is incomplete. We conducted a case-control study of 49 pregnant women, 15 of whom delivered preterm. From 40 of these women, we analyzed bacterial taxonomic composition of 3,767 specimens collected prospectively and weekly during gestation and monthly after delivery from the vagina, distal gut, saliva, and tooth/gum. Linear mixed-effects modeling, medoid-based clustering, and Markov chain modeling were used to analyze community temporal trends, community structure, and vaginal community state transitions. Microbiota community taxonomic composition and diversity remained remarkably stable at all four body sites during pregnancy (P > 0.05 for trends over time). Prevalence of a Lactobacillus-poor vaginal community state type (CST 4) was inversely correlated with gestational age at delivery (P = 0.0039). Risk for preterm birth was more pronounced for subjects with CST 4 accompanied by elevated Gardnerella or Ureaplasma abundances. This finding was validated with a set of 246 vaginal specimens from nine women (four of whom delivered preterm). Most women experienced a postdelivery disturbance in the vaginal community characterized by a decrease in Lactobacillus species and an increase in diverse anaerobes such as Peptoniphilus, Prevotella, and Anaerococcus species. This disturbance was unrelated to gestational age at delivery and persisted for up to 1 y. These findings have important implications for predicting premature labor, a major global health problem, and for understanding the potential impact of a persistent, altered postpartum microbiota on maternal health, including outcomes of pregnancies following short interpregnancy intervals.
Proceedings of the National Academy of Sciences of the United States of America | 2011
Diamantis Sellis; Benjamin J. Callahan; Dmitri A. Petrov; Philipp W. Messer
Molecular adaptation is typically assumed to proceed by sequential fixation of beneficial mutations. In diploids, this picture presupposes that for most adaptive mutations, the homozygotes have a higher fitness than the heterozygotes. Here, we show that contrary to this expectation, a substantial proportion of adaptive mutations should display heterozygote advantage. This feature of adaptation in diploids emerges naturally from the primary importance of the fitness of heterozygotes for the invasion of new adaptive mutations. We formalize this result in the framework of Fishers influential geometric model of adaptation. We find that in diploids, adaptation should often proceed through a succession of short-lived balanced states that maintain substantially higher levels of phenotypic and fitness variation in the population compared with classic adaptive walks. In fast-changing environments, this variation produces a diversity advantage that allows diploids to remain better adapted compared with haploids despite the disadvantage associated with the presence of unfit homozygotes. The short-lived balanced states arising during adaptive walks should be mostly invisible to current scans for long-term balancing selection. Instead, they should leave signatures of incomplete selective sweeps, which do appear to be common in many species. Our results also raise the possibility that balancing selection, as a natural consequence of frequent adaptation, might play a more prominent role among the forces maintaining genetic variation than is commonly recognized.
The ISME Journal | 2017
Benjamin J. Callahan; Paul J. McMurdie; Susan Holmes
Recent advances have made it possible to analyze high-throughput marker-gene sequencing data without resorting to the customary construction of molecular operational taxonomic units (OTUs): clusters of sequencing reads that differ by less than a fixed dissimilarity threshold. New methods control errors sufficiently such that amplicon sequence variants (ASVs) can be resolved exactly, down to the level of single-nucleotide differences over the sequenced gene region. The benefits of finer resolution are immediately apparent, and arguments for ASV methods have focused on their improved resolution. Less obvious, but we believe more important, are the broad benefits that derive from the status of ASVs as consistent labels with intrinsic biological meaning identified independently from a reference database. Here we discuss how these features grant ASVs the combined advantages of closed-reference OTUs—including computational costs that scale linearly with study size, simple merging between independently processed data sets, and forward prediction—and of de novo OTUs—including accurate measurement of diversity and applicability to communities lacking deep coverage in reference databases. We argue that the improvements in reusability, reproducibility and comprehensiveness are sufficiently great that ASVs should replace OTUs as the standard unit of marker-gene analysis and reporting.
Nature Communications | 2016
Elisabeth Bik; Elizabeth K. Costello; Alexandra D. Switzer; Benjamin J. Callahan; Susan Holmes; Randall S. Wells; Kevin P. Carlin; Eric D. Jensen; Stephanie Venn-Watson; David A. Relman
Marine mammals play crucial ecological roles in the oceans, but little is known about their microbiotas. Here we study the bacterial communities in 337 samples from 5 body sites in 48 healthy dolphins and 18 healthy sea lions, as well as those of adjacent seawater and other hosts. The bacterial taxonomic compositions are distinct from those of other mammals, dietary fish and seawater, are highly diverse and vary according to body site and host species. Dolphins harbour 30 bacterial phyla, with 25 of them in the mouth, several abundant but poorly characterized Tenericutes species in gastric fluid and a surprisingly paucity of Bacteroidetes in distal gut. About 70% of near-full length bacterial 16S ribosomal RNA sequences from dolphins are unique. Host habitat, diet and phylogeny all contribute to variation in marine mammal distal gut microbiota composition. Our findings help elucidate the factors structuring marine mammal microbiotas and may enhance monitoring of marine mammal health.
PLOS Genetics | 2011
Benjamin J. Callahan; Richard A. Neher; Doris Bachtrog; Peter Andolfatto; Boris I. Shraiman
Here we investigate the correlations between coding sequence substitutions as a function of their separation along the protein sequence. We consider both substitutions between the reference genomes of several Drosophilids as well as polymorphisms in a population sample of Zimbabwean Drosophila melanogaster. We find that amino acid substitutions are “clustered” along the protein sequence, that is, the frequency of additional substitutions is strongly enhanced within ≈10 residues of a first such substitution. No such clustering is observed for synonymous substitutions, supporting a “correlation length” associated with selection on proteins as the causative mechanism. Clustering is stronger between substitutions that arose in the same lineage than it is between substitutions that arose in different lineages. We consider several possible origins of clustering, concluding that epistasis (interactions between amino acids within a protein that affect function) and positional heterogeneity in the strength of purifying selection are primarily responsible. The role of epistasis is directly supported by the tendency of nearby substitutions that arose on the same lineage to preserve the total charge of the residues within the correlation length and by the preferential cosegregation of neighboring derived alleles in our population sample. We interpret the observed length scale of clustering as a statistical reflection of the functional locality (or modularity) of proteins: amino acids that are near each other on the protein backbone are more likely to contribute to, and collaborate toward, a common subfunction.
BMC Bioinformatics | 2012
Michael J Rosen; Benjamin J. Callahan; Daniel S. Fisher; Susan Holmes
BackgroundPCR amplification and high-throughput sequencing theoretically enable the characterization of the finest-scale diversity in natural microbial and viral populations, but each of these methods introduces random errors that are difficult to distinguish from genuine biological diversity. Several approaches have been proposed to denoise these data but lack either speed or accuracy.ResultsWe introduce a new denoising algorithm that we call DADA (Divisive Amplicon Denoising Algorithm). Without training data, DADA infers both the sample genotypes and error parameters that produced a metagenome data set. We demonstrate performance on control data sequenced on Roche’s 454 platform, and compare the results to the most accurate denoising software currently available, AmpliconNoise.ConclusionsDADA is more accurate and over an order of magnitude faster than AmpliconNoise. It eliminates the need for training data to establish error parameters, fully utilizes sequence-abundance information, and enables inclusion of context-dependent PCR error rates. It should be readily extensible to other sequencing platforms such as Illumina.
Proceedings of the National Academy of Sciences of the United States of America | 2017
Benjamin J. Callahan; Daniel B. DiGiulio; Daniela S. Aliaga Goltsman; Christine L. Sun; Elizabeth K. Costello; Pratheepa Jeganathan; Joseph Biggio; Ronald J. Wong; Maurice L. Druzin; Gary M. Shaw; David K. Stevenson; Susan Holmes; David A. Relman
Significance Premature birth (PTB) is a major global public health burden. Previous studies have suggested an association between altered vaginal microbiota composition and PTB, although findings across studies have been inconsistent. To address these inconsistencies, improve upon our previous signature, and better understand the vaginal microbiota’s role in PTB, we conducted a case-control study in two cohorts of pregnant women: one predominantly Caucasian at low risk of PTB, the second predominantly African American at high risk. With the results, we were able to replicate our signature in the first cohort and refine our signature of PTB for both cohorts. Our findings elucidate the ecology of the vaginal microbiota and advance our ability to predict and understand the causes of PTB. Preterm birth (PTB) is the leading cause of neonatal morbidity and mortality. Previous studies have suggested that the maternal vaginal microbiota contributes to the pathophysiology of PTB, but conflicting results in recent years have raised doubts. We conducted a study of PTB compared with term birth in two cohorts of pregnant women: one predominantly Caucasian (n = 39) at low risk for PTB, the second predominantly African American and at high-risk (n = 96). We profiled the taxonomic composition of 2,179 vaginal swabs collected prospectively and weekly during gestation using 16S rRNA gene sequencing. Previously proposed associations between PTB and lower Lactobacillus and higher Gardnerella abundances replicated in the low-risk cohort, but not in the high-risk cohort. High-resolution bioinformatics enabled taxonomic assignment to the species and subspecies levels, revealing that Lactobacillus crispatus was associated with low risk of PTB in both cohorts, while Lactobacillus iners was not, and that a subspecies clade of Gardnerella vaginalis explained the genus association with PTB. Patterns of cooccurrence between L. crispatus and Gardnerella were highly exclusive, while Gardnerella and L. iners often coexisted at high frequencies. We argue that the vaginal microbiota is better represented by the quantitative frequencies of these key taxa than by classifying communities into five community state types. Our findings extend and corroborate the association between the vaginal microbiota and PTB, demonstrate the benefits of high-resolution statistical bioinformatics in clinical microbiome studies, and suggest that previous conflicting results may reflect the different risk profile of women of black race.
Proceedings of the National Academy of Sciences of the United States of America | 2009
Benjamin J. Callahan; Mukund Thattai; Boris I. Shraiman
Polyketides are a class of biologically active heteropolymers produced by assembly line-like multiprotein complexes of modular polyketide synthases (PKS). The polyketide product is encoded in the order of the PKS proteins in the assembly line, suggesting that polyketide diversity derives from combinatorial rearrangement of these PKS complexes. Remarkably, the order of PKS genes on the chromosome follows the order of PKS proteins in the assembly line: This fact is commonly referred to as “collinearity”. Here we propose an evolutionary origin for collinearity and demonstrate the mechanism by using a computational model of PKS evolution in a population. Assuming continuous evolutionary pressure for novel polyketides, and that new polyketide pathways are formed by horizontal transfer/recombination of PKS-encoding DNA, we demonstrate the existence of a broad range of parameters for which collinearity emerges spontaneously. Collinearity confers no fitness advantage in our model; it is established and maintained through a “secondary selection” mechanism, as a trait which increases the probability of forming long, novel PKS complexes through recombination. Consequently, collinearity hitchhikes on the successful genotypes which periodically sweep through the evolving population. In addition to computer simulation of a simplified model of PKS evolution, we provide a mathematical framework describing the secondary selection mechanism, which generalizes beyond the context of the present model.
Evolution | 2014
Benjamin J. Callahan; Tadashi Fukami; Daniel S. Fisher
Many species engage in adaptive niche construction: modification of the local environment that increases the modifying organisms competitive fitness. Adaptive niche construction provides an alternative pathway to higher fitness, shaping the environment rather than conforming to it. Yet, experimental evidence for the evolutionary emergence of adaptive niche construction is lacking, leaving its role in evolution uncertain. Here we report a direct observation of the de novo evolution of adaptive niche construction in populations of the bacteria Pseudomonas fluorescens. In a laboratory experiment, we allowed several bacterial populations to adapt to a novel environment and assessed whether niche construction evolved over time. We found that adaptive niche construction emerged rapidly, within approximately 100 generations, and became ubiquitous after approximately 400 generations. The large fitness effect of this niche construction was dominated by the low fitness of evolved strains in the ancestrally modified environment: evolved niche constructors were highly dependent on their specific environmental modifications. Populations were subjected to frequent resetting of environmental conditions and severe reduction of spatial habitat structure, both of which are thought to make adaptive niche construction difficult to evolve. Our finding that adaptive niche construction nevertheless evolved repeatably suggests that it may play a more important role in evolution than generally thought.