Network


Latest external collaboration on country level. Dive into details by clicking on the dots.

Hotspot


Dive into the research topics where Colton J. Lloyd is active.

Publication


Featured researches published by Colton J. Lloyd.


Current Opinion in Biotechnology | 2015

Next-generation genome-scale models for metabolic engineering.

Zachary A. King; Colton J. Lloyd; Adam M. Feist; Bernhard O. Palsson

Constraint-based reconstruction and analysis (COBRA) methods have become widely used tools for metabolic engineering in both academic and industrial laboratories. By employing a genome-scale in silico representation of the metabolic network of a host organism, COBRA methods can be used to predict optimal genetic modifications that improve the rate and yield of chemical production. A new generation of COBRA models and methods is now being developed--encompassing many biological processes and simulation strategies-and next-generation models enable new types of predictions. Here, three key examples of applying COBRA methods to strain optimization are presented and discussed. Then, an outlook is provided on the next generation of COBRA models and the new types of predictions they will enable for systems metabolic engineering.


Nature Biotechnology | 2017

iML1515, a knowledgebase that computes Escherichia coli traits

Jonathan M. Monk; Colton J. Lloyd; Elizabeth Brunk; Nathan Mih; Anand Sastry; Zachary A. King; Rikiya Takeuchi; Wataru Nomura; Zhen Zhang; Hirotada Mori; Adam M. Feist; Bernhard O. Palsson

iML1515, a knowledgebase that computes Escherichia coli traits To the Editor: Extracting knowledge from the many types of big data produced by high-throughput methods remains a challenge, even when data are from Escherichia coli, the best characterized bacterial species. Here, we present iML1515, the most complete genome-scale reconstruction of the metabolic network in E. coli K-12 MG1655 to date, and we demonstrate how it can be used to address this challenge. Enabling analysis of several data types, including transcriptomes, proteomes, and metabolomes, iML1515 accounts for 1,515 open reading frames and 2,719 metabolic reactions involving 1,192 unique metabolites. The iML1515 knowledgebase is linked to 1,515 protein structures to provide an integrated modeling framework bridging systems and structural biology. We apply iML1515 to build metabolic models of E. coli human gut microbiome strains from metagenomic sequencing data. We then use iML1515 to build metabolic models for E. coli clinical isolates and predict their metabolic capabilities. Finally, we use iML1515 to carry out a comparative structural proteome analysis of 1,122 E. coli strains and identify multi-strain sequence variations.


Proceedings of the National Academy of Sciences of the United States of America | 2015

Systems biology definition of the core proteome of metabolism and expression is consistent with high-throughput data

Laurence Yang; Justin Tan; Edward J. O’Brien; Jonathan M. Monk; Donghyuk Kim; Howard J. Li; Pep Charusanti; Ali Ebrahim; Colton J. Lloyd; James T. Yurkovich; Bin Du; Andreas Dräger; Alex Thomas; Yuekai Sun; Michael A. Saunders; Bernhard O. Palsson

Significance Defining a core functional proteome supporting the living process has importance for both developing fundamental understanding of cell functions and for synthetic biology applications. Comparative genomics has been the primary approach to achieve such a definition. Here, we use genome-scale models to define a core proteome that computationally supports basic cellular function. This core proteome for metabolism and protein expression, defined through systems biology methods, is validated and characterized by using multiple disparate data types. Finding the minimal set of gene functions needed to sustain life is of both fundamental and practical importance. Minimal gene lists have been proposed by using comparative genomics-based core proteome definitions. A definition of a core proteome that is supported by empirical data, is understood at the systems-level, and provides a basis for computing essential cell functions is lacking. Here, we use a systems biology-based genome-scale model of metabolism and expression to define a functional core proteome consisting of 356 gene products, accounting for 44% of the Escherichia coli proteome by mass based on proteomics data. This systems biology core proteome includes 212 genes not found in previous comparative genomics-based core proteome definitions, accounts for 65% of known essential genes in E. coli, and has 78% gene function overlap with minimal genomes (Buchnera aphidicola and Mycoplasma genitalium). Based on transcriptomics data across environmental and genetic backgrounds, the systems biology core proteome is significantly enriched in nondifferentially expressed genes and depleted in differentially expressed genes. Compared with the noncore, core gene expression levels are also similar across genetic backgrounds (two times higher Spearman rank correlation) and exhibit significantly more complex transcriptional and posttranscriptional regulatory features (40% more transcription start sites per gene, 22% longer 5′UTR). Thus, genome-scale systems biology approaches rigorously identify a functional core proteome needed to support growth. This framework, validated by using high-throughput datasets, facilitates a mechanistic understanding of systems-level core proteome function through in silico models; it de facto defines a paleome.


Applied and Environmental Microbiology | 2017

Laboratory Evolution to Alternating Substrate Environments Yields Distinct Phenotypic and Genetic Adaptive Strategies

Troy E. Sandberg; Colton J. Lloyd; Bernhard O. Palsson; Adam M. Feist

ABSTRACT Adaptive laboratory evolution (ALE) experiments are often designed to maintain a static culturing environment to minimize confounding variables that could influence the adaptive process, but dynamic nutrient conditions occur frequently in natural and bioprocessing settings. To study the nature of carbon substrate fitness tradeoffs, we evolved batch cultures of Escherichia coli via serial propagation into tubes alternating between glucose and either xylose, glycerol, or acetate. Genome sequencing of evolved cultures revealed several genetic changes preferentially selected for under dynamic conditions and different adaptation strategies depending on the substrates being switched between; in some environments, a persistent “generalist” strain developed, while in another, two “specialist” subpopulations arose that alternated dominance. Diauxic lag phenotype varied across the generalists and specialists, in one case being completely abolished, while gene expression data distinguished the transcriptional strategies implemented by strains in pursuit of growth optimality. Genome-scale metabolic modeling techniques were then used to help explain the inherent substrate differences giving rise to the observed distinct adaptive strategies. This study gives insight into the population dynamics of adaptation in an alternating environment and into the underlying metabolic and genetic mechanisms. Furthermore, ALE-generated optimized strains have phenotypes with potential industrial bioprocessing applications. IMPORTANCE Evolution and natural selection inexorably lead to an organisms improved fitness in a given environment, whether in a laboratory or natural setting. However, despite the frequent natural occurrence of complex and dynamic growth environments, laboratory evolution experiments typically maintain simple, static culturing environments so as to reduce selection pressure complexity. In this study, we investigated the adaptive strategies underlying evolution to fluctuating environments by evolving Escherichia coli to conditions of frequently switching growth substrate. Characterization of evolved strains via a number of different data types revealed the various genetic and phenotypic changes implemented in pursuit of growth optimality and how these differed across the different growth substrates and switching protocols. This work not only helps to establish general principles of adaptation to complex environments but also suggests strategies for experimental design to achieve desired evolutionary outcomes.


Scientific Reports | 2016

Principles of proteome allocation are revealed using proteomic data and genome-scale models

Laurence Yang; James T. Yurkovich; Colton J. Lloyd; Ali Ebrahim; Michael A. Saunders; Bernhard O. Palsson

Integrating omics data to refine or make context-specific models is an active field of constraint-based modeling. Proteomics now cover over 95% of the Escherichia coli proteome by mass. Genome-scale models of Metabolism and macromolecular Expression (ME) compute proteome allocation linked to metabolism and fitness. Using proteomics data, we formulated allocation constraints for key proteome sectors in the ME model. The resulting calibrated model effectively computed the “generalist” (wild-type) E. coli proteome and phenotype across diverse growth environments. Across 15 growth conditions, prediction errors for growth rate and metabolic fluxes were 69% and 14% lower, respectively. The sector-constrained ME model thus represents a generalist ME model reflecting both growth rate maximization and “hedging” against uncertain environments and stresses, as indicated by significant enrichment of these sectors for the general stress response sigma factor σS. Finally, the sector constraints represent a general formalism for integrating omics data from any experimental condition into constraint-based ME models. The constraints can be fine-grained (individual proteins) or coarse-grained (functionally-related protein groups) as demonstrated here. This flexible formalism provides an accessible approach for narrowing the gap between the complexity captured by omics data and governing principles of proteome allocation described by systems-level models.


Proceedings of the National Academy of Sciences of the United States of America | 2017

Global transcriptional regulatory network for Escherichia coli robustly connects gene expression to transcription factor activities

Xin Fang; Anand Sastry; Nathan Mih; Donghyuk Kim; Justin Tan; James T. Yurkovich; Colton J. Lloyd; Ye Gao; Laurence Yang; Bernhard O. Palsson

Significance While the transcriptional regulatory network (TRN) of Escherichia coli has expanded considerably in recent years through new chromatin immunoprecipitation (ChIP) data, an open question remains: Does the global TRN, reconstructed by combining ChIP data for individual transcriptions factors, consistently explain observed differential gene expression? We have reconstructed a high-confidence TRN, determined its consistency with transcriptomics and predictive capabilities across multiple conditions, extracted 10 functional regulatory modules, and characterized this network at the sequence and structural levels. Our multiomics algorithmic pipeline is expected to facilitate rigorous validation and prioritization of experiments to elucidate TRNs in other bacteria. Transcriptional regulatory networks (TRNs) have been studied intensely for >25 y. Yet, even for the Escherichia coli TRN—probably the best characterized TRN—several questions remain. Here, we address three questions: (i) How complete is our knowledge of the E. coli TRN; (ii) how well can we predict gene expression using this TRN; and (iii) how robust is our understanding of the TRN? First, we reconstructed a high-confidence TRN (hiTRN) consisting of 147 transcription factors (TFs) regulating 1,538 transcription units (TUs) encoding 1,764 genes. The 3,797 high-confidence regulatory interactions were collected from published, validated chromatin immunoprecipitation (ChIP) data and RegulonDB. For 21 different TF knockouts, up to 63% of the differentially expressed genes in the hiTRN were traced to the knocked-out TF through regulatory cascades. Second, we trained supervised machine learning algorithms to predict the expression of 1,364 TUs given TF activities using 441 samples. The algorithms accurately predicted condition-specific expression for 86% (1,174 of 1,364) of the TUs, while 193 TUs (14%) were predicted better than random TRNs. Third, we identified 10 regulatory modules whose definitions were robust against changes to the TRN or expression compendium. Using surrogate variable analysis, we also identified three unmodeled factors that systematically influenced gene expression. Our computational workflow comprehensively characterizes the predictive capabilities and systems-level functions of an organism’s TRN from disparate data types.


bioRxiv | 2017

COBRAme: A Computational Framework for Building and Manipulating Models of Metabolism and Gene Expression

Colton J. Lloyd; Ali Ebrahim; Laurence Yang; Zachary A. King; Edward Catoiu; Edward J. O'Brien; Joanne K. Liu; Bernhard O. Palsson

Genome-scale models of metabolism and macromolecular expression (ME-models) explicitly compute the optimal proteome composition of a growing cell. ME-models expand upon the well-established genome-scale models of metabolism (M-models), and they enable new and exciting insights that are fundamental to understanding the basis of cellular growth. ME-models have increased predictive capabilities and accuracy due to their inclusion of the biosynthetic costs for the machinery of life, but they come with a significant increase in model size and complexity. This challenge results in models which are both difficult to compute and challenging to understand conceptually. As a result, ME-models exist for only two organisms (Escherichia coli and Thermotoga maritima) and are still used by relatively few researchers. To address these challenges, we have developed a new software framework called COBRAme for building and simulating ME-models. It is coded in Python and built on COBRApy, a popular platform for using M-models. COBRAme streamlines computation and analysis of ME-models. It provides tools to simplify the construction and manipulation of ME-models to enable ME-model reconstructions for new organisms. We used COBRAme to reconstruct a condensed E. coli ME-model called iLE1678-ME. This new model gives virtually identical solutions to previous ME-models while using ¼ the number of free variables and solving in ~10 minutes, a marked improvement over the ~6 hour solve time of previous ME-model formulations. This manuscript outlines the architecture of COBRAme and demonstrates how ME-models can be built and edited most efficiently using the software.


PLOS Computational Biology | 2018

COBRAme: A computational framework for genome-scale models of metabolism and gene expression

Colton J. Lloyd; Ali Ebrahim; Laurence Yang; Zachary A. King; Edward Catoiu; Edward J. O’Brien; Joanne K. Liu; Bernhard O. Palsson

Genome-scale models of metabolism and macromolecular expression (ME-models) explicitly compute the optimal proteome composition of a growing cell. ME-models expand upon the well-established genome-scale models of metabolism (M-models), and they enable a new fundamental understanding of cellular growth. ME-models have increased predictive capabilities and accuracy due to their inclusion of the biosynthetic costs for the machinery of life, but they come with a significant increase in model size and complexity. This challenge results in models which are both difficult to compute and challenging to understand conceptually. As a result, ME-models exist for only two organisms (Escherichia coli and Thermotoga maritima) and are still used by relatively few researchers. To address these challenges, we have developed a new software framework called COBRAme for building and simulating ME-models. It is coded in Python and built on COBRApy, a popular platform for using M-models. COBRAme streamlines computation and analysis of ME-models. It provides tools to simplify constructing and editing ME-models to enable ME-model reconstructions for new organisms. We used COBRAme to reconstruct a condensed E. coli ME-model called iJL1678b-ME. This reformulated model gives functionally identical solutions to previous E. coli ME-models while using 1/6 the number of free variables and solving in less than 10 minutes, a marked improvement over the 6 hour solve time of previous ME-model formulations. Errors in previous ME-models were also corrected leading to 52 additional genes that must be expressed in iJL1678b-ME to grow aerobically in glucose minimal in silico media. This manuscript outlines the architecture of COBRAme and demonstrates how ME-models can be created, modified, and shared most efficiently using the new software framework.


bioRxiv | 2017

Multi-scale model of the proteomic and metabolic consequences of reactive oxygen species

Laurence Yang; Nathan Mih; James T. Yurkovich; Joon Ho Park; Sangwoo Seo; Donghyuk Kim; Jonathan M. Monk; Colton J. Lloyd; Justin Tan; Ye Gao; Jared T. Broddrick; Ke Chen; David Heckmann; Adam M. Feist; Bernhard O. Palsson

All aerobically growing microbes must deal with oxidative stress from intrinsically-generated reactive oxygen species (ROS), or from external ROS in the context of infection. To study the systems biology of microbial ROS response, we developed a genome-scale model of proteome damage and maintenance in response to ROS, by extending a genome-scale metabolism and macro-molecular expression (ME) model of E. coli. This OxidizeME model recapitulated measured microbial oxidative stress response including metal-loenzyme inactivation by ROS and amino acid auxotrophies. OxidizeME also correctly predicted differential expression under ROS stress. We used OxidizeME to investigate how environmental context affects the flexibility of ROS stress response. The context-dependency of microbial stress response has important implications for infectious disease. OxidizeME provides a computational resource for model-driven experiment design in this direction.Catalysis using iron-sulfur clusters and transition metals can be traced back to the last universal common ancestor. The damage to metalloproteins caused by reactive oxygen species (ROS) can completely inhibit cell growth when unmanaged and thus elicits an essential stress response that is universal and fundamental in biology. We develop a computable multi-scale description of the ROS stress response in Escherichia coli. We show that this quantitative framework allows for the understanding and prediction of ROS stress responses at three levels: 1) pathways: amino acid auxotrophies, 2) networks: the systemic response to ROS stress, and 3) genetic basis: adaptation to ROS stress during laboratory evolution. These results show that we can now develop fundamental and quantitative genotype-phenotype relationships for stress responses on a genome-wide basis.


bioRxiv | 2018

Model-driven design and evolution of non-trivial synthetic syntrophic pairs

Colton J. Lloyd; Zachary A. King; Troy E. Sandberg; Ying Hefner; Connor Olson; Patrick Phaneuf; Edward J. O'Brien; Adam M. Feist

Synthetic microbial communities are attractive for applied biotechnology and healthcare applications through their ability to efficiently partition complex metabolic functions. By pairing auxotrophic mutants in co-culture, nascent E. coli communities can be established where strain pairs are metabolically coupled. Intuitive synthetic communities have been demonstrated, but the full space of cross-feeding metabolites has yet to be explored. A novel algorithm, OptAux, was constructed to design 66 multi-knockout E. coli auxotrophic strains that require significant metabolite cross-feeding when paired in co-culture. Three OptAux predicted auxotrophic strains were co-cultured with an L-histidine auxotroph and validated via adaptive laboratory evolution (ALE). Time-course sequencing revealed the genetic changes employed by each strain to achieve higher community fitness and provided insights on mechanisms for sharing and adapting to the syntrophic niche. A community model of metabolism and gene expression was utilized to predict the relative community composition and fundamental characteristics of the evolved communities. This work presents a novel computational method to elucidate metabolic changes that empower community formation and thus guide the optimization of co-cultures for a desired application.

Collaboration


Dive into the Colton J. Lloyd's collaboration.

Top Co-Authors

Avatar
Top Co-Authors

Avatar

Laurence Yang

University of California

View shared research outputs
Top Co-Authors

Avatar

Adam M. Feist

University of California

View shared research outputs
Top Co-Authors

Avatar
Top Co-Authors

Avatar

Ali Ebrahim

University of California

View shared research outputs
Top Co-Authors

Avatar
Top Co-Authors

Avatar
Top Co-Authors

Avatar

Justin Tan

University of California

View shared research outputs
Top Co-Authors

Avatar
Top Co-Authors

Avatar

Nathan Mih

University of California

View shared research outputs
Researchain Logo
Decentralizing Knowledge