Alexey E. Kazakov
Lawrence Berkeley National Laboratory
Network
Latest external collaboration on country level. Dive into details by clicking on the dots.
Publication
Featured researches published by Alexey E. Kazakov.
BMC Genomics | 2013
Pavel S. Novichkov; Alexey E. Kazakov; Dmitry A. Ravcheev; Semen A. Leyn; Galina Yu Kovaleva; Roman A. Sutormin; Marat D. Kazanov; William J Riehl; Adam P. Arkin; Inna Dubchak; Dmitry A. Rodionov
BackgroundGenome-scale prediction of gene regulation and reconstruction of transcriptional regulatory networks in prokaryotes is one of the critical tasks of modern genomics. Bacteria from different taxonomic groups, whose lifestyles and natural environments are substantially different, possess highly diverged transcriptional regulatory networks. The comparative genomics approaches are useful for in silico reconstruction of bacterial regulons and networks operated by both transcription factors (TFs) and RNA regulatory elements (riboswitches).DescriptionRegPrecise (http://regprecise.lbl.gov) is a web resource for collection, visualization and analysis of transcriptional regulons reconstructed by comparative genomics. We significantly expanded a reference collection of manually curated regulons we introduced earlier. RegPrecise 3.0 provides access to inferred regulatory interactions organized by phylogenetic, structural and functional properties. Taxonomy-specific collections include 781 TF regulogs inferred in more than 160 genomes representing 14 taxonomic groups of Bacteria. TF-specific collections include regulogs for a selected subset of 40 TFs reconstructed across more than 30 taxonomic lineages. Novel collections of regulons operated by RNA regulatory elements (riboswitches) include near 400 regulogs inferred in 24 bacterial lineages. RegPrecise 3.0 provides four classifications of the reference regulons implemented as controlled vocabularies: 55 TF protein families; 43 RNA motif families; ~150 biological processes or metabolic pathways; and ~200 effectors or environmental signals. Genome-wide visualization of regulatory networks and metabolic pathways covered by the reference regulons are available for all studied genomes. A separate section of RegPrecise 3.0 contains draft regulatory networks in 640 genomes obtained by an conservative propagation of the reference regulons to closely related genomes.ConclusionsRegPrecise 3.0 gives access to the transcriptional regulons reconstructed in bacterial genomes. Analytical capabilities include exploration of: regulon content, structure and function; TF binding site motifs; conservation and variations in genome-wide regulatory networks across all taxonomic groups of Bacteria. RegPrecise 3.0 was selected as a core resource on transcriptional regulation of the Department of Energy Systems Biology Knowledgebase, an emerging software and data environment designed to enable researchers to collaboratively generate, test and share new hypotheses about gene and protein functions, perform large-scale analyses, and model interactions in microbes, plants, and their communities.
Nucleic Acids Research | 2010
Pavel S. Novichkov; Dmitry A. Rodionov; Elena D. Stavrovskaya; Elena S. Novichkova; Alexey E. Kazakov; Mikhail S. Gelfand; Adam P. Arkin; Andrey A. Mironov; Inna Dubchak
RegPredict web server is designed to provide comparative genomics tools for reconstruction and analysis of microbial regulons using comparative genomics approach. The server allows the user to rapidly generate reference sets of regulons and regulatory motif profiles in a group of prokaryotic genomes. The new concept of a cluster of co-regulated orthologous operons allows the user to distribute the analysis of large regulons and to perform the comparative analysis of multiple clusters independently. Two major workflows currently implemented in RegPredict are: (i) regulon reconstruction for a known regulatory motif and (ii) ab initio inference of a novel regulon using several scenarios for the generation of starting gene sets. RegPredict provides a comprehensive collection of manually curated positional weight matrices of regulatory motifs. It is based on genomic sequences, ortholog and operon predictions from the MicrobesOnline. An interactive web interface of RegPredict integrates and presents diverse genomic and functional information about the candidate regulon members from several web resources. RegPredict is freely accessible at http://regpredict.lbl.gov.
Journal of Bacteriology | 2012
Dmitry A. Ravcheev; Xiaoqing Li; Haythem Latif; Karsten Zengler; Semen A. Leyn; Yuri D. Korostelev; Alexey E. Kazakov; Pavel S. Novichkov; Andrei L. Osterman; Dmitry A. Rodionov
Redox-sensing repressor Rex was previously implicated in the control of anaerobic respiration in response to the cellular NADH/NAD(+) levels in gram-positive bacteria. We utilized the comparative genomics approach to infer candidate Rex-binding DNA motifs and assess the Rex regulon content in 119 genomes from 11 taxonomic groups. Both DNA-binding and NAD-sensing domains are broadly conserved in Rex orthologs identified in the phyla Firmicutes, Thermotogales, Actinobacteria, Chloroflexi, Deinococcus-Thermus, and Proteobacteria. The identified DNA-binding motifs showed significant conservation in these species, with the only exception detected in Clostridia, where the Rex motif deviates in two positions from the generalized consensus, TTGTGAANNNNTTCACAA. Comparative analysis of candidate Rex sites revealed remarkable variations in functional repertoires of candidate Rex-regulated genes in various microorganisms. Most of the reconstructed regulatory interactions are lineage specific, suggesting frequent events of gain and loss of regulator binding sites in the evolution of Rex regulons. We identified more than 50 novel Rex-regulated operons encoding functions that are essential for resumption of the NADH:NAD(+) balance. The novel functional role of Rex in the control of the central carbon metabolism and hydrogen production genes was validated by in vitro DNA binding assays using the TM0169 protein in the hydrogen-producing bacterium Thermotoga maritima.
BMC Genomics | 2011
Dmitry A. Rodionov; Pavel S. Novichkov; Elena D. Stavrovskaya; Irina A. Rodionova; Xiaoqing Li; Marat D. Kazanov; Dmitry A. Ravcheev; Anna V. Gerasimova; Alexey E. Kazakov; Galina Yu Kovaleva; Elizabeth A. Permina; Olga N. Laikova; Ross Overbeek; Margaret F. Romine; James K. Fredrickson; Adam P. Arkin; Inna Dubchak; Andrei L. Osterman; Mikhail S. Gelfand
BackgroundGenome-scale prediction of gene regulation and reconstruction of transcriptional regulatory networks in bacteria is one of the critical tasks of modern genomics. The Shewanella genus is comprised of metabolically versatile gamma-proteobacteria, whose lifestyles and natural environments are substantially different from Escherichia coli and other model bacterial species. The comparative genomics approaches and computational identification of regulatory sites are useful for the in silico reconstruction of transcriptional regulatory networks in bacteria.ResultsTo explore conservation and variations in the Shewanella transcriptional networks we analyzed the repertoire of transcription factors and performed genomics-based reconstruction and comparative analysis of regulons in 16 Shewanella genomes. The inferred regulatory network includes 82 transcription factors and their DNA binding sites, 8 riboswitches and 6 translational attenuators. Forty five regulons were newly inferred from the genome context analysis, whereas others were propagated from previously characterized regulons in the Enterobacteria and Pseudomonas spp.. Multiple variations in regulatory strategies between the Shewanella spp. and E. coli include regulon contraction and expansion (as in the case of PdhR, HexR, FadR), numerous cases of recruiting non-orthologous regulators to control equivalent pathways (e.g. PsrA for fatty acid degradation) and, conversely, orthologous regulators to control distinct pathways (e.g. TyrR, ArgR, Crp).ConclusionsWe tentatively defined the first reference collection of ~100 transcriptional regulons in 16 Shewanella genomes. The resulting regulatory network contains ~600 regulated genes per genome that are mostly involved in metabolism of carbohydrates, amino acids, fatty acids, vitamins, metals, and stress responses. Several reconstructed regulons including NagR for N-acetylglucosamine catabolism were experimentally validated in S. oneidensis MR-1. Analysis of correlations in gene expression patterns helps to interpret the reconstructed regulatory network. The inferred regulatory interactions will provide an additional regulatory constrains for an integrated model of metabolism and regulation in S. oneidensis MR-1.
Journal of Bacteriology | 2009
Alexey E. Kazakov; Dmitry A. Rodionov; Eric J. Alm; Adam P. Arkin; Inna Dubchak; Mikhail S. Gelfand
Bacteria can use branched-chain amino acids (ILV, i.e., isoleucine, leucine, valine) and fatty acids (FAs) as sole carbon and energy sources converting ILV into acetyl-coenzyme A (CoA), propanoyl-CoA, and propionyl-CoA, respectively. In this work, we used the comparative genomic approach to identify candidate transcriptional factors and DNA motifs that control ILV and FA utilization pathways in proteobacteria. The metabolic regulons were characterized based on the identification and comparison of candidate transcription factor binding sites in groups of phylogenetically related genomes. The reconstructed ILV/FA regulatory network demonstrates considerable variability and involves six transcriptional factors from the MerR, TetR, and GntR families binding to 11 distinct DNA motifs. The ILV degradation genes in gamma- and betaproteobacteria are regulated mainly by a novel regulator from the MerR family (e.g., LiuR in Pseudomonas aeruginosa) (40 species); in addition, the TetR-type regulator LiuQ was identified in some betaproteobacteria (eight species). Besides the core set of ILV utilization genes, the LiuR regulon in some lineages is expanded to include genes from other metabolic pathways, such as the glyoxylate shunt and glutamate synthase in Shewanella species. The FA degradation genes are controlled by four regulators including FadR in gammaproteobacteria (34 species), PsrA in gamma- and betaproteobacteria (45 species), FadP in betaproteobacteria (14 species), and LiuR orthologs in alphaproteobacteria (22 species). The remarkable variability of the regulatory systems associated with the FA degradation pathway is discussed from functional and evolutionary points of view.
BMC Genomics | 2013
Michael J. Cipriano; Pavel N Novichkov; Alexey E. Kazakov; Dmitry A. Rodionov; Adam P. Arkin; Mikhail S. Gelfand; Inna Dubchak
BackgroundDue to the constantly growing number of sequenced microbial genomes, comparative genomics has been playing a major role in the investigation of regulatory interactions in bacteria. Regulon inference mostly remains a field of semi-manual examination since absence of a knowledgebase and informatics platform for automated and systematic investigation restricts opportunities for computational prediction. Additionally, confirming computationally inferred regulons by experimental data is critically important.DescriptionRegTransBase is an open-access platform with a user-friendly web interface publicly available at http://regtransbase.lbl.gov. It consists of two databases – a manually collected hierarchical regulatory interactions database based on more than 7000 scientific papers which can serve as a knowledgebase for verification of predictions, and a large set of curated by experts transcription factor binding sites used in regulon inference by a variety of tools. RegTransBase captures the knowledge from published scientific literature using controlled vocabularies and contains various types of experimental data, such as: the activation or repression of transcription by an identified direct regulator; determination of the transcriptional regulatory function of a protein (or RNA) directly binding to DNA or RNA; mapping of binding sites for a regulatory protein; characterization of regulatory mutations. Analysis of the data collected from literature resulted in the creation of Putative Regulons from Experimental Data that are also available in RegTransBase.ConclusionsRegTransBase is a powerful user-friendly platform for the investigation of regulation in prokaryotes. It uses a collection of validated regulatory sequences that can be easily extracted and used to infer regulatory interactions by comparative genomics techniques thus assisting researchers in the interpretation of transcriptional regulation data.
Journal of Bacteriology | 2011
Anna Gerasimova; Alexey E. Kazakov; Adam P. Arkin; Inna Dubchak; Mikhail S. Gelfand
In response to stresses, Mycobacterium cells become dormant. This process is regulated by the DosR transcription factor. In Mycobacterium tuberculosis, the dormancy regulon is well characterized and contains the dosR gene itself and dosS and dosT genes encoding DosR kinases, nitroreductases (acg; Rv3131), diacylglycerol acyltransferase (DGAT) (Rv3130c), and many universal stress proteins (USPs). In this study, we apply comparative genomic analysis to characterize the DosR regulons in nine Mycobacterium genomes, Rhodococcus sp. RHA1, Nocardia farcinica, and Saccharopolyspora erythraea. The regulons are highly labile, containing eight core gene groups (regulators, kinases, USPs, DGATs, nitroreductases, ferredoxins, heat shock proteins, and the orthologs of the predicted kinase [Rv2004c] from M. tuberculosis) and 10 additional genes with more restricted taxonomic distribution that are mostly involved in anaerobic respiration. The largest regulon is observed in M. marinum and the smallest in M. abscessus. Analysis of large gene families encoding USPs, nitroreductases, and DGATs demonstrates a mosaic distribution of regulated and nonregulated members, suggesting frequent acquisition and loss of DosR-binding sites.
Mbio | 2014
Jennifer V. Kuehl; Morgan N. Price; Jayashree Ray; Zuelma Esquivel; Alexey E. Kazakov; Michelle Nguyen; Raquel Kuehn; Ronald W. Davis; Terry C. Hazen; Adam P. Arkin; Adam M. Deutschbauer
ABSTRACT The genomes of sulfate-reducing bacteria remain poorly characterized, largely due to a paucity of experimental data and genetic tools. To meet this challenge, we generated an archived library of 15,477 mapped transposon insertion mutants in the sulfate-reducing bacterium Desulfovibrio alaskensis G20. To demonstrate the utility of the individual mutants, we profiled gene expression in mutants of six regulatory genes and used these data, together with 1,313 high-confidence transcription start sites identified by tiling microarrays and transcriptome sequencing (5′ RNA-Seq), to update the regulons of Fur and Rex and to confirm the predicted regulons of LysX, PhnF, PerR, and Dde_3000, a histidine kinase. In addition to enabling single mutant investigations, the D. alaskensis G20 transposon mutants also contain DNA bar codes, which enables the pooling and analysis of mutant fitness for thousands of strains simultaneously. Using two pools of mutants that represent insertions in 2,369 unique protein-coding genes, we demonstrate that the hypothetical gene Dde_3007 is required for methionine biosynthesis. Using comparative genomics, we propose that Dde_3007 performs a missing step in methionine biosynthesis by transferring a sulfur group to O-phosphohomoserine to form homocysteine. Additionally, we show that the entire choline utilization cluster is important for fitness in choline sulfate medium, which confirms that a functional microcompartment is required for choline oxidation. Finally, we demonstrate that Dde_3291, a MerR-like transcription factor, is a choline-dependent activator of the choline utilization cluster. Taken together, our data set and genetic resources provide a foundation for systems-level investigation of a poorly studied group of bacteria of environmental and industrial importance. IMPORTANCE Sulfate-reducing bacteria contribute to global nutrient cycles and are a nuisance for the petroleum industry. Despite their environmental and industrial significance, the genomes of sulfate-reducing bacteria remain poorly characterized. Here, we describe a genetic approach to fill gaps in our knowledge of sulfate-reducing bacteria. We generated a large collection of archived, transposon mutants in Desulfovibrio alaskensis G20 and used the phenotypes of these mutant strains to infer the function of genes involved in gene regulation, methionine biosynthesis, and choline utilization. Our findings and mutant resources will enable systematic investigations into gene function, energy generation, stress response, and metabolism for this important group of bacteria. Sulfate-reducing bacteria contribute to global nutrient cycles and are a nuisance for the petroleum industry. Despite their environmental and industrial significance, the genomes of sulfate-reducing bacteria remain poorly characterized. Here, we describe a genetic approach to fill gaps in our knowledge of sulfate-reducing bacteria. We generated a large collection of archived, transposon mutants in Desulfovibrio alaskensis G20 and used the phenotypes of these mutant strains to infer the function of genes involved in gene regulation, methionine biosynthesis, and choline utilization. Our findings and mutant resources will enable systematic investigations into gene function, energy generation, stress response, and metabolism for this important group of bacteria.
Journal of Bacteriology | 2013
Alexey E. Kazakov; Dmitry A. Rodionov; Morgan N. Price; Adam P. Arkin; Inna Dubchak; Pavel S. Novichkov
Accurate detection of transcriptional regulatory elements is essential for high-quality genome annotation, metabolic reconstruction, and modeling of regulatory networks. We developed a computational approach for reconstruction of regulons operated by transcription factors (TFs) from large protein families and applied this novel approach to three TF families in 10 Desulfovibrionales genomes. Phylogenetic analyses of 125 regulators from the ArsR, Crp/Fnr, and GntR families revealed that 65% of these regulators (termed reference TFs) are well conserved in Desulfovibrionales, while the remaining 35% of regulators (termed singleton TFs) are species specific and show a mosaic distribution. For regulon reconstruction in the group of singleton TFs, the standard orthology-based approach was inefficient, and thus, we developed a novel approach based on the simultaneous study of all homologous TFs from the same family in a group of genomes. As a result, we identified binding for 21 singleton TFs and for all reference TFs in all three analyzed families. Within each TF family we observed structural similarities between DNA-binding motifs of different reference and singleton TFs. The collection of reconstructed regulons is available at the RegPrecise database (http://regprecise.lbl.gov/RegPrecise/Desulfovibrionales.jsp).
Journal of Bacteriology | 2013
Alexey E. Kazakov; Lara Rajeev; Eric G. Luning; Grant M. Zane; Kavya Siddartha; Dmitry A. Rodionov; Inna Dubchak; Adam P. Arkin; Judy D. Wall; Aindrila Mukhopadhyay; Pavel S. Novichkov
The trace elements molybdenum and tungsten are essential components of cofactors of many metalloenzymes. However, in sulfate-reducing bacteria, high concentrations of molybdate and tungstate oxyanions inhibit growth, thus requiring the tight regulation of their homeostasis. By a combination of bioinformatic and experimental techniques, we identified a novel regulator family, tungstate-responsive regulator (TunR), controlling the homeostasis of tungstate and molybdate in sulfate-reducing deltaproteobacteria. The effector-sensing domains of these regulators are similar to those of the known molybdate-responsive regulator ModE, while their DNA-binding domains are homologous to XerC/XerD site-specific recombinases. Using a comparative genomics approach, we identified DNA motifs and reconstructed regulons for 40 TunR family members. Positional analysis of TunR sites and putative promoters allowed us to classify most TunR proteins into two groups: (i) activators of modABC genes encoding a high-affinity molybdenum and tungsten transporting system and (ii) repressors of genes for toluene sulfonate uptake (TSUP) family transporters. The activation of modA and modBC genes by TunR in Desulfovibrio vulgaris Hildenborough was confirmed in vivo, and we discovered that the activation was diminished in the presence of tungstate. A predicted 30-bp TunR-binding motif was confirmed by in vitro binding assays. A novel TunR family of bacterial transcriptional factors controls tungstate and molybdate homeostasis in sulfate-reducing deltaproteobacteria. We proposed that TunR proteins participate in protection of the cells from the inhibition by these oxyanions. To our knowledge, this is a unique case of a family of bacterial transcriptional factors evolved from site-specific recombinases.