
Pavel S. Novichkov

Lawrence Berkeley National Laboratory

Publication

Featured research published by Pavel S. Novichkov.

Nucleic Acids Research | 2010

MicrobesOnline: an integrated portal for comparative and functional genomics

Paramvir Dehal; Marcin P. Joachimiak; Morgan N. Price; John T. Bates; Jason K. Baumohl; Dylan Chivian; Greg D. Friedland; Katherine H. Huang; Keith Keller; Pavel S. Novichkov; Inna Dubchak; Eric Alm; Adam P. Arkin

Since 2003, MicrobesOnline (http://www.microbesonline.org) has been providing a community resource for comparative and functional genome analysis. The portal includes over 1000 complete genomes of bacteria, archaea and fungi and thousands of expression microarrays from diverse organisms ranging from model organisms such as Escherichia coli and Saccharomyces cerevisiae to environmental microbes such as Desulfovibrio vulgaris and Shewanella oneidensis. To assist in annotating genes and in reconstructing their evolutionary history, MicrobesOnline includes a comparative genome browser based on phylogenetic trees for every gene family as well as a species tree. To identify co-regulated genes, MicrobesOnline can search for genes based on their expression profile, and provides tools for identifying regulatory motifs and seeing if they are conserved. MicrobesOnline also includes fast phylogenetic profile searches, comparative views of metabolic pathways, operon predictions, a workbench for sequence analysis and integration with RegTransBase and other microbial genome resources. The next update of MicrobesOnline will contain significant new functionality, including comparative analysis of metagenomic sequence data. Programmatic access to the database, along with source code and documentation, is available at http://microbesonline.org/programmers.html.

Biology Direct | 2007

Clusters of orthologous genes for 41 archaeal genomes and implications for evolutionary genomics of archaea

Kira S. Makarova; Alexander V. Sorokin; Pavel S. Novichkov; Yuri I. Wolf; Eugene V. Koonin

Background: An evolutionary classification of genes from sequenced genomes that distinguishes between orthologs and paralogs is indispensable for genome annotation and evolutionary reconstruction. Shortly after multiple genome sequences of bacteria, archaea, and unicellular eukaryotes became available, an attempt at such a classification was implemented in Clusters of Orthologous Groups of proteins (COGs). Rapid accumulation of genome sequences creates opportunities for refining COGs but also represents a challenge because of error amplification. One of the practical strategies involves construction of refined COGs for phylogenetically compact subsets of genomes.

Results: New Archaeal Clusters of Orthologous Genes (arCOGs) were constructed for 41 archaeal genomes (13 Crenarchaeota, 27 Euryarchaeota and one Nanoarchaeon) using an improved procedure that employs a similarity tree between smaller, group-specific clusters, semi-automatically partitions orthology domains in multidomain proteins, and uses profile searches for identification of remote orthologs. The annotation of arCOGs is a consensus between three assignments based on the COGs, the CDD database, and the annotations of homologs in the NR database. The 7538 arCOGs, on average, cover ~88% of the genes in a genome compared to a ~76% coverage in COGs. The finer granularity of ortholog identification in the arCOGs is apparent from the fact that 4538 arCOGs correspond to 2362 COGs; ~40% of the arCOGs are new. The archaeal gene core (protein-coding genes found in all 41 genomes) consists of 166 arCOGs. The arCOGs were used to reconstruct gene loss and gene gain events during archaeal evolution and gene sets of ancestral forms. The Last Archaeal Common Ancestor (LACA) is conservatively estimated to possess 996 genes compared to 1245 and 1335 genes for the last common ancestors of Crenarchaeota and Euryarchaeota, respectively. It is inferred that LACA was a chemoautotrophic hyperthermophile that, in addition to the core archaeal functions, encoded more idiosyncratic systems, e.g., the CASS systems of antivirus defense and some toxin-antitoxin systems.

Conclusion: The arCOGs provide a convenient, flexible framework for functional annotation of archaeal genomes, comparative genomics and evolutionary reconstructions. Genomic reconstructions suggest that the last common ancestor of archaea might have been (nearly) as advanced as the modern archaeal hyperthermophiles. ArCOGs and related information are available at: ftp://ftp.ncbi.nih.gov/pub/koonin/arCOGs/.

Reviewers: This article was reviewed by Peer Bork, Patrick Forterre, and Purificacion Lopez-Garcia.
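
The abstract defines the archaeal gene core as the clusters found in all 41 genomes. As a rough illustration of that definition only, the Python sketch below picks out clusters present in every genome of a set. The function name, the cluster identifiers and the genome names are hypothetical, and this is not the arCOG construction procedure itself.

```python
def core_clusters(membership, genomes):
    """Return the clusters that have a member in every listed genome.

    membership: dict mapping a cluster id to the set of genomes in which
                the cluster has at least one gene (hypothetical input layout).
    genomes:    iterable of all genome names under consideration.
    """
    required = set(genomes)
    # A cluster is "core" when the required genomes are a subset of
    # the genomes in which it occurs.
    return sorted(
        cluster for cluster, present in membership.items()
        if required <= present
    )


if __name__ == "__main__":
    # Toy data, made up for illustration.
    toy_membership = {
        "cluster_A": {"g1", "g2", "g3"},
        "cluster_B": {"g1", "g3"},
        "cluster_C": {"g1", "g2", "g3"},
    }
    print(core_clusters(toy_membership, ["g1", "g2", "g3"]))
```

With real data, the same subset test over 41 genomes would be the check behind the "166 arCOGs" core figure, assuming a presence/absence table of this shape is available.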

Nucleic Acids Research | 2010

RegPrecise: a database of curated genomic inferences of transcriptional regulatory interactions in prokaryotes.

Pavel S. Novichkov; Olga N. Laikova; Elena S. Novichkova; Mikhail S. Gelfand; Adam P. Arkin; Inna Dubchak; Dmitry A. Rodionov

The RegPrecise database (http://regprecise.lbl.gov) was developed for the capture, visualization and analysis of predicted transcription factor regulons in prokaryotes that were reconstructed and manually curated using the comparative genomic approach. A significant number of high-quality inferences of transcriptional regulatory interactions have already been accumulated for diverse taxonomic groups of bacteria. The reconstructed regulons include transcription factors, their cognate DNA motifs and regulated genes/operons linked to the candidate transcription factor binding sites. RegPrecise allows browsing the regulon collections for: (i) conservation of DNA binding sites and regulated genes for a particular regulon across diverse taxonomic lineages; (ii) sets of regulons for a family of transcription factors; (iii) repertoire of regulons in a particular taxonomic group of species; (iv) regulons associated with a metabolic pathway or a biological process in various genomes. The initial release of the database includes ∼11 500 candidate binding sites for ∼400 orthologous groups of transcription factors from over 350 prokaryotic genomes. The majority of these data are represented by genome-wide regulon reconstructions in the Shewanella and Streptococcus genera and a large-scale prediction of regulons for the LacI family of transcription factors. Another section of the database presents the results of accurate regulon propagation to closely related genomes.

Proceedings of the National Academy of Sciences of the United States of America | 2009

The universal distribution of evolutionary rates of genes and distinct characteristics of eukaryotic genes of different apparent ages

Yuri I. Wolf; Pavel S. Novichkov; Georgy P. Karev; Eugene V. Koonin; David J. Lipman

The evolutionary rates of protein-coding genes in an organism span, approximately, 3 orders of magnitude and show a universal, approximately log-normal distribution in a broad variety of species from prokaryotes to mammals. This universal distribution implies a steady-state process, with identical distributions of evolutionary rates among genes that are gained and genes that are lost. A mathematical model of such process is developed under the single assumption of the constancy of the distributions of the propensities for gene loss (PGL). This model predicts that genes of different ages, that is, genes with homologs detectable at different phylogenetic depths, substantially differ in those variables that correlate with PGL. We computationally partition protein-coding genes from humans, flies, and Aspergillus fungus into age classes, and show that genes of different ages retain the universal log-normal distribution of evolutionary rates, with a shift toward higher rates in “younger” classes but also with a substantial overlap. The only exception involves human primate-specific genes that show a heavy tail of rapidly evolving genes, probably owing to gene annotation artifacts. As predicted, the gene age classes differ in characteristics correlated with PGL. Compared with “young” genes (e.g., mammal-specific human ones), “old” genes (e.g., eukaryote-specific), on average, are longer, are expressed at a higher level, possess a higher intron density, evolve slower on the short time scale, and are subject to stronger purifying selection. Thus, genome evolution fits a simple model with approximately uniform rates of gene gain and loss, without major bursts of genomic innovation.

BMC Genomics | 2013

RegPrecise 3.0 – A resource for genome-scale exploration of transcriptional regulation in bacteria

Pavel S. Novichkov; Alexey E. Kazakov; Dmitry A. Ravcheev; Semen A. Leyn; Galina Yu Kovaleva; Roman A. Sutormin; Marat D. Kazanov; William J Riehl; Adam P. Arkin; Inna Dubchak; Dmitry A. Rodionov

Background: Genome-scale prediction of gene regulation and reconstruction of transcriptional regulatory networks in prokaryotes is one of the critical tasks of modern genomics. Bacteria from different taxonomic groups, whose lifestyles and natural environments are substantially different, possess highly diverged transcriptional regulatory networks. The comparative genomics approaches are useful for in silico reconstruction of bacterial regulons and networks operated by both transcription factors (TFs) and RNA regulatory elements (riboswitches).

Description: RegPrecise (http://regprecise.lbl.gov) is a web resource for collection, visualization and analysis of transcriptional regulons reconstructed by comparative genomics. We significantly expanded a reference collection of manually curated regulons we introduced earlier. RegPrecise 3.0 provides access to inferred regulatory interactions organized by phylogenetic, structural and functional properties. Taxonomy-specific collections include 781 TF regulogs inferred in more than 160 genomes representing 14 taxonomic groups of Bacteria. TF-specific collections include regulogs for a selected subset of 40 TFs reconstructed across more than 30 taxonomic lineages. Novel collections of regulons operated by RNA regulatory elements (riboswitches) include nearly 400 regulogs inferred in 24 bacterial lineages. RegPrecise 3.0 provides four classifications of the reference regulons implemented as controlled vocabularies: 55 TF protein families; 43 RNA motif families; ~150 biological processes or metabolic pathways; and ~200 effectors or environmental signals. Genome-wide visualizations of regulatory networks and metabolic pathways covered by the reference regulons are available for all studied genomes. A separate section of RegPrecise 3.0 contains draft regulatory networks in 640 genomes obtained by a conservative propagation of the reference regulons to closely related genomes.

Conclusions: RegPrecise 3.0 gives access to the transcriptional regulons reconstructed in bacterial genomes. Analytical capabilities include exploration of: regulon content, structure and function; TF binding site motifs; conservation and variations in genome-wide regulatory networks across all taxonomic groups of Bacteria. RegPrecise 3.0 was selected as a core resource on transcriptional regulation of the Department of Energy Systems Biology Knowledgebase, an emerging software and data environment designed to enable researchers to collaboratively generate, test and share new hypotheses about gene and protein functions, perform large-scale analyses, and model interactions in microbes, plants, and their communities.

Nucleic Acids Research | 2010

RegPredict: an integrated system for regulon inference in prokaryotes by comparative genomics approach

Pavel S. Novichkov; Dmitry A. Rodionov; Elena D. Stavrovskaya; Elena S. Novichkova; Alexey E. Kazakov; Mikhail S. Gelfand; Adam P. Arkin; Andrey A. Mironov; Inna Dubchak

The RegPredict web server provides tools for the reconstruction and analysis of microbial regulons using a comparative genomics approach. The server allows the user to rapidly generate reference sets of regulons and regulatory motif profiles in a group of prokaryotic genomes. The new concept of a cluster of co-regulated orthologous operons allows the user to distribute the analysis of large regulons and to perform the comparative analysis of multiple clusters independently. Two major workflows currently implemented in RegPredict are: (i) regulon reconstruction for a known regulatory motif and (ii) ab initio inference of a novel regulon using several scenarios for the generation of starting gene sets. RegPredict provides a comprehensive collection of manually curated positional weight matrices of regulatory motifs. It is based on genomic sequences, ortholog and operon predictions from MicrobesOnline. An interactive web interface of RegPredict integrates and presents diverse genomic and functional information about the candidate regulon members from several web resources. RegPredict is freely accessible at http://regpredict.lbl.gov.

Nucleic Acids Research | 2007

RegTransBase—a database of regulatory sequences and interactions in a wide range of prokaryotic genomes

Alexei E. Kazakov; Michael J. Cipriano; Pavel S. Novichkov; Simon Minovitsky; Dmitry V. Vinogradov; Adam P. Arkin; Andrey A. Mironov; Mikhail S. Gelfand; Inna Dubchak

RegTransBase is a manually curated database of regulatory interactions in prokaryotes that captures the knowledge in public scientific literature using a controlled vocabulary. Although several databases describing interactions between regulatory proteins and their binding sites are already being maintained, they either focus mostly on the model organisms Escherichia coli and Bacillus subtilis or are entirely computationally derived. RegTransBase describes a large number of regulatory interactions reported in many organisms and contains the following types of experimental data: the activation or repression of transcription by an identified direct regulator, determining the transcriptional regulatory function of a protein (or RNA) directly binding to DNA (RNA), mapping or prediction of a binding site for a regulatory protein and characterization of regulatory mutations. Currently, RegTransBase content is derived from about 3000 relevant articles describing over 7000 experiments in relation to 128 microbes. It contains data on the regulation of about 7500 genes and evidence for 6500 interactions with 650 regulators. RegTransBase also contains manually created position weight matrices (PWM) that can be used to identify candidate regulatory sites in over 60 species. RegTransBase is available at .
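
The abstract notes that position weight matrices can be used to identify candidate regulatory sites. The Python sketch below shows the general idea: build a log-odds matrix from a handful of aligned sites, then score every window of a sequence. The pseudocount, the uniform background frequency, the function names and the example sites are all assumptions made for illustration. None of this is the RegTransBase implementation or its data.

```python
import math

BASES = "ACGT"


def build_pwm(sites, pseudocount=1.0, background=0.25):
    """Build a log-odds position weight matrix from equal-length sites.

    The pseudocount and uniform background are illustrative assumptions.
    """
    width = len(sites[0])
    if any(len(site) != width for site in sites):
        raise ValueError("all sites must have the same length")
    total = len(sites) + pseudocount * len(BASES)
    pwm = []
    for i in range(width):
        column = [site[i].upper() for site in sites]
        pwm.append({
            base: math.log2((column.count(base) + pseudocount) / total / background)
            for base in BASES
        })
    return pwm


def score_site(pwm, site):
    """Sum the per-position log-odds scores (site must contain only A/C/G/T)."""
    return sum(column[base] for column, base in zip(pwm, site.upper()))


def best_hit(pwm, sequence):
    """Return (score, position) of the highest-scoring window in sequence."""
    width = len(pwm)
    return max(
        (score_site(pwm, sequence[i:i + width]), i)
        for i in range(len(sequence) - width + 1)
    )


if __name__ == "__main__":
    toy_sites = ["TTGACA", "TTGACA", "TTGACT", "TTGATA"]  # made-up examples
    toy_pwm = build_pwm(toy_sites)
    print(best_hit(toy_pwm, "GGGGTTGACAGGGG"))
```

In practice a score threshold would be chosen per motif, and sequences containing ambiguous bases would need extra handling that this sketch leaves out.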

Journal of Bacteriology | 2012

Transcriptional Regulation of Central Carbon and Energy Metabolism in Bacteria by Redox-Responsive Repressor Rex

Dmitry A. Ravcheev; Xiaoqing Li; Haythem Latif; Karsten Zengler; Semen A. Leyn; Yuri D. Korostelev; Alexey E. Kazakov; Pavel S. Novichkov; Andrei L. Osterman; Dmitry A. Rodionov

Redox-sensing repressor Rex was previously implicated in the control of anaerobic respiration in response to the cellular NADH/NAD(+) levels in gram-positive bacteria. We utilized the comparative genomics approach to infer candidate Rex-binding DNA motifs and assess the Rex regulon content in 119 genomes from 11 taxonomic groups. Both DNA-binding and NAD-sensing domains are broadly conserved in Rex orthologs identified in the phyla Firmicutes, Thermotogales, Actinobacteria, Chloroflexi, Deinococcus-Thermus, and Proteobacteria. The identified DNA-binding motifs showed significant conservation in these species, with the only exception detected in Clostridia, where the Rex motif deviates in two positions from the generalized consensus, TTGTGAANNNNTTCACAA. Comparative analysis of candidate Rex sites revealed remarkable variations in functional repertoires of candidate Rex-regulated genes in various microorganisms. Most of the reconstructed regulatory interactions are lineage specific, suggesting frequent events of gain and loss of regulator binding sites in the evolution of Rex regulons. We identified more than 50 novel Rex-regulated operons encoding functions that are essential for resumption of the NADH:NAD(+) balance. The novel functional role of Rex in the control of the central carbon metabolism and hydrogen production genes was validated by in vitro DNA binding assays using the TM0169 protein in the hydrogen-producing bacterium Thermotoga maritima.
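
The generalized consensus quoted in the abstract, TTGTGAANNNNTTCACAA, can be used for a simple sequence scan. The Python sketch below treats N as a wildcard and reports windows within a chosen number of mismatches. The function name and the default mismatch threshold are assumptions for illustration. The study itself inferred motifs by comparative genomics, not by this kind of plain consensus matching.

```python
# Generalized Rex consensus as quoted in the abstract; N matches any base.
REX_CONSENSUS = "TTGTGAANNNNTTCACAA"


def find_consensus_sites(sequence, consensus=REX_CONSENSUS, max_mismatches=2):
    """Return (position, site, mismatches) for every window of the sequence
    that differs from the consensus in at most max_mismatches non-N positions.

    The default threshold of 2 is an arbitrary illustrative choice.
    """
    sequence = sequence.upper()
    width = len(consensus)
    hits = []
    for start in range(len(sequence) - width + 1):
        window = sequence[start:start + width]
        mismatches = sum(
            1 for expected, observed in zip(consensus, window)
            if expected != "N" and expected != observed
        )
        if mismatches <= max_mismatches:
            hits.append((start, window, mismatches))
    return hits


if __name__ == "__main__":
    # Made-up sequence with one exact match to the consensus.
    toy_sequence = "ACGT" + "TTGTGAAACGTTTCACAA" + "GGCC"
    for hit in find_consensus_sites(toy_sequence, max_mismatches=0):
        print(hit)
```

A scan like this only checks one strand. Because the consensus is palindromic, an exact match reads the same on the reverse complement, but sites with mismatches may not.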

BMC Genomics | 2011

Comparative genomic reconstruction of transcriptional networks controlling central metabolism in the Shewanella genus

Dmitry A. Rodionov; Pavel S. Novichkov; Elena D. Stavrovskaya; Irina A. Rodionova; Xiaoqing Li; Marat D. Kazanov; Dmitry A. Ravcheev; Anna V. Gerasimova; Alexey E. Kazakov; Galina Yu Kovaleva; Elizabeth A. Permina; Olga N. Laikova; Ross Overbeek; Margaret F. Romine; James K. Fredrickson; Adam P. Arkin; Inna Dubchak; Andrei L. Osterman; Mikhail S. Gelfand

Background: Genome-scale prediction of gene regulation and reconstruction of transcriptional regulatory networks in bacteria is one of the critical tasks of modern genomics. The Shewanella genus is comprised of metabolically versatile gamma-proteobacteria, whose lifestyles and natural environments are substantially different from Escherichia coli and other model bacterial species. The comparative genomics approaches and computational identification of regulatory sites are useful for the in silico reconstruction of transcriptional regulatory networks in bacteria.

Results: To explore conservation and variations in the Shewanella transcriptional networks we analyzed the repertoire of transcription factors and performed genomics-based reconstruction and comparative analysis of regulons in 16 Shewanella genomes. The inferred regulatory network includes 82 transcription factors and their DNA binding sites, 8 riboswitches and 6 translational attenuators. Forty-five regulons were newly inferred from the genome context analysis, whereas others were propagated from previously characterized regulons in the Enterobacteria and Pseudomonas spp. Multiple variations in regulatory strategies between the Shewanella spp. and E. coli include regulon contraction and expansion (as in the case of PdhR, HexR, FadR), numerous cases of recruiting non-orthologous regulators to control equivalent pathways (e.g. PsrA for fatty acid degradation) and, conversely, orthologous regulators to control distinct pathways (e.g. TyrR, ArgR, Crp).

Conclusions: We tentatively defined the first reference collection of ~100 transcriptional regulons in 16 Shewanella genomes. The resulting regulatory network contains ~600 regulated genes per genome that are mostly involved in metabolism of carbohydrates, amino acids, fatty acids, vitamins, metals, and stress responses. Several reconstructed regulons including NagR for N-acetylglucosamine catabolism were experimentally validated in S. oneidensis MR-1. Analysis of correlations in gene expression patterns helps to interpret the reconstructed regulatory network. The inferred regulatory interactions will provide additional regulatory constraints for an integrated model of metabolism and regulation in S. oneidensis MR-1.

Journal of Bacteriology | 2004

Genome-Wide Molecular Clock and Horizontal Gene Transfer in Bacterial Evolution

Pavel S. Novichkov; Marina V. Omelchenko; Mikhail S. Gelfand; Andrei A. Mironov; Yuri I. Wolf; Eugene V. Koonin

We describe a simple theoretical framework for identifying orthologous sets of genes that deviate from a clock-like model of evolution. The approach used is based on comparing the evolutionary distances within a set of orthologs to a standard intergenomic distance, which was defined as the median of the distribution of the distances between all one-to-one orthologs. Under the clock-like model, the points on a plot of intergenic distances versus intergenomic distances are expected to fit a straight line. A statistical technique to identify significant deviations from the clock-like behavior is described. For several hundred analyzed orthologous sets representing three well-defined bacterial lineages, the alpha-Proteobacteria, the gamma-Proteobacteria, and the Bacillus-Clostridium group, the clock-like null hypothesis could not be rejected for approximately 70% of the sets, whereas the rest showed substantial anomalies. Subsequent detailed phylogenetic analysis of the genes with the strongest deviations indicated that over one-half of these genes probably underwent a distinct form of horizontal gene transfer, xenologous gene displacement, in which a gene is displaced by an ortholog from a different lineage. The remaining deviations from the clock-like model could be explained by lineage-specific acceleration of evolution. The results indicate that although xenologous gene displacement is a major force in bacterial evolution, a significant majority of orthologous gene sets in three major bacterial lineages evolved in accordance with the clock-like model. The approach described here allows rapid detection of deviations from this mode of evolution on the genome scale.
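
The clock test described above can be sketched in a few lines of Python: take the median ortholog distance for each genome pair as the intergenomic distance, fit a straight line through the origin for one orthologous set, and flag pairs that fall far from the line. The abstract does not spell out the statistical technique, so the relative-deviation threshold used here is a stand-in assumption, and the function names and toy distances are hypothetical.

```python
from statistics import median


def intergenomic_distances(pair_distances):
    """Median distance over all one-to-one orthologs, per genome pair.

    pair_distances: dict mapping (genome_a, genome_b) to a list of
                    ortholog distances for that pair.
    """
    return {pair: median(values) for pair, values in pair_distances.items()}


def fit_through_origin(xs, ys):
    """Least-squares slope k of the line y = k * x."""
    sxx = sum(x * x for x in xs)
    if sxx == 0:
        raise ValueError("all intergenomic distances are zero")
    return sum(x * y for x, y in zip(xs, ys)) / sxx


def clock_deviations(gene_distances, genome_distances, tolerance=0.5):
    """Return (slope, flagged_pairs) for one orthologous set.

    A pair is flagged when its gene distance deviates from the fitted line
    by more than `tolerance` in relative terms. This threshold is an
    illustrative stand-in, not the statistic used in the paper.
    """
    pairs = sorted(gene_distances)
    xs = [genome_distances[pair] for pair in pairs]
    ys = [gene_distances[pair] for pair in pairs]
    slope = fit_through_origin(xs, ys)
    flagged = []
    for pair, x, y in zip(pairs, xs, ys):
        expected = slope * x
        if expected > 0 and abs(y - expected) / expected > tolerance:
            flagged.append(pair)
    return slope, flagged


if __name__ == "__main__":
    # Toy distances, made up for illustration.
    toy_pairs = {
        ("A", "B"): [0.1, 0.2, 0.3],
        ("A", "C"): [0.4, 0.5, 0.6],
        ("B", "C"): [0.3, 0.4, 0.5],
    }
    genome_d = intergenomic_distances(toy_pairs)
    suspect_gene = {("A", "B"): 0.2, ("A", "C"): 0.5, ("B", "C"): 0.05}
    print(clock_deviations(suspect_gene, genome_d))
```

A gene that evolves faster or slower than average but at a constant rate still lies on a straight line, only with a different slope, which is why the sketch fits the slope per orthologous set and does not assume it equals one.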
