Network


Latest external collaboration on country level. Dive into details by clicking on the dots.

Hotspot


Dive into the research topics where Ilya E. Vorontsov is active.

Publication


Featured researches published by Ilya E. Vorontsov.


Nucleic Acids Research | 2013

HOCOMOCO: a comprehensive collection of human transcription factor binding sites models

Ivan V. Kulakovskiy; Yulia A. Medvedeva; Ulf Schaefer; Artem S. Kasianov; Ilya E. Vorontsov; Vladimir B. Bajic; Vsevolod J. Makeev

Transcription factor (TF) binding site (TFBS) models are crucial for computational reconstruction of transcription regulatory networks. In existing repositories, a TF often has several models (also called binding profiles or motifs), obtained from different experimental data. Having a single TFBS model for a TF is more pragmatic for practical applications. We show that integration of TFBS data from various types of experiments into a single model typically results in the improved model quality probably due to partial correction of source specific technique bias. We present the Homo sapiens comprehensive model collection (HOCOMOCO, http://autosome.ru/HOCOMOCO/, http://cbrc.kaust.edu.sa/hocomoco/) containing carefully hand-curated TFBS models constructed by integration of binding sequences obtained by both low- and high-throughput methods. To construct position weight matrices to represent these TFBS models, we used ChIPMunk software in four computational modes, including newly developed periodic positional prior mode associated with DNA helix pitch. We selected only one TFBS model per TF, unless there was a clear experimental evidence for two rather distinct TFBS models. We assigned a quality rating to each model. HOCOMOCO contains 426 systematically curated TFBS models for 401 human TFs, where 172 models are based on more than one data source.


Nucleic Acids Research | 2016

HOCOMOCO: expansion and enhancement of the collection of transcription factor binding sites models

Ivan V. Kulakovskiy; Ilya E. Vorontsov; Ivan S. Yevshin; Anastasiia V. Soboleva; Artem S. Kasianov; Haitham Ashoor; Wail Ba-alawi; Vladimir B. Bajic; Yulia A. Medvedeva; Fedor A. Kolpakov; Vsevolod J. Makeev

Models of transcription factor (TF) binding sites provide a basis for a wide spectrum of studies in regulatory genomics, from reconstruction of regulatory networks to functional annotation of transcripts and sequence variants. While TFs may recognize different sequence patterns in different conditions, it is pragmatic to have a single generic model for each particular TF as a baseline for practical applications. Here we present the expanded and enhanced version of HOCOMOCO (http://hocomoco.autosome.ru and http://www.cbrc.kaust.edu.sa/hocomoco10), the collection of models of DNA patterns, recognized by transcription factors. HOCOMOCO now provides position weight matrix (PWM) models for binding sites of 601 human TFs and, in addition, PWMs for 396 mouse TFs. Furthermore, we introduce the largest up to date collection of dinucleotide PWM models for 86 (52) human (mouse) TFs. The update is based on the analysis of massive ChIP-Seq and HT-SELEX datasets, with the validation of the resulting models on in vivo data. To facilitate a practical application, all HOCOMOCO models are linked to gene and protein databases (Entrez Gene, HGNC, UniProt) and accompanied by precomputed score thresholds. Finally, we provide command-line tools for PWM and diPWM threshold estimation and motif finding in nucleotide sequences.


Database | 2015

EpiFactors: a comprehensive database of human epigenetic factors and complexes

Yulia A. Medvedeva; Andreas Lennartsson; Rezvan Ehsani; Ivan V. Kulakovskiy; Ilya E. Vorontsov; Pouda Panahandeh; Grigory Khimulya; Takeya Kasukawa; Finn Drabløs

Epigenetics refers to stable and long-term alterations of cellular traits that are not caused by changes in the DNA sequence per se. Rather, covalent modifications of DNA and histones affect gene expression and genome stability via proteins that recognize and act upon such modifications. Many enzymes that catalyse epigenetic modifications or are critical for enzymatic complexes have been discovered, and this is encouraging investigators to study the role of these proteins in diverse normal and pathological processes. Rapidly growing knowledge in the area has resulted in the need for a resource that compiles, organizes and presents curated information to the researchers in an easily accessible and user-friendly form. Here we present EpiFactors, a manually curated database providing information about epigenetic regulators, their complexes, targets and products. EpiFactors contains information on 815 proteins, including 95 histones and protamines. For 789 of these genes, we include expressions values across several samples, in particular a collection of 458 human primary cell samples (for approximately 200 cell types, in many cases from three individual donors), covering most mammalian cell steady states, 255 different cancer cell lines (representing approximately 150 cancer subtypes) and 134 human postmortem tissues. Expression values were obtained by the FANTOM5 consortium using Cap Analysis of Gene Expression technique. EpiFactors also contains information on 69 protein complexes that are involved in epigenetic regulation. The resource is practical for a wide range of users, including biologists, pharmacologists and clinicians. Database URL: http://epifactors.autosome.ru


Algorithms for Molecular Biology | 2013

Jaccard index based similarity measure to compare transcription factor binding site models

Ilya E. Vorontsov; Ivan V. Kulakovskiy; Vsevolod J. Makeev

BackgroundPositional weight matrix (PWM) remains the most popular for quantification of transcription factor (TF) binding. PWM supplied with a score threshold defines a set of putative transcription factor binding sites (TFBS), thus providing a TFBS model.TF binding DNA fragments obtained by different experimental methods usually give similar but not identical PWMs. This is also common for different TFs from the same structural family. Thus it is often necessary to measure the similarity between PWMs. The popular tools compare PWMs directly using matrix elements. Yet, for log-odds PWMs, negative elements do not contribute to the scores of highly scoring TFBS and thus may be different without affecting the sets of the best recognized binding sites. Moreover, the two TFBS sets recognized by a given pair of PWMs can be more or less different depending on the score thresholds.ResultsWe propose a practical approach for comparing two TFBS models, each consisting of a PWM and the respective scoring threshold. The proposed measure is a variant of the Jaccard index between two TFBS sets. The measure defines a metric space for TFBS models of all finite lengths. The algorithm can compare TFBS models constructed using substantially different approaches, like PWMs with raw positional counts and log-odds. We present the efficient software implementation: MACRO-APE (MAtrix CompaRisOn by Approximate P-value Estimation).ConclusionsMACRO-APE can be effectively used to compute the Jaccard index based similarity for two TFBS models. A two-pass scanning algorithm is presented to scan a given collection of PWMs for PWMs similar to a given query.Availability and implementationMACRO-APE is implemented in ruby 1.9; software including source code and a manual is freely available at http://autosome.ru/macroape/ and in supplementary materials.


Nucleic Acids Research | 2018

HOCOMOCO: towards a complete collection of transcription factor binding models for human and mouse via large-scale ChIP-Seq analysis

Ivan V. Kulakovskiy; Ilya E. Vorontsov; Ivan S. Yevshin; Ruslan N. Sharipov; Alla D. Fedorova; Eugene I. Rumynskiy; Yulia A. Medvedeva; Arturo Magana-Mora; Vladimir B. Bajic; Dmitri A. Papatsenko; Fedor A. Kolpakov; Vsevolod J. Makeev

Abstract We present a major update of the HOCOMOCO collection that consists of patterns describing DNA binding specificities for human and mouse transcription factors. In this release, we profited from a nearly doubled volume of published in vivo experiments on transcription factor (TF) binding to expand the repertoire of binding models, replace low-quality models previously based on in vitro data only and cover more than a hundred TFs with previously unknown binding specificities. This was achieved by systematic motif discovery from more than five thousand ChIP-Seq experiments uniformly processed within the BioUML framework with several ChIP-Seq peak calling tools and aggregated in the GTRD database. HOCOMOCO v11 contains binding models for 453 mouse and 680 human transcription factors and includes 1302 mononucleotide and 576 dinucleotide position weight matrices, which describe primary binding preferences of each transcription factor and reliable alternative binding specificities. An interactive interface and bulk downloads are available on the web: http://hocomoco.autosome.ru and http://www.cbrc.kaust.edu.sa/hocomoco11. In this release, we complement HOCOMOCO by MoLoTool (Motif Location Toolbox, http://molotool.autosome.ru) that applies HOCOMOCO models for visualization of binding sites in short DNA sequences.


international conference on bioinformatics | 2015

PERFECTOS-APE - Predicting Regulatory Functional Effect of SNPs by Approximate P-value Estimation

Ilya E. Vorontsov; Ivan V. Kulakovskiy; Grigory Khimulya; Daria D. Nikolaeva; Vsevolod J. Makeev

Single nucleotide polymorphisms (SNPs) and variants (SNVs) are often found in regulatory regions of human genome. Nucleotide substitutions in promoter and enhancer regions may affect transcription factor (TF) binding and alter gene expression regulation. Nowadays binding patterns are known for hundreds of human TFs. Thus one can assess possible functional effects of allele variations or mutations in TF binding sites using


BMC Genomics | 2016

Negative selection maintains transcription factor binding motifs in human cancer

Ilya E. Vorontsov; Grigory Khimulya; Elena Lukianova; Daria D. Nikolaeva; Irina A. Eliseeva; Ivan V. Kulakovskiy; Vsevolod J. Makeev

BackgroundSomatic mutations in cancer cells affect various genomic elements disrupting important cell functions. In particular, mutations in DNA binding sites recognized by transcription factors can alter regulator binding affinities and, consequently, expression of target genes. A number of promoter mutations have been linked with an increased risk of cancer. Cancer somatic mutations in binding sites of selected transcription factors have been found under positive selection. However, action and significance of negative selection in non-coding regions remain controversial.ResultsHere we present analysis of transcription factor binding motifs co-localized with non-coding variants. To avoid statistical bias we account for mutation signatures of different cancer types. For many transcription factors, including multiple members of FOX, HOX, and NR families, we show that human cancers accumulate fewer mutations than expected by chance that increase or decrease affinity of predicted binding sites. Such stability of binding motifs is even more exhibited in DNase accessible regions.ConclusionsOur data demonstrate negative selection against binding sites alterations and suggest that such selection pressure protects cancer cells from rewiring of regulatory circuits. Further analysis of transcription factors with conserved binding motifs can reveal cell regulatory pathways crucial for the survivability of various human cancers.


Biochimica et Biophysica Acta | 2016

Early B-cell factor 1 (EBF1) is critical for transcriptional control of SLAMF1 gene in human B cells.

Anton M. Schwartz; Lidia V. Putlyaeva; Milica Covich; Anna V. Klepikova; Kseniya A. Akulich; Ilya E. Vorontsov; Kirill V. Korneev; Sergey E. Dmitriev; O. L. Polanovsky; Svetlana P. Sidorenko; Ivan V. Kulakovskiy; Dmitry V. Kuprash

Signaling lymphocytic activation molecule family member 1 (SLAMF1)/CD150 is a co-stimulatory receptor expressed on a variety of hematopoietic cells, in particular on mature lymphocytes activated by specific antigen, costimulation and cytokines. Changes in CD150 expression level have been reported in association with autoimmunity and with B-cell chronic lymphocytic leukemia. We characterized the core promoter for SLAMF1 gene in human B-cell lines and explored binding sites for a number of transcription factors involved in B cell differentiation and activation. Mutations of SP1, STAT6, IRF4, NF-kB, ELF1, TCF3, and SPI1/PU.1 sites resulted in significantly decreased promoter activity of varying magnitude, depending on the cell line tested. The most profound effect on the promoter strength was observed upon mutation of the binding site for Early B-cell factor 1 (EBF1). This mutation produced a 10-20 fold drop in promoter activity and pinpointed EBF1 as the master regulator of human SLAMF1 gene in B cells. We also identified three potent transcriptional enhancers in human SLAMF1 locus, each containing functional EBF1 binding sites. Thus, EBF1 interacts with specific binding sites located both in the promoter and in the enhancer regions of the SLAMF1 gene and is critical for its expression in human B cells.


Gene | 2017

Multiple single nucleotide polymorphisms in the first intron of the IL2RA gene affect transcription factor binding and enhancer activity

Anton M. Schwartz; Denis E. Demin; Ilya E. Vorontsov; Artem S. Kasyanov; Lidia V. Putlyaeva; Karina A. Tatosyan; Ivan V. Kulakovskiy; Dmitry V. Kuprash

IL2RA gene encodes the alpha subunit of a high-affinity receptor for interleukin-2 which is expressed by several distinct populations of lymphocytes involved in autoimmune processes. A large number of polymorphic alleles of the IL2RA locus are associated with the development of various autoimmune diseases. With bioinformatics analysis we the dissected the first intron of the IL2RA gene and selected several single nucleotide polymorphisms (SNPs) that may influence the regulation of the IL2RA gene in cell types relevant to autoimmune pathology. We described five enhancers containing the selected SNPs that stimulated activity of the IL2RA promoter in a cell-type specific manner, and tested the effect of specific SNP alleles on activity of the respective enhancers (E1 to E5, labeled according to the distance to the promoter). The E4 enhancer with minor T variant of rs61839660 SNP demonstrated reduced activity due to disrupted binding of MEF2A/C transcription factors (TFs). Neither rs706778 nor rs706779 SNPs, both associated with a number of autoimmune diseases, had any effect on the activity of the enhancer E2. However, rare variants of several SNPs (rs139767239, rs115133228, rs12722502, rs12722635) genetically linked to either rs706778 and/or rs706779 significantly influenced the activity of E1, E3 and E5 enhancers, presumably by disrupting EBF1, GABPA and ELF1 binding sites.


PLOS ONE | 2017

The single nucleotide variant rs12722489 determines differential estrogen receptor binding and enhancer properties of an IL2RA intronic region

Marina A. Afanasyeva; Lidia V. Putlyaeva; Denis E. Demin; Ivan V. Kulakovskiy; Ilya E. Vorontsov; Marina V. Fridman; Vsevolod J. Makeev; Dmitry V. Kuprash; Anton M. Schwartz

We studied functional effect of rs12722489 single nucleotide polymorphism located in the first intron of human IL2RA gene on transcriptional regulation. This polymorphism is associated with multiple autoimmune conditions (rheumatoid arthritis, multiple sclerosis, Crohns disease, and ulcerative colitis). Analysis in silico suggested significant difference in the affinity of estrogen receptor (ER) binding site between alternative allelic variants, with stronger predicted affinity for the risk (G) allele. Electrophoretic mobility shift assay showed that purified human ERα bound only G variant of a 32-bp genomic sequence containing rs12722489. Chromatin immunoprecipitation demonstrated that endogenous human ERα interacted with rs12722489 genomic region in vivo and DNA pull-down assay confirmed differential allelic binding of amplified 189-bp genomic fragments containing rs12722489 with endogenous human ERα. In a luciferase reporter assay, a kilobase-long genomic segment containing G but not A allele of rs12722489 demonstrated enhancer properties in MT-2 cell line, an HTLV-1 transformed human cell line with a regulatory T cell phenotype.

Collaboration


Dive into the Ilya E. Vorontsov's collaboration.

Top Co-Authors

Avatar

Ivan V. Kulakovskiy

Engelhardt Institute of Molecular Biology

View shared research outputs
Top Co-Authors

Avatar

Vsevolod J. Makeev

Russian Academy of Sciences

View shared research outputs
Top Co-Authors

Avatar

Yulia A. Medvedeva

Russian Academy of Sciences

View shared research outputs
Top Co-Authors

Avatar

Fedor A. Kolpakov

Russian Academy of Sciences

View shared research outputs
Top Co-Authors

Avatar

Ivan S. Yevshin

Russian Academy of Sciences

View shared research outputs
Top Co-Authors

Avatar

Vladimir B. Bajic

King Abdullah University of Science and Technology

View shared research outputs
Top Co-Authors

Avatar

Anton M. Schwartz

Engelhardt Institute of Molecular Biology

View shared research outputs
Top Co-Authors

Avatar

Artem S. Kasianov

Russian Academy of Sciences

View shared research outputs
Top Co-Authors

Avatar

Dmitry V. Kuprash

Engelhardt Institute of Molecular Biology

View shared research outputs
Top Co-Authors

Avatar

Grigory Khimulya

Russian Academy of Sciences

View shared research outputs
Researchain Logo
Decentralizing Knowledge