Network


Latest external collaboration on country level. Dive into details by clicking on the dots.

Hotspot


Dive into the research topics where Boris V. Dobrov is active.

Publication


Featured researches published by Boris V. Dobrov.


cross language evaluation forum | 2005

Socio-political thesaurus in concept-based information retrieval

Mikhail Ageev; Boris V. Dobrov; Natalia V. Loukachevitch

In CLEF 2005 experiments we used a bilingual Russian-English Socio-Political Thesaurus that we developed over more than 10 years as a tool for automatic text processing in information retrieval tasks. The same resource and the same algorithms were used for the ad-hoc and domain–specific task.


pattern recognition and machine intelligence | 2011

Combining evidence for automatic extraction of terms

Boris V. Dobrov; Natalia V. Loukachevitch

The paper describes the method of extraction of two-word domain terms combining their features. The features are computed from three sources: the occurrence statistics in a domain-specific text collection, the statistics of global search engines, and a domain-specific thesaurus. The evaluation of the approach is based on the terminology of manually created thesauri. We show that the use of multiple features considerably improves the automatic extraction of domain-specific terms. We compare the quality of the proposed method in two different domains.


artificial intelligence and natural language | 2017

RuThes Thesaurus in Detecting Russian Paraphrases

Natalia V. Loukachevitch; Aleksandr Shevelev; Valerie Mozharova; Boris V. Dobrov; Andrey Pavlov

In this paper we study the contribution of semantic features to the detection of Russian paraphrases. The features were calculated on the Russian Thesaurus RuThes. First, we applied RuThes synonyms in clustering news articles, many of which had been created with rewriting (that is paraphrasing) of source news, and found significant improvement. Second, we applied several semantic similarity measures proposed for English thesaurus WordNet to RuThes thesaurus and utilized them for detecting Russian paraphrased sentences.


web intelligence, mining and semantics | 2018

Thesaurus-Based Topic Models and Their Evaluation

Natalia V. Loukachevitch; Kirill Ivanov; Boris V. Dobrov

In this paper we study thesaurus-based topic models and evaluate them from the point of view of topic coherence. Thesaurus-based topic model enhances scores of related terms found in the same text, which means that the model encourages these terms to be in the same topics. We evaluate various variants of such models. At the first step, we carry out manual evaluation of the obtained topics. At the second step, we study the possibility to use the collected manual data for evaluating new variants of thesaurus-based models, propose a method and select the best of its parameters in cross-validation. At the third step, we apply the created evaluation method to estimate the influence of word frequencies on adding thesaurus relations during generating topic models.


International Conference on Data Analytics and Management in Data Intensive Domains | 2017

News Timeline Generation: Accounting for Structural Aspects and Temporal Nature of News Stream

Mikhail Tikhomirov; Boris V. Dobrov

The number of news articles that are published daily is larger than any person can afford to study. Correct summarization of the information allows for an easy search for the event of interest. This research was designed to address the issue of constructing annotations of news story. Standard multi-document summarization approaches are not able to extract all information relevant to the event. This is due to the fact that such approaches do not take into account the variability of the event context in time. We have implemented a system that automatically builds timeline summary. We investigated impact of three factors: query extension, accounting for temporal nature and structure of news article in form of inverted pyramid. The annotations that we generate are composed of sentences sorted in chronological order, which together contain the main details of the news story. The paper shows that taking into account the described factors positively affects the quality of the annotations created.


intelligent information systems | 2004

On-line Thematic and Metadata Analysis of Document Collection

Mikhail Ageev; Boris V. Dobrov; Nikolai Makarov-Zemlyanskii

We describe interactive search tools based on thematic and metadata analysis of a document collection. The tools provide the following facilities: estimation of the main themes discussed in a document set, interactive narrowing of a query, cross-language search, exploring the topic-time dependencies, and exploring the dependencies between authors of articles, scientific organizations and themes. We describe an algorithm for thematic and metadata analysis and show some examples of their applications. All the tools described are integrated into the large full-text information retrieval system UIS RUSSIA.


Proceedings of the Seventh Global Wordnet Conference | 2014

RuThes Linguistic Ontology vs. Russian Wordnets

Natalia V. Loukachevitch; Boris V. Dobrov


text retrieval conference | 1997

Conceptual Indexing Using Thematic Representation of Texts

Boris V. Dobrov; Natalia V. Loukachevitch; Tatyana N. Yudina


language resources and evaluation | 2002

Evaluation of Thesaurus on Sociopolitical Life as Information-Retrieval Tool.

Natalia V. Loukachevitch; Boris V. Dobrov


recent advances in natural language processing | 2011

Multiple Evidence for Term Extraction in Broad Domains

Boris V. Dobrov; Natalia V. Loukachevitch

Collaboration


Dive into the Boris V. Dobrov's collaboration.

Top Co-Authors

Avatar
Top Co-Authors

Avatar
Top Co-Authors

Avatar
Top Co-Authors

Avatar
Top Co-Authors

Avatar
Top Co-Authors

Avatar
Top Co-Authors

Avatar
Top Co-Authors

Avatar
Top Co-Authors

Avatar
Researchain Logo
Decentralizing Knowledge