Network


Latest external collaboration on country level. Dive into details by clicking on the dots.

Hotspot


Dive into the research topics where Alexandre Klementiev is active.

Publication


Featured researches published by Alexandre Klementiev.


meeting of the association for computational linguistics | 2006

Weakly Supervised Named Entity Transliteration and Discovery from Multilingual Comparable Corpora

Alexandre Klementiev; Dan Roth

Named Entity recognition (NER) is an important part of many natural language processing tasks. Current approaches often employ machine learning techniques and require supervised data. However, many languages lack such resources. This paper presents an (almost) unsupervised learning algorithm for automatic discovery of Named Entities (NEs) in a resource free language, given a bilingual corpora in which it is weakly temporally aligned with a resource rich language. NEs have similar time distributions across such corpora, and often some of the tokens in a multi-word NE are transliterated. We develop an algorithm that exploits both observations iteratively. The algorithm makes use of a new, frequency based, metric for time distributions and a resource free discriminative approach to transliteration. Seeded with a small number of transliteration pairs, our algorithm discovers multi-word NEs, and takes advantage of a dictionary (if one exists) to account for translated or partially translated NEs. We evaluate the algorithm on an English-Russian corpus, and show high level of NEs discovery in Russian.


international conference on machine learning | 2008

Unsupervised rank aggregation with distance-based models

Alexandre Klementiev; Dan Roth; Kevin Small

The need to meaningfully combine sets of rankings often comes up when one deals with ranked data. Although a number of heuristic and supervised learning approaches to rank aggregation exist, they require domain knowledge or supervised ranked data, both of which are expensive to acquire. In order to address these limitations, we propose a mathematical and algorithmic framework for learning to aggregate (partial) rankings without supervision. We instantiate the framework for the cases of combining permutations and combining top-k lists, and propose a novel metric for the latter. Experiments in both scenarios demonstrate the effectiveness of the proposed formalism.


european conference on machine learning | 2007

An Unsupervised Learning Algorithm for Rank Aggregation

Alexandre Klementiev; Dan Roth; Kevin Small

Many applications in information retrieval, natural language processing, data mining, and related fields require a ranking of instances with respect to a specified criteria as opposed to a classification. Furthermore, for many such problems, multiple established ranking models have been well studied and it is desirable to combine their results into a joint ranking, a formalism denoted as rank aggregation. This work presents a novel unsupervisedlearning algorithm for rank aggregation (ULARA) which returns a linear combination of the individual ranking functions based on the principle of rewarding ordering agreement between the rankers. In addition to presenting ULARA, we demonstrate its effectiveness on a data fusion task across ad hoc retrieval systems.


language and technology conference | 2006

Named Entity Transliteration and Discovery from Multilingual Comparable Corpora

Alexandre Klementiev; Dan Roth

Named Entity recognition (NER) is an important part of many natural language processing tasks. Most current approaches employ machine learning techniques and require supervised data. However, many languages lack such resources. This paper presents an algorithm to automatically discover Named Entities (NEs) in a resource free language, given a bilingual corpora in which it is weakly temporally aligned with a resource rich language. We observe that NEs have similar time distributions across such corpora, and that they are often transliterated, and develop an algorithm that exploits both iteratively. The algorithm makes use of a new, frequency based, metric for time distributions and a resource free discriminative approach to transliteration. We evaluate the algorithm on an English-Russian corpus, and show high level of NEs discovery in Russian.


international conference on computational linguistics | 2012

Inducing Crosslingual Distributed Representations of Words

Alexandre Klementiev; Ivan Titov; Binod Bhattarai


conference of the european chapter of the association for computational linguistics | 2012

Toward Statistical Machine Translation without Parallel Corpora

Alexandre Klementiev; Ann Irvine; Chris Callison-Burch; David Yarowsky


conference of the european chapter of the association for computational linguistics | 2012

A Bayesian Approach to Unsupervised Semantic Role Induction

Ivan Titov; Alexandre Klementiev


meeting of the association for computational linguistics | 2011

A Bayesian Model for Unsupervised Semantic Parsing

Ivan Titov; Alexandre Klementiev


north american chapter of the association for computational linguistics | 2010

Using Mechanical Turk to Annotate Lexicons for Less Commonly Used Languages

Ann Irvine; Alexandre Klementiev


international joint conference on artificial intelligence | 2009

Unsupervised rank aggregation with domain-specific expertise

Alexandre Klementiev; Dan Roth; Kevin Small; Ivan Titov

Collaboration


Dive into the Alexandre Klementiev's collaboration.

Top Co-Authors

Avatar

Ivan Titov

University of Amsterdam

View shared research outputs
Top Co-Authors

Avatar
Top Co-Authors

Avatar

Ann Irvine

Johns Hopkins University

View shared research outputs
Top Co-Authors

Avatar
Top Co-Authors

Avatar
Top Co-Authors

Avatar

David Yarowsky

Johns Hopkins University

View shared research outputs
Researchain Logo
Decentralizing Knowledge