Is this you? Create Your Porfile

Alexandre Klementiev

University of Illinois at Urbana–Champaign

Archive Network Publication Hotspot Collaboration

Network

Latest external collaboration on country level. Dive into details by clicking on the dots.

Explore More

Hotspot

Dive into the research topics where Alexandre Klementiev is active.

Explore More

Publication

Featured researches published by Alexandre Klementiev.

meeting of the association for computational linguistics | 2006

Weakly Supervised Named Entity Transliteration and Discovery from Multilingual Comparable Corpora

Alexandre Klementiev; Dan Roth

Named Entity recognition (NER) is an important part of many natural language processing tasks. Current approaches often employ machine learning techniques and require supervised data. However, many languages lack such resources. This paper presents an (almost) unsupervised learning algorithm for automatic discovery of Named Entities (NEs) in a resource free language, given a bilingual corpora in which it is weakly temporally aligned with a resource rich language. NEs have similar time distributions across such corpora, and often some of the tokens in a multi-word NE are transliterated. We develop an algorithm that exploits both observations iteratively. The algorithm makes use of a new, frequency based, metric for time distributions and a resource free discriminative approach to transliteration. Seeded with a small number of transliteration pairs, our algorithm discovers multi-word NEs, and takes advantage of a dictionary (if one exists) to account for translated or partially translated NEs. We evaluate the algorithm on an English-Russian corpus, and show high level of NEs discovery in Russian.

international conference on machine learning | 2008

Unsupervised rank aggregation with distance-based models

Alexandre Klementiev; Dan Roth; Kevin Small

The need to meaningfully combine sets of rankings often comes up when one deals with ranked data. Although a number of heuristic and supervised learning approaches to rank aggregation exist, they require domain knowledge or supervised ranked data, both of which are expensive to acquire. In order to address these limitations, we propose a mathematical and algorithmic framework for learning to aggregate (partial) rankings without supervision. We instantiate the framework for the cases of combining permutations and combining top-k lists, and propose a novel metric for the latter. Experiments in both scenarios demonstrate the effectiveness of the proposed formalism.

european conference on machine learning | 2007

An Unsupervised Learning Algorithm for Rank Aggregation

Alexandre Klementiev; Dan Roth; Kevin Small

Many applications in information retrieval, natural language processing, data mining, and related fields require a ranking of instances with respect to a specified criteria as opposed to a classification. Furthermore, for many such problems, multiple established ranking models have been well studied and it is desirable to combine their results into a joint ranking, a formalism denoted as rank aggregation. This work presents a novel unsupervisedlearning algorithm for rank aggregation (ULARA) which returns a linear combination of the individual ranking functions based on the principle of rewarding ordering agreement between the rankers. In addition to presenting ULARA, we demonstrate its effectiveness on a data fusion task across ad hoc retrieval systems.

language and technology conference | 2006

Named Entity Transliteration and Discovery from Multilingual Comparable Corpora

Alexandre Klementiev; Dan Roth

Named Entity recognition (NER) is an important part of many natural language processing tasks. Most current approaches employ machine learning techniques and require supervised data. However, many languages lack such resources. This paper presents an algorithm to automatically discover Named Entities (NEs) in a resource free language, given a bilingual corpora in which it is weakly temporally aligned with a resource rich language. We observe that NEs have similar time distributions across such corpora, and that they are often transliterated, and develop an algorithm that exploits both iteratively. The algorithm makes use of a new, frequency based, metric for time distributions and a resource free discriminative approach to transliteration. We evaluate the algorithm on an English-Russian corpus, and show high level of NEs discovery in Russian.

international conference on computational linguistics | 2012

Inducing Crosslingual Distributed Representations of Words

Alexandre Klementiev; Ivan Titov; Binod Bhattarai

conference of the european chapter of the association for computational linguistics | 2012

Toward Statistical Machine Translation without Parallel Corpora

Alexandre Klementiev; Ann Irvine; Chris Callison-Burch; David Yarowsky

conference of the european chapter of the association for computational linguistics | 2012

A Bayesian Approach to Unsupervised Semantic Role Induction

Ivan Titov; Alexandre Klementiev

meeting of the association for computational linguistics | 2011

A Bayesian Model for Unsupervised Semantic Parsing

Ivan Titov; Alexandre Klementiev

north american chapter of the association for computational linguistics | 2010

Using Mechanical Turk to Annotate Lexicons for Less Commonly Used Languages

Ann Irvine; Alexandre Klementiev

international joint conference on artificial intelligence | 2009

Unsupervised rank aggregation with domain-specific expertise

Alexandre Klementiev; Dan Roth; Kevin Small; Ivan Titov

Explore More

Collaboration

Dive into the Alexandre Klementiev's collaboration.

Top Co-Authors

Ivan Titov

University of Amsterdam

View shared research outputs

Top Co-Authors

Kevin Small

Tufts University

View shared research outputs

Top Co-Authors

Ann Irvine

Johns Hopkins University

View shared research outputs

Top Co-Authors

Chris Callison-Burch

University of Pennsylvania

View shared research outputs

Top Co-Authors

Ashutosh Modi

Saarland University

View shared research outputs

Top Co-Authors

David Yarowsky

Johns Hopkins University

View shared research outputs

Explore More