Alejandro Molina | Researchain

Archive Network Publication Hotspot Collaboration

Network

Latest external collaboration on country level. Dive into details by clicking on the dots.

Explore More

Hotspot

Dive into the research topics where Alejandro Molina is active.

Explore More

Publication

Featured researches published by Alejandro Molina.

mexican international conference on artificial intelligence | 2011

Discourse segmentation for sentence compression

Alejandro Molina; Juan-Manuel Torres-Moreno; Eric SanJuan; Iria da Cunha; Gerardo Sierra; Patricia Velázquez-Morales

Earlier studies have raised the possibility of summarizing at the level of the sentence. This simplification should help in adapting textual content in a limited space. Therefore, sentence compression is an important resource for automatic summarization systems. However, there are few studies that consider sentence-level discourse segmentation for compression task; to our knowledge, none in Spanish. In this paper, we study the relationship between discourse segmentation and compression for sentences in Spanish. We use a discourse segmenter and observe to what extent the passages deleted by annotators fit in discourse structures detected by the system. The main idea is to verify whether the automatic discourse segmentation can serve as a basis in the identification of segments to be eliminated in the sentence compression task. We show that discourse segmentation could be a first solid step towards a sentence compression system.

international conference on computational linguistics | 2013

Discursive sentence compression

Alejandro Molina; Juan-Manuel Torres-Moreno; Eric SanJuan; Iria da Cunha; Gerardo Sierra Martínez

This paper presents a method for automatic summarization by deleting intra-sentence discourse segments. First, each sentence is divided into elementary discourse units and, then, less informative segments are deleted. To analyze the results, we have set up an annotation campaign, thanks to which we have found interesting aspects regarding the elimination of discourse segments as an alternative to sentence compression task. Results show that the degree of disagreement in determining the optimal compressed sentence is high and increases with the complexity of the sentence. However, there is some agreement on the decision to delete discourse segments. The informativeness of each segment is calculated using textual energy, a method that has shown good results in automatic summarization.

latin american web congress | 2009

Web Exploitation for Definition Extraction

Gerardo Sierra; Rodrigo Alarcón; Alejandro Molina; Edwin Aldana

We present a system to extract definitions from a term using the Web. Definitions are organized according to a typology and its context. The structural and functional design is described; emphasizing its relevant components: the extractor of definitional context to candidates using the Yahoo!’s BOSS API; the extractor of definitional contexts and a clustering module based on textual energy as a measure of similarity.

cross language evaluation forum | 2013

A Turing Test to Evaluate a Complex Summarization Task

Alejandro Molina; Eric SanJuan; Juan-Manuel Torres-Moreno

This paper deals with a new strategy to evaluate a Natural Language Processing NLP complex task using the Turing test. Automatic summarization based on sentence compression requires to asses informativeness and modify inner sentence structures. This is much more intrinsically related with real rephrasing than plain sentence extraction and ranking paradigm so new evaluation methods are needed. We propose a novel imitation game to evaluate Automatic Summarization by Compression ASC. Rationale of this Turing-like evaluation could be applied to many other NLP complex tasks like Machine translation or Text Generation. We show that a state of the art ASC system can pass such a test and simulate a human summary in 60% of the cases.

mexican international conference on artificial intelligence | 2013

Analysis and Transformation of Textual Energy Distribution

Alejandro Molina; Juan-Manuel Torres-Moreno; Eric SanJuan; Gerardo Sierra; Julio Rojas-Mora

In this paper we revisit the Textual Energy model. We deal with the two major disadvantages of the Textual Energy: the asymmetry of the distribution and the unbounded ness of the maximum value. Although this model has been successfully used in several NLP tasks like summarization, clustering and sentence compression, no correction of these problems has been proposed until now. Concerning the maximum value, we analyze the computation of Textual Energy matrix and we conclude that energy values are dominated by the lexical richness in quadratic growth of the vocabulary size. Using the Box-Cox transformation, we show empirical evidence that a log transformation could correct both problems.

language resources and evaluation | 2014

Investigating the Image of Entities in Social Media: Dataset Design and First Results

Julien Velcin; Young-Min Kim; Caroline Brun; Jean-Yves Dormagen; Eric SanJuan; Leila Khouas; Anne Peradotto; Stéphane Bonnevay; Claude Roux; Julien Boyadjian; Alejandro Molina; Marie. Auteur du texte Neihouser

Linguamática | 2010

La compresion de frases: un recurso para la optimizacion de resumen automatico de documentos

Alejandro Molina; Iria da Cunha; Juan-Manuel Torres-Moreno; Patricia Velázquez-Morales

Archive | 2015

Regroupement sémantique de définitions en espagnol

Gerardo Sierra; Juan-Manuel Torres-Moreno; Alejandro Molina

arXiv: Information Retrieval | 2014

Algorithmes de classification et d'optimisation : participation du LIA/ADOC à DEFT'14

Luis Adrián Cabrera-Diego; Stéphane Huet; Bassam Jabaian; Alejandro Molina; Juan-Manuel Torres-Moreno; Marc El-Bèze; Barthélémy Durette; Québec Canada

Research on computing science | 2014