Is this you? Create Your Porfile

Javier Artiles

National University of Distance Education

Archive Network Publication Hotspot Collaboration

Network

Latest external collaboration on country level. Dive into details by clicking on the dots.

Explore More

Hotspot

Dive into the research topics where Javier Artiles is active.

Explore More

Publication

Featured researches published by Javier Artiles.

Information Retrieval | 2009

A comparison of extrinsic clustering evaluation metrics based on formal constraints

Enrique Amigó; Julio Gonzalo; Javier Artiles; Felisa Verdejo

There is a wide set of evaluation metrics available to compare the quality of text clustering algorithms. In this article, we define a few intuitive formal constraints on such metrics which shed light on which aspects of the quality of a clustering are captured by different metric families. These formal constraints are validated in an experiment involving human assessments, and compared with other constraints proposed in the literature. Our analysis of a wide range of metrics shows that only BCubed satisfies all formal constraints. We also extend the analysis to the problem of overlapping clustering, where items can simultaneously belong to more than one cluster. As Bcubed cannot be directly applied to this task, we propose a modified version of Bcubed that avoids the problems found with other metrics.

meeting of the association for computational linguistics | 2007

The SemEval-2007 WePS Evaluation: Establishing a benchmark for the Web People Search Task

Javier Artiles; Julio Gonzalo; Satoshi Sekine

This paper presents the task definition, resources, participation, and comparative results for the Web People Search task, which was organized as part of the SemEval-2007 evaluation exercise. This task consists of clustering a set of documents that mention an ambiguous person name according to the actual entities referred to using that name.

empirical methods in natural language processing | 2009

The role of named entities in Web People Search

Javier Artiles; Enrique Amigó; Julio Gonzalo

The ambiguity of person names in the Web has become a new area of interest for NLP researchers. This challenging problem has been formulated as the task of clustering Web search results (returned in response to a person name query) according to the individual they mention. In this paper we compare the coverage, reliability and independence of a number of features that are potential information sources for this clustering task, paying special attention to the role of named entities in the texts to be clustered. Although named entities are used in most approaches, our results show that, independently of the Machine Learning or Clustering algorithm used, named entity recognition and classification per se only make a small contribution to solve the problem.

european conference on information retrieval | 2008

Exploring the effects of language skills on multilingual web search

Jennifer Marlow; Paul D. Clough; Juan Miguel Cigarrán Recuero; Javier Artiles

Multilingual access is an important area of research, especially given the growth in multilingual users of online resources. A large body of research exists for Cross-Language Information Retrieval (CLIR); however, little of this work has considered the language skills of the end user, a critical factor in providing effective multilingual search functionality. In this paper we describe an experiment carried out to further understand the effects of language skills on multilingual search. Using the Google Translate service, we show that users have varied language skills that are non-trivial to assess and can impact their multilingual searching experience and search effectiveness.

meeting of the association for computational linguistics | 2009

The Impact of Query Refinement in the Web People Search Task

Javier Artiles; Julio Gonzalo; Enrique Amigó

Searching for a person name in a Web Search Engine usually leads to a number of web pages that refer to several people sharing the same name. In this paper we study whether it is reasonable to assume that pages about the desired person can be filtered by the user by adding query terms. Our results indicate that, although in most occasions there is a query refinement that gives all and only those pages related to an individual, it is unlikely that the user is able to find this expression a priori.

cross language evaluation forum | 2004

UNED at ImageCLEF 2004: detecting named entities and noun phrases for automatic query expansion and structuring

Víctor Peinado; Javier Artiles; Fernando López-Ostenero; Julio Gonzalo; Felisa Verdejo

This paper describes UNED experiments at the Image CLEF bilingual ad hoc task. Two different strategies are attempted: i) automatic expansion and translation using noun phrases; ii) automatic detection of named entities in the query for structured search on image caption fields. All our experiments obtain results above the average MAP for the bilingual task. Structured searches using named entities improve performance over a strong baseline (Pirkolas structured query approach), achieving one of the best results for the whole bilingual track. Expansion with noun phrases, however, degrades results, possibly due to the mismatch between train and test collections.

international conference on computational linguistics | 2012

Combining flat and structured approaches for temporal slot filling or: how much to compress?

Qi Li; Javier Artiles; Taylor Cassidy; Heng Ji

In this paper, we present a hybrid approach to Temporal Slot Filling (TSF) task. Our method decomposes the task into two steps: temporal classification and temporal aggregation. As in many other NLP tasks, a key challenge lies in capturing relations between text elements separated by a long context. We have observed that features derived from a structured text representation can help compressing the context and reducing ambiguity. On the other hand, surface lexical features are more robust and work better in some cases. Experiments on the KBP2011 temporal training data set show that both surface and structured approaches outperform a baseline bag-of-word based classifier and the proposed hybrid method can further improve the performance significantly. Our system achieved the top performance in KBP2011 evaluation.

Archive | 2009