Vincent Bouvier
Aix-Marseille University
Network
Latest external collaboration on country level. Dive into details by clicking on the dots.
Publication
Featured researches published by Vincent Bouvier.
Innovations in Intelligent Machines (4) | 2014
Patrice Bellot; Ludovic Bonnefoy; Vincent Bouvier; Frédéric Duvert; Young-Min Kim
The issues for Natural Language Processing and Information Retrieval have been studied for long time but the recent availability of very large resources (Web pages, digital documents…) and the development of statistical machine learning methods exploiting annotated texts (manual encoding by crowdsourcing is a new major way) have transformed these fields. This allows not limiting these approaches to highly specialized domains and reducing the cost of their implementation. For this chapter, our aim is to present some popular text-mining statistical approaches for information retrieval and information extraction and to discuss the practical limits of actual systems that introduce challenges for future.
web intelligence | 2015
Vincent Bouvier; Patrice Bellot
Filtering pages about an entity (person, company, music band...) so that only interesting pages are kept is a real challenge. The interest can be qualified using criteria such as recency, novelty. In the last decade, we have seen classification systems trained to detect the interest for a document regarding an entity. For scalability reasons, it is not possible to consider a manual annotation of a training set for each tracked entity. Some approaches strive to build entity independent systems. These approaches obtain the state of the art performances, but we show that they can be improved. Time features differ from one entity to another, therefore no relevant statistics can be estimated out of these observations by a single classifier. Instead of having one model per entity or one model for all entities, we propose an approach that uses one model per cluster of entities based on the entity web popularity. We also introduce different strategies for automatic classification model selection. We test our approach on the Knowledge Base Acceleration (KBA) framework from TREC and we show that our approach brings significant improvements over a non-cluster-based method.
Document numérique | 2015
Vincent Bouvier; Patrice Bellot
Cet article s’interesse a une problematique de filtrage cible de documents. En plus de detecter et de desambiguiser les entites dans un flux de documents, notre approche ambitionne de selectionner seulement les documents qui presentent des informations nouvelles concernant les entites cibles. Nous proposons une nouvelle approche faiblement supervisee a base de combinaison de modeles de langue dynamiques et temporels (time-aware) qui permettent de suivre l’evolution des entites. Nous mettons en place des metacriteres qui permettent la desambiguisation d’entite dans un document, une estimation de la nouveaute et par dela l’interet de conserver ou non les documents selon une approche de classification par forets aleatoires. Nous montrons sur les donnees de la tâche KBA (Knowledge Base Acceleration) de TREC que nos strategies aboutissent a des performances meilleures que l’etat de l’art.
international acm sigir conference on research and development in information retrieval | 2013
Ludovic Bonnefoy; Vincent Bouvier; Patrice Bellot
text retrieval conference | 2012
Ludovic Bonnefoy; Vincent Bouvier; Patrice Bellot
CLEF (Working Notes) | 2013
Jean-Valère Cossu; Benjamin Bigot; Ludovic Bonnefoy; Mohamed Morchid; Xavier Bost; Grégory Senay; Richard Dufour; Vincent Bouvier; Juan-Manuel Torres-Moreno; Marc El-Bèze
text retrieval conference | 2013
Vincent Bouvier; Patrice Bellot
CORIA | 2015
Vincent Bouvier; Patrice Bellot
CORIA | 2015
Vincent Bouvier; Patrice Bellot
text retrieval conference | 2014
Vincent Bouvier; Patrice Bellot