Network


Latest external collaboration on country level. Dive into details by clicking on the dots.

Hotspot


Dive into the research topics where Benoît Habert is active.

Publication


Featured researches published by Benoît Habert.


international conference on computational linguistics | 1996

Symbolic word clustering for medium-size corpora

Benoît Habert; Elie Naulleau; Adeline Nazarenko

When trying to identify essential concepts and relationships in a medium-size corpus, it is not always possible to rely on statistical methods, as the frequencies are too low. We present an alternative method, symbolic, based on the simplification of parse trees. We discuss the results on nominal phrases of two technical corpora, analyzed by two different robust parsers used for terminology updating in an industrial company. We compare our results with Hindles scores of similarity.


Computers and The Humanities | 1999

Elementary dependency trees for identifying corpus-specific semantic classes

Benoît Habert; C. Fabre

Elementary dependency relationships between words within parse trees produced by robust analyzers on a corpus help automate the discovery of semantic classes relevant for the underlying domain. We introduce two methods for extracting elementary syntactic dependencies from normalized parse trees. The groupings which are obtained help identify coarse-grain semantic categories and isolate lexical idiosyncrasies belonging to a specific sublanguage. A comparison shows a satisfactory overlapping with an existing nomenclature for medical language processing. This symbolic approach is efficient on medium size corpora which resist to statistical clustering methods but seems more appropriate for specialized texts.


Archive | 1996

Les linguistiques de corpus

Adeline Nazarenko; Benoît Habert; André Salem


Terminology | 1996

Empirical observation of term variations and principles for their description

Béatrice Daille; Benoît Habert; Christian Jacquemin; Jean Royauté


Studies in health technology and informatics | 2001

Building a text corpus for representing the variety of medical language.

Pierre Zweigenbaum; Pierre Jacquemart; Natalia Grabar; Benoît Habert


language resources and evaluation | 1998

Towards tokenization evaluation

Benoît Habert; Gilles Adda; M. Adda Decker; P. Boula de Mareuil; Silvana Ferrari; Olivier Ferret; Gabriel Illouz; P. Paraubeck


american medical informatics association annual symposium | 1997

Corpus-based identification and refinement of semantic classes.

Adeline Nazarenko; Pierre Zweigenbaum; Jacques Bouaud; Benoît Habert


Archive | 2001

Corpus-based extension of a terminological semantic lexicon

Adeline Nazarenko; Pierre Zweigenbaum; Benoît Habert; Jacques Bouaud


DiSS '03: Disfluency in Spontaneous Speech. Workshop | 2003

A disfluency study for cleaning spontaneous speech automatic transcripts and improving speech language models

Martine Adda-Decker; Benoît Habert; Claude Barras; Gilles Adda; Philippe Boula de Mareüil; Patrick Paroubek


Mots | 1989

La langue de bois en éclat : les défigements dans les titres de presse quotidienne française

Pierre Fiala; Benoît Habert

Collaboration


Dive into the Benoît Habert's collaboration.

Top Co-Authors

Avatar

Pierre Zweigenbaum

Centre national de la recherche scientifique

View shared research outputs
Top Co-Authors

Avatar
Top Co-Authors

Avatar

Gilles Adda

Centre national de la recherche scientifique

View shared research outputs
Top Co-Authors

Avatar

Helka Folch

University of Paris-Sud

View shared research outputs
Top Co-Authors

Avatar

Claude Barras

Centre national de la recherche scientifique

View shared research outputs
Top Co-Authors

Avatar

Philippe Boula de Mareüil

Centre national de la recherche scientifique

View shared research outputs
Top Co-Authors

Avatar
Top Co-Authors

Avatar
Top Co-Authors

Avatar

Jacques Bouaud

École Normale Supérieure

View shared research outputs
Top Co-Authors

Avatar

André Salem

Centre national de la recherche scientifique

View shared research outputs
Researchain Logo
Decentralizing Knowledge