Network


Latest external collaboration on country level. Dive into details by clicking on the dots.

Hotspot


Dive into the research topics where Hugo Hernault is active.

Publication


Featured researches published by Hugo Hernault.


international conference on computational linguistics | 2010

A sequential model for discourse segmentation

Hugo Hernault; Danushka Bollegala; Mitsuru Ishizuka

Identifying discourse relations in a text is essential for various tasks in Natural Language Processing, such as automatic text summarization, question-answering, and dialogue generation. The first step of this process is segmenting a text into elementary units. In this paper, we present a novel model of discourse segmentation based on sequential data labeling. Namely, we use Conditional Random Fields to train a discourse segmenter on the RST Discourse Treebank, using a set of lexical and syntactic features. Our system is compared to other statistical and rule-based segmenters, including one based on Support Vector Machines. Experimental results indicate that our sequential model outperforms current state-of-the-art discourse segmenters, with an F-score of 0.94. This performance level is close to the human agreement F-score of 0.98.


international conference on computational linguistics | 2011

Semi-supervised discourse relation classification with structural learning

Hugo Hernault; Danushka Bollegala; Mitsuru Ishizuka

The corpora available for training discourse relation classifiers are annotated using a general set of discourse relations. However, for certain applications, custom discourse relations are required. Creating a new annotated corpus with a new relation taxonomy is a timeconsuming and costly process. We address this problem by proposing a semi-supervised approach to discourse relation classification based on Structural Learning. First, we solve a set of auxiliary classification problems using unlabeled data. Second, the learned classifiers are used to extend feature vectors to train a discourse relation classifier. By defining a relevant set of auxiliary classification problems, we show that the proposed method brings improvement of at least 50% in accuracy and F-score on the RST Discourse Treebank and Penn Discourse Treebank, when small training sets of ca. 1000 training instances are employed. This is an attractive perspective for training discourse relation classifiers on domains where little amount of labeled training data is available.


document engineering | 2009

From rhetorical structures to document structure: shallow pragmatic analysis for document engineering

Gersende Georg; Hugo Hernault; Marc Cavazza; Helmut Prendinger; Mitsuru Ishizuka

In this paper, we extend previous work on the automatic structuring of medical documents using content analysis. Our long-term objective is to take advantage of specific rhetoric markers encountered in specialized medical documents (clinical guidelines) to automatically structure free text according to its role in the document. This should enable to generate multiple views of the same document depending on the target audience, generate document summaries, as well as facilitating knowledge extraction from text. We have established in previous work that the structure of clinical guidelines could be refined through the identification of a limited set of deontic operators. We now propose to extend this approach by analyzing the text delimited by these operators using Rhetorical Structure Theory. The emphasis on causality and time in RST proves a powerful complement to the recognition of deontic structures while retaining the same philosophy of high-level recognition of sentence structure, which can be converted into application-specific mark-ups. Throughout the paper, we illustrate our findings through results produced by the automatic processing of English guidelines for the management of hypertension and Alzheimer disease.


Dialogue & Discourse | 2010

HILDA: A Discourse Parser Using Support Vector Machine Classification

Hugo Hernault; Helmut Prendinger; David A. duVerle; Mitsuru Ishizuka


intelligent virtual agents | 2007

T2D: Generating Dialogues Between Virtual Agents Automatically from Text

Paul Piwek; Hugo Hernault; Helmut Prendinger; Mitsuru Ishizuka


empirical methods in natural language processing | 2010

A Semi-Supervised Approach to Improve Classification of Infrequent Discourse Relations Using Feature Vector Extension

Hugo Hernault; Danushka Bollegala; Mitsuru Ishizuka


intelligent virtual agents | 2008

Generating Dialogues for Virtual Agents Using Nested Textual Coherence Relations

Hugo Hernault; Paul Piwek; Helmut Prendinger; Mitsuru Ishizuka


Archive | 2008

Generating Questions: An Inclusive Characterization and a Dialogue-based Application

Paul Piwek; Helmut Prendinger; Hugo Hernault; Mitsuru Ishizuka


national conference on artificial intelligence | 2011

Evaluating HILDA in the CODA project: A case study in question generation using automatic discourse analysis

Pascal Kuyten; Hugo Hernault; Helmut Prendinger; Mitsuru Ishizuka


annual meeting of the special interest group on discourse and dialogue | 2010

Towards Semi-Supervised Classification of Discourse Relations using Feature Correlations

Hugo Hernault; Danushka Bollegala; Mitsuru Ishizuka

Collaboration


Dive into the Hugo Hernault's collaboration.

Top Co-Authors

Avatar
Top Co-Authors

Avatar

Helmut Prendinger

National Institute of Informatics

View shared research outputs
Top Co-Authors

Avatar
Top Co-Authors

Avatar
Top Co-Authors

Avatar
Top Co-Authors

Avatar
Top Co-Authors

Avatar
Top Co-Authors

Avatar
Researchain Logo
Decentralizing Knowledge