Emmanuel Ferreira | Researchain

Archive Network Publication Hotspot Collaboration

Network

Latest external collaboration on country level. Dive into details by clicking on the dots.

Explore More

Hotspot

Dive into the research topics where Emmanuel Ferreira is active.

Explore More

Publication

Featured researches published by Emmanuel Ferreira.

Computer Speech & Language | 2015

Reinforcement-learning based dialogue system for human-robot interactions with socially-inspired rewards

Emmanuel Ferreira; Fabrice Lefèvre

HighlightsWe integrate user appraisals in a POMDP-based dialogue manager procedure.We employ additional socially-inspired rewards in a RL setup to guide the learning.A unified framework for speeding up the policy optimisation and user adaptation.We consider a potential-based reward shaping with a sample efficient RL algorithm.Evaluated using both user simulator (information retrieval) and user trials (HRI). This paper investigates some conditions under which polarized user appraisals gathered throughout the course of a vocal interaction between a machine and a human can be integrated in a reinforcement learning-based dialogue manager. More specifically, we discuss how this information can be cast into socially-inspired rewards for speeding up the policy optimisation for both efficient task completion and user adaptation in an online learning setting. For this purpose a potential-based reward shaping method is combined with a sample efficient reinforcement learning algorithm to offer a principled framework to cope with these potentially noisy interim rewards. The proposed scheme will greatly facilitate the systems development by allowing the designer to teach his system through explicit positive/negative feedbacks given as hints about task progress, in the early stage of training. At a later stage, the approach will be used as a way to ease the adaptation of the dialogue policy to specific user profiles. Experiments carried out using a state-of-the-art goal-oriented dialogue management framework, the Hidden Information State (HIS), support our claims in two configurations: firstly, with a user simulator in the tourist information domain (and thus simulated appraisals), and secondly, in the context of man-robot dialogue with real user trials.

international joint conference on artificial intelligence | 2013

Social signal and user adaptation in reinforcement learning-based dialogue management

Emmanuel Ferreira; Fabrice Lefèvre

This paper investigates the conditions under which cues from social signals can be used for user adaptation (or user tracking) of a learning agent. In this work we consider the case of the Reinforcement Learning (RL) of a dialogue management module. Social signals (gazes, postures, emotions, etc.) have an undeniable importance in human interactions and can be used as an additional and user-dependent (subjective) reinforcement signal during learning. In this paper, the Kalman Temporal Differences (KTD) framework is employed in combination with a potential-based shaping reward method to properly integrate the social information in the optimisation procedure and adapt the policy to user profiles. In a second step the ability of the method to track a new user profile (after self learning of the user or switch to a new user) is shown. Experiments carried out using a state-of-the-art goal-oriented dialogue management framework with simulations support our claims.

Natural Language Dialog Systems and Intelligent Assistants | 2015

Users’ Belief Awareness in Reinforcement Learning-Based Situated Human–Robot Dialogue Management

Emmanuel Ferreira; Grégoire Milliez; Fabrice Lefèvre; Rachid Alami

Others can have a different perception of the world than ours. Understanding this divergence is an ability, known as perspective taking in developmental psychology, that humans exploit in daily social interactions. A recent trend in robotics aims at endowing robots with similar mental mechanisms. The goal then is to enable them to naturally and efficiently plan tasks and communicate about them. In this paper we address this challenge extending a state-of-the-art goal-oriented dialogue management framework, the Hidden Information State (HIS). The new version makes use of the robot’s awareness of the users’ belief in a reinforcement learning-based situated dialogue management optimisation procedure. Thus the proposed solution enables the system to cope not only with the communication ambiguities due to noisy channel but also with the possible misunderstandings due to some divergence among the beliefs of the robot and its interlocutor in a human–robot interaction (HRI) context. We show the relevance of the approach by comparing different handcrafted and learnt dialogue policies with and without divergent belief reasoning in an in-house pick–place–carry scenario by means of user trials in a simulated 3D environment.

simulation modeling and programming for autonomous robots | 2014

Simulating Human-Robot Interactions for Dialogue Strategy Learning

Grégoire Milliez; Emmanuel Ferreira; Michelangelo Fiore; Rachid Alami; Fabrice Lefèvre

Many robotic projects use simulation as a faster and easier way to develop, evaluate and validate software components compared with on-board real world settings. In the human-robot interaction field, some recent works have attempted to integrate humans in the simulation loop. In this paper we investigate how such kind of robotic simulation software can be used to provide a dynamic and interactive environment to both collect a multimodal situated dialogue corpus and to perform an efficient reinforcement learning-based dialogue management optimisation procedure. Our proposition is illustrated by a preliminary experiment involving real users in a Pick-Place-Carry task for which encouraging results are obtained.

ieee automatic speech recognition and understanding workshop | 2013

Expert-based reward shaping and exploration scheme for boosting policy learning of dialogue management

Emmanuel Ferreira; Fabrice Lefèvre

This paper investigates the conditions under which expert knowledge can be used to accelerate the policy optimization of a learning agent. Recent works on reinforcement learning for dialogue management allowed to devise sophisticated methods for value estimation in order to deal all together with exploration/exploitation dilemma, sample-efficiency and non-stationary environments. In this paper, a reward shaping method and an exploration scheme, both based on some intuitive hand-coded expert advices, are combined with an efficient temporal difference-based learning procedure. The key objective is to boost the initial training stage, when the system is not sufficiently reliable to interact with real users (e.g. clients). Our claims are illustrated by experiments based on simulation and carried out using a state-of-the-art goal-oriented dialogue management framework, the Hidden Information State (HIS).

international conference on acoustics, speech, and signal processing | 2015

Online adaptative zero-shot learning spoken language understanding using word-embedding

Emmanuel Ferreira; Bassam Jabaian; Fabrice Lefèvre

Many recent competitive state-of-the-art solutions for understanding of speech data have in common to be probabilistic and to rely on machine learning algorithms to train their models from large amount of data. The difficulty remains in the cost and time of collecting and annotating such data, but also to update the existing models to new conditions, tasks and/or languages. In the present work an approach based on a zeroshot learning method using word embeddings for spoken language understanding is investigated. This approach requires no dedicated data. Large amounts of un-annotated and unstructured found data are used to learn a continuous space vector representation of words, based on neural network architectures. Only the ontological description of the target domain and the generic word embedding features are then required to derive the model used for decoding. In this paper, we extend this baseline with an online adaptative strategy allowing to refine progressively the initial model with only a light and adjustable supervision. We show that this proposition can significantly improve the performance of the spoken language understanding module on the second Dialog State Tracking Challenge (DSTC2) datasets.

international conference on acoustics, speech, and signal processing | 2016

Adversarial Bandit for online interactive active learning of zero-shot spoken language understanding

Emmanuel Ferreira; Alexandre Reiffers Masson; Bassam Jabaian; Fabrice Lefèvre

Many state-of-the-art solutions for the understanding of speech data have in common to be probabilistic and to rely on machine learning algorithms to train their models from large amount of data. The difficulty remains in the cost of collecting and annotating such data. Another point is the time for updating an existing model to a new domain. Recent works showed that a zero-shot learning method allows to bootstrap a model with good initial performance. To do so, this method relies on exploiting both a small-sized ontological description of the target domain and a generic word-embedding semantic space for generalization. Then, this framework has been extended to exploit user feedbacks to refine the zero-shot semantic parser parameters and increase its performance online. In this paper, we propose to drive this online adaptive process with a policy learnt using the Adversarial Bandit algorithm Exp3. We show, on the second Dialog State Tracking Challenge (DSTC2) datasets, that this proposition can optimally balance the cost of gathering valuable user feedbacks and the overall performance of the spoken language understanding module.

spoken language technology workshop | 2012

Acoustic modeling for under-resourced languages based on vectorial HMM-states representation using Subspace Gaussian Mixture Models

Mohamed Bouallegue; Emmanuel Ferreira; Driss Matrouf; Georges Linarès; Maria Goudi; Pascal Nocera

This paper explores a novel method for context-dependent models in automatic speech recognition (ASR), in the context of under-resourced languages. We present a simple way to realize a tying states approach, based on a new vectorial representation of the HMM states. This vectorial representation is considered as a vector of a low number of parameters obtained by the Subspace Gaussian Mixture Models paradigm (SGMM). The proposed method does not require phonetic knowledge or a large amount of data, which represent the major problems of acoustic modeling for under-resourced languages. This paper shows how this representation can be obtained and used for tying states. Our experiments, applied on Vietnamese, show that this approach achieves a stable gain compared to the classical approach which is based on decision trees. Furthermore, this method appears to be portable to other languages, as shown in the preliminary study conducted on Berber.

international conference on asian language processing | 2012

YAST: A Scalable ASR Toolkit Especially Designed for Under-Resourced Languages

Emmanuel Ferreira; Pascal Nocera; Maria Goudi; Ngoc Diep Do Thi

The ability to collect and process a large amount of resources (e.g. vocabularies, text corpora, transcribed speech corpora and phonetic dictionaries) constitutes a critical prerequisite of systems based on statistical methods. This aspect becomes crucial for languages presenting a lack of computer resources, also known as under-resourced languages, such as Vietnamese. Our work consists in exploring an efficient methodology which can help the development of speech recognition systems for this kind of languages. This article presents a possible solution that provides a fast building and customisable ASR toolkit, called YAST. The latter includes an ASR library as well as a collection of C++/Java executable programs and some helper bash and perl scripts. These utilities allow on one hand, to build and evaluate an ASR system, on the other, to provide programming development hooks that permit to include state of the art techniques. YAST is freely available for non-commercial purposes. This paper summarizes the functionality of the toolkit and also provides a basic example carried out on the Vietnamese language.

conference of the international speech communication association | 2015