Iñigo Casanueva | Researchain

Archive Network Publication Hotspot Collaboration

Network

Latest external collaboration on country level. Dive into details by clicking on the dots.

Explore More

Hotspot

Dive into the research topics where Iñigo Casanueva is active.

Explore More

Publication

Featured researches published by Iñigo Casanueva.

annual meeting of the special interest group on discourse and dialogue | 2015

Knowledge transfer between speakers for personalised dialogue management

Iñigo Casanueva; Thomas Hain; Heidi Christensen; Ricard Marxer; Phil D. Green

Model-free reinforcement learning has been shown to be a promising data driven approach for automatic dialogue policy optimization, but a relatively large amount of dialogue interactions is needed before the system reaches reasonable performance. Recently, Gaussian process based reinforcement learning methods have been shown to reduce the number of dialogues needed to reach optimal performance, and pre-training the policy with data gathered from different dialogue systems has further reduced this amount. Following this idea, a dialogue system designed for a single speaker can be initialised with data from other speakers, but if the dynamics of the speakers are very different the model will have a poor performance. When data gathered from different speakers is available, selecting the data from the most similar ones might improve the performance. We propose a method which automatically selects the data to transfer by defining a similarity measure between speakers, and uses this measure to weight the influence of the data from each speaker in the policy model. The methods are tested by simulating users with different severities of dysarthria interacting with a voice enabled environmental control system.

meeting of the association for computational linguistics | 2017

PyDial: A Multi-domain Statistical Dialogue System Toolkit

Stefan Ultes; Lina M. Rojas Barahona; Pei-Hao Su; David Vandyke; Dongho Kim; Iñigo Casanueva; Pawel Budzianowski; Nikola Mrksic; Tsung-Hsien Wen; Milica Gasic; Steve J. Young

Statistical Spoken Dialogue Systems have been around for many years. However, access to these systems has always been difficult as there is still no publicly available end-to-end system implementation. To alleviate this, we present PyDial, an opensource end-to-end statistical spoken dialogue system toolkit which provides implementations of statistical approaches for all dialogue system modules. Moreover, it has been extended to provide multidomain conversational functionality. It offers easy configuration, easy extensibility, and domain-independent implementations of the respective dialogue system modules. The toolkit is available for download under the Apache 2.0 license.

conference of the international speech communication association | 2016

Improving generalisation to new speakers in spoken dialogue state tracking

Iñigo Casanueva; Thomas Hain; Phil D. Green

Users with disabilities can greatly benefit from personalised voice-enabled environmental-control interfaces, but for users with speech impairments (e.g. dysarthria) poor ASR performance poses a challenge to successful dialogue. Statistical dialogue management has shown resilience against high ASR error rates, hence making it useful to improve the performance of these interfaces. However, little research was devoted to dialogue management personalisation to specific users so far. Recently, data driven discriminative models have been shown to yield the best performance in dialogue state tracking (the inference of the user goal from the dialogue history). However, due to the unique characteristics of each speaker, training a system for a new user when user specific data is not available can be challenging due to the mismatch between training and working conditions. This work investigates two methods to improve the performance with new speakers of a LSTM-based personalised state tracker: The use of speaker specific acoustic and ASRrelated features; and dropout regularisation. It is shown that in an environmental control system for dysarthric speakers, the combination of both techniques yields improvements of 3.5% absolute in state tracking accuracy. Further analysis explores the effect of using different amounts of speaker specific data to train the tracking system.

annual meeting of the special interest group on discourse and dialogue | 2016

Using phone features to improve dialogue state tracking generalisation to unseen states

Iñigo Casanueva; Thomas Hain; Mauro Nicolao; Phil D. Green

The generalisation of dialogue state tracking to unseen dialogue states can be very challenging. In a slot-based dialogue system, dialogue states lie in discrete space where distances between states cannot be computed. Therefore, the model parameters to track states unseen in the training data can only be estimated from more general statistics, under the assumption that every dialogue state will have the same underlying state tracking behaviour. However, this assumption is not valid. For example, two values, whose associated concepts have different ASR accuracy, may have different state tracking performance. Therefore, if the ASR performance of the concepts related to each value can be estimated, such estimates can be used as general features. The features will help to relate unseen dialogue states to states seen in the training data with similar ASR performance. Furthermore, if two phonetically similar concepts have similar ASR performance, the features extracted from the phonetic structure of the concepts can be used to improve generalisation. In this paper, ASR and phonetic structurerelated features are used to improve the dialogue state tracking generalisation to unseen states of an environmental control system developed for dysarthric speakers.

Proceedings of the Fourth Workshop on Speech and Language Processing for Assistive Technologies | 2013

homeService: Voice-enabled assistive technology in the home using cloud-based automatic speech recognition

Heidi Christensen; Iñigo Casanueva; Stuart P. Cunningham; Phil D. Green; Thomas Hain

conference of the international speech communication association | 2014

Adaptive speech recognition and dialogue management for users with speech disorders.

Iñigo Casanueva; Heidi Christensen; Thomas Hain; Phil D. Green

conference of the international speech communication association | 2017

Domain-Independent User Satisfaction Reward Estimation for Dialogue Policy Learning.

Stefan Ultes; Pawel Budzianowski; Iñigo Casanueva; Nikola Mrksic; Lina Maria Rojas-Barahona; Pei-Hao Su; Tsung-Hsien Wen; Milica Gasic; Steve J. Young

annual meeting of the special interest group on discourse and dialogue | 2017

Sub-domain Modelling for Dialogue Management with Hierarchical Reinforcement Learning.

Pawel Budzianowski; Stefan Ultes; Pei-Hao Su; Nikola Mrksic; Tsung-Hsien Wen; Iñigo Casanueva; Lina Maria Rojas-Barahona; Milica Gasic

arXiv: Machine Learning | 2017

A Benchmarking Environment for Reinforcement Learning Based Task Oriented Dialogue Management.

Iñigo Casanueva; Pawel Budzianowski; Pei-Hao Su; Nikola Mrksic; Tsung-Hsien Wen; Stefan Ultes; Lina Maria Rojas-Barahona; Steve J. Young; Milica Gasic

annual meeting of the special interest group on discourse and dialogue | 2017