Iñigo Casanueva
University of Sheffield
Network
Latest external collaboration on country level. Dive into details by clicking on the dots.
Publication
Featured researches published by Iñigo Casanueva.
annual meeting of the special interest group on discourse and dialogue | 2015
Iñigo Casanueva; Thomas Hain; Heidi Christensen; Ricard Marxer; Phil D. Green
Model-free reinforcement learning has been shown to be a promising data driven approach for automatic dialogue policy optimization, but a relatively large amount of dialogue interactions is needed before the system reaches reasonable performance. Recently, Gaussian process based reinforcement learning methods have been shown to reduce the number of dialogues needed to reach optimal performance, and pre-training the policy with data gathered from different dialogue systems has further reduced this amount. Following this idea, a dialogue system designed for a single speaker can be initialised with data from other speakers, but if the dynamics of the speakers are very different the model will have a poor performance. When data gathered from different speakers is available, selecting the data from the most similar ones might improve the performance. We propose a method which automatically selects the data to transfer by defining a similarity measure between speakers, and uses this measure to weight the influence of the data from each speaker in the policy model. The methods are tested by simulating users with different severities of dysarthria interacting with a voice enabled environmental control system.
meeting of the association for computational linguistics | 2017
Stefan Ultes; Lina M. Rojas Barahona; Pei-Hao Su; David Vandyke; Dongho Kim; Iñigo Casanueva; Pawel Budzianowski; Nikola Mrksic; Tsung-Hsien Wen; Milica Gasic; Steve J. Young
Statistical Spoken Dialogue Systems have been around for many years. However, access to these systems has always been difficult as there is still no publicly available end-to-end system implementation. To alleviate this, we present PyDial, an opensource end-to-end statistical spoken dialogue system toolkit which provides implementations of statistical approaches for all dialogue system modules. Moreover, it has been extended to provide multidomain conversational functionality. It offers easy configuration, easy extensibility, and domain-independent implementations of the respective dialogue system modules. The toolkit is available for download under the Apache 2.0 license.
conference of the international speech communication association | 2016
Iñigo Casanueva; Thomas Hain; Phil D. Green
Users with disabilities can greatly benefit from personalised voice-enabled environmental-control interfaces, but for users with speech impairments (e.g. dysarthria) poor ASR performance poses a challenge to successful dialogue. Statistical dialogue management has shown resilience against high ASR error rates, hence making it useful to improve the performance of these interfaces. However, little research was devoted to dialogue management personalisation to specific users so far. Recently, data driven discriminative models have been shown to yield the best performance in dialogue state tracking (the inference of the user goal from the dialogue history). However, due to the unique characteristics of each speaker, training a system for a new user when user specific data is not available can be challenging due to the mismatch between training and working conditions. This work investigates two methods to improve the performance with new speakers of a LSTM-based personalised state tracker: The use of speaker specific acoustic and ASRrelated features; and dropout regularisation. It is shown that in an environmental control system for dysarthric speakers, the combination of both techniques yields improvements of 3.5% absolute in state tracking accuracy. Further analysis explores the effect of using different amounts of speaker specific data to train the tracking system.
annual meeting of the special interest group on discourse and dialogue | 2016
Iñigo Casanueva; Thomas Hain; Mauro Nicolao; Phil D. Green
The generalisation of dialogue state tracking to unseen dialogue states can be very challenging. In a slot-based dialogue system, dialogue states lie in discrete space where distances between states cannot be computed. Therefore, the model parameters to track states unseen in the training data can only be estimated from more general statistics, under the assumption that every dialogue state will have the same underlying state tracking behaviour. However, this assumption is not valid. For example, two values, whose associated concepts have different ASR accuracy, may have different state tracking performance. Therefore, if the ASR performance of the concepts related to each value can be estimated, such estimates can be used as general features. The features will help to relate unseen dialogue states to states seen in the training data with similar ASR performance. Furthermore, if two phonetically similar concepts have similar ASR performance, the features extracted from the phonetic structure of the concepts can be used to improve generalisation. In this paper, ASR and phonetic structurerelated features are used to improve the dialogue state tracking generalisation to unseen states of an environmental control system developed for dysarthric speakers.
Proceedings of the Fourth Workshop on Speech and Language Processing for Assistive Technologies | 2013
Heidi Christensen; Iñigo Casanueva; Stuart P. Cunningham; Phil D. Green; Thomas Hain
conference of the international speech communication association | 2014
Iñigo Casanueva; Heidi Christensen; Thomas Hain; Phil D. Green
conference of the international speech communication association | 2017
Stefan Ultes; Pawel Budzianowski; Iñigo Casanueva; Nikola Mrksic; Lina Maria Rojas-Barahona; Pei-Hao Su; Tsung-Hsien Wen; Milica Gasic; Steve J. Young
annual meeting of the special interest group on discourse and dialogue | 2017
Pawel Budzianowski; Stefan Ultes; Pei-Hao Su; Nikola Mrksic; Tsung-Hsien Wen; Iñigo Casanueva; Lina Maria Rojas-Barahona; Milica Gasic
arXiv: Machine Learning | 2017
Iñigo Casanueva; Pawel Budzianowski; Pei-Hao Su; Nikola Mrksic; Tsung-Hsien Wen; Stefan Ultes; Lina Maria Rojas-Barahona; Steve J. Young; Milica Gasic
annual meeting of the special interest group on discourse and dialogue | 2017
Stefan Ultes; Pawel Budzianowski; Iñigo Casanueva; Nikola Mrksic; Lina M. Rojas Barahona; Pei-Hao Su; Tsung-Hsien Wen; Milica Gasic; Steve J. Young