Matt Shannon
University of Cambridge
Network
Latest external collaboration on country level. Dive into details by clicking on the dots.
Publication
Featured researches published by Matt Shannon.
IEEE Transactions on Audio, Speech, and Language Processing | 2013
Matt Shannon; Heiga Zen; William Byrne
We propose using the autoregressive hidden Markov model (HMM) for speech synthesis. The autoregressive HMM uses the same model for parameter estimation and synthesis in a consistent way, in contrast to the standard approach to statistical parametric speech synthesis. It supports easy and efficient parameter estimation using expectation maximization, in contrast to the trajectory HMM. At the same time its similarities to the standard approach allow use of established high quality synthesis algorithms such as speech parameter generation considering global variance. The autoregressive HMM also supports a speech parameter generation algorithm not available for the standard approach or the trajectory HMM and which has particular advantages in the domain of real-time, low latency synthesis. We show how to do efficient parameter estimation and synthesis with the autoregressive HMM and look at some of the similarities and differences between the standard approach, the trajectory HMM and the autoregressive HMM. We compare the three approaches in subjective and objective evaluations. We also systematically investigate which choices of parameters such as autoregressive order and number of states are optimal for the autoregressive HMM.
international conference on acoustics, speech, and signal processing | 2013
Matt Shannon; William Byrne
Speech parameter generation considering global variance (GV generation) is widely acknowledged to dramatically improve the quality of synthetic speech generated by HMM-based systems. However it is slower and has higher latency than the standard speech parameter generation algorithm. In addition it is known to produce artifacts, though existing approaches to prevent artifacts are effective. We present a simple new theoretical analysis of speech parameter generation considering global variance based on Lagrange multipliers. This analysis sheds light on one source of artifacts and suggests a way to reduce their occurrence. It also suggests an approximation to exact GV generation that allows fast, low latency synthesis. In a subjective evaluation our fast approximation shows no degradation in naturalness compared to conventional GV generation.
Proceedings of the 7th ISCA Speech Synthesis Workshop | 2010
Mirjam Wester; John Dines; Matthew Gibson; Hui Liang; Yi-Jian Wu; Lakshmi Saheer; Simon King; Keiichiro Oura; Philip N. Garner; William Byrne; Yong Guan; Teemu Hirsimäki; Reima Karhila; Mikko Kurimo; Matt Shannon; Sayaka Shiota; Jilei Tian; Keiichi Tokuda; Junichi Yamagishi
conference of the international speech communication association | 2014
Gustav Eje Henter; Thomas Merritt; Matt Shannon; Catherine Mayo; Simon King
meeting of the association for computational linguistics | 2010
Mikko Kurimo; William Byrne; John Dines; Philip N. Garner; Matthew Gibson; Yong Guan; Teemu Hirsimäki; Reima Karhila; Simon King; Hui Liang; Keiichiro Oura; Lakshmi Saheer; Matt Shannon; Sayaki Shiota; Jilei Tian
conference of the international speech communication association | 2011
Matt Shannon; Heiga Zen; William Byrne
conference of the international speech communication association | 2009
Matt Shannon; William Byrne
Archive | 2009
Matt Shannon; William Byrne
conference of the international speech communication association | 2017
Bo Li; Tara N. Sainath; Arun Narayanan; Joe Caroselli; Michiel Bacchiani; Ananya Misra; Izhak Shafran; Hasim Sak; Golan Pundak; Kean K. Chin; Khe Chai Sim; Ron J. Weiss; Kevin W. Wilson; Ehsan Variani; Chanwoo Kim; Olivier Siohan; Mitchel Weintraub; Erik McDermott; Richard C. Rose; Matt Shannon
conference of the international speech communication association | 2017
Matt Shannon