Network


Latest external collaboration on country level. Dive into details by clicking on the dots.

Hotspot


Dive into the research topics where Matt Shannon is active.

Publication


Featured researches published by Matt Shannon.


IEEE Transactions on Audio, Speech, and Language Processing | 2013

Autoregressive Models for Statistical Parametric Speech Synthesis

Matt Shannon; Heiga Zen; William Byrne

We propose using the autoregressive hidden Markov model (HMM) for speech synthesis. The autoregressive HMM uses the same model for parameter estimation and synthesis in a consistent way, in contrast to the standard approach to statistical parametric speech synthesis. It supports easy and efficient parameter estimation using expectation maximization, in contrast to the trajectory HMM. At the same time its similarities to the standard approach allow use of established high quality synthesis algorithms such as speech parameter generation considering global variance. The autoregressive HMM also supports a speech parameter generation algorithm not available for the standard approach or the trajectory HMM and which has particular advantages in the domain of real-time, low latency synthesis. We show how to do efficient parameter estimation and synthesis with the autoregressive HMM and look at some of the similarities and differences between the standard approach, the trajectory HMM and the autoregressive HMM. We compare the three approaches in subjective and objective evaluations. We also systematically investigate which choices of parameters such as autoregressive order and number of states are optimal for the autoregressive HMM.


international conference on acoustics, speech, and signal processing | 2013

Fast, low-artifact speech synthesis considering global variance

Matt Shannon; William Byrne

Speech parameter generation considering global variance (GV generation) is widely acknowledged to dramatically improve the quality of synthetic speech generated by HMM-based systems. However it is slower and has higher latency than the standard speech parameter generation algorithm. In addition it is known to produce artifacts, though existing approaches to prevent artifacts are effective. We present a simple new theoretical analysis of speech parameter generation considering global variance based on Lagrange multipliers. This analysis sheds light on one source of artifacts and suggests a way to reduce their occurrence. It also suggests an approximation to exact GV generation that allows fast, low latency synthesis. In a subjective evaluation our fast approximation shows no degradation in naturalness compared to conventional GV generation.


Proceedings of the 7th ISCA Speech Synthesis Workshop | 2010

Speaker adaptation and the evaluation of speaker similarity in the EMIME speech-to-speech translation project

Mirjam Wester; John Dines; Matthew Gibson; Hui Liang; Yi-Jian Wu; Lakshmi Saheer; Simon King; Keiichiro Oura; Philip N. Garner; William Byrne; Yong Guan; Teemu Hirsimäki; Reima Karhila; Mikko Kurimo; Matt Shannon; Sayaka Shiota; Jilei Tian; Keiichi Tokuda; Junichi Yamagishi


conference of the international speech communication association | 2014

Measuring the perceptual effects of modelling assumptions in speech synthesis using stimuli constructed from repeated natural speech

Gustav Eje Henter; Thomas Merritt; Matt Shannon; Catherine Mayo; Simon King


meeting of the association for computational linguistics | 2010

Personalising Speech-To-Speech Translation in the EMIME Project

Mikko Kurimo; William Byrne; John Dines; Philip N. Garner; Matthew Gibson; Yong Guan; Teemu Hirsimäki; Reima Karhila; Simon King; Hui Liang; Keiichiro Oura; Lakshmi Saheer; Matt Shannon; Sayaki Shiota; Jilei Tian


conference of the international speech communication association | 2011

The Effect of Using Normalized Models in Statistical Speech Synthesis.

Matt Shannon; Heiga Zen; William Byrne


conference of the international speech communication association | 2009

Autoregressive HMMs for speech synthesis

Matt Shannon; William Byrne


Archive | 2009

A formulation of the autoregressive HMM for speech synthesis

Matt Shannon; William Byrne


conference of the international speech communication association | 2017

Acoustic Modeling for Google Home

Bo Li; Tara N. Sainath; Arun Narayanan; Joe Caroselli; Michiel Bacchiani; Ananya Misra; Izhak Shafran; Hasim Sak; Golan Pundak; Kean K. Chin; Khe Chai Sim; Ron J. Weiss; Kevin W. Wilson; Ehsan Variani; Chanwoo Kim; Olivier Siohan; Mitchel Weintraub; Erik McDermott; Richard C. Rose; Matt Shannon


conference of the international speech communication association | 2017

Optimizing expected word error rate via sampling for speech recognition

Matt Shannon

Collaboration


Dive into the Matt Shannon's collaboration.

Top Co-Authors

Avatar
Top Co-Authors

Avatar

Simon King

University of Edinburgh

View shared research outputs
Top Co-Authors

Avatar
Top Co-Authors

Avatar

Heiga Zen

Nagoya Institute of Technology

View shared research outputs
Top Co-Authors

Avatar

Keiichiro Oura

Nagoya Institute of Technology

View shared research outputs
Top Co-Authors

Avatar
Top Co-Authors

Avatar
Top Co-Authors

Avatar

Teemu Hirsimäki

Helsinki University of Technology

View shared research outputs
Researchain Logo
Decentralizing Knowledge