Network


Latest external collaboration on country level. Dive into details by clicking on the dots.

Hotspot


Dive into the research topics where Alexander Gutkin is active.

Publication


Featured researches published by Alexander Gutkin.


Procedia Computer Science | 2016

Building Statistical Parametric Multi-speaker Synthesis for Bangladeshi Bangla

Alexander Gutkin; Linne Ha; Martin Jansche; Oddur Kjartansson; Knot Pipatsrisawat; Richard Sproat

Abstract We present a text-to-speech (TTS) system designed for the dialect of Bengali spoken in Bangladesh. This work is part of an ongoing effort to address the needs of new under-resourced languages. We propose a process for streamlining the bootstrapping of TTS systems for under-resourced languages. First, we use crowdsourcing to collect the data from multiple ordinary speakers, each speaker recording small amount of sentences. Second, we leverage an existing text normalization system for a related language (Hindi) to bootstrap a linguistic front-end for Bangla. Third, we employ statistical techniques to construct multi-speaker acoustic models using Long Short-term Memory Recurrent Neural Network (LSTM-RNN) and Hidden Markov Model (HMM) approaches. We then describe our experiments that show that the resulting TTS voices score well in terms of their perceived quality as measured by Mean Opinion Score (MOS) evaluations.


Archive | 2014

Method and System for Building Text-to-Speech Voice from Diverse Recordings

Ioannis Agiomyrgiannakis; Alexander Gutkin


Archive | 2013

Text-to-speech synthesis

Javier Gonzalvo Fructuoso; Alexander Gutkin


language resources and evaluation | 2016

TTS for Low Resource Languages: A Bangla Synthesizer

Alexander Gutkin; Linne Ha; Martin Jansche; Knot Pipatsrisawat; Richard Sproat


workshop spoken language technologies for under resourced languages | 2018

A Unified Phonological Representation of South Asian Languages for Multilingual Text-to-Speech

Isin Demirsahin; Martin Jansche; Alexander Gutkin


language resources and evaluation | 2018

FonBund: A Library for Combining Cross-lingual Phonological Segment Data.

Alexander Gutkin; Martin Jansche; Tatiana Merkulova


language resources and evaluation | 2018

Building Open Javanese and Sundanese Corpora for Multilingual Text-to-Speech

Jaka Aris Eko Wibawa; Supheakmungkol Sarin; Chen Fang Li; Knot Pipatsrisawat; Keshan Sodimana; Oddur Kjartansson; Alexander Gutkin; Martin Jansche; Linne Ha


Archive | 2018

Text Normalization for Bangla, Khmer, Nepali, Javanese, Sinhala, and Sundanese TTS Systems

Keshan Sodimana; Pasindu De Silva; Richard Sproat; A Theeraphol; Chen Fang Li; Alexander Gutkin; Supheakmungkol Sarin; Knot Pipatsrisawat


Archive | 2015

STATISTICAL UNIT SELECTION LANGUAGE MODELS BASED ON ACOUSTIC FINGERPRINTING

Alexander Gutkin; Javier Gonzalvo Fructuoso; Cyril Allauzen


Archive | 2014

JOINT MULTIGRAM-BASED DETECTION OF SPELLING VARIANTS

Matthew Nicholas Stuttle; Alexander Gutkin

Collaboration


Dive into the Alexander Gutkin's collaboration.

Researchain Logo
Decentralizing Knowledge