Hironori Doi
Nara Institute of Science and Technology
Network
Latest external collaboration on country level. Dive into details by clicking on the dots.
Publication
Featured researches published by Hironori Doi.
IEEE Transactions on Audio, Speech, and Language Processing | 2014
Hironori Doi; Tomoki Toda; Keigo Nakamura; Hiroshi Saruwatari; Kiyohiro Shikano
In this paper, we present novel speaking-aid systems based on one-to-many eigenvoice conversion (EVC) to enhance three types of alaryngeal speech: esophageal speech, electrolaryngeal speech, and body-conducted silent electrolaryngeal speech. Although alaryngeal speech allows laryngectomees to utter speech sounds, it suffers from the lack of speech quality and speaker individuality. To improve the speech quality of alaryngeal speech, alaryngeal-speech-to-speech (AL-to-Speech) methods based on statistical voice conversion have been proposed. In this paper, one-to-many EVC capable of flexibly controlling the converted voice quality by adapting the conversion model to given target natural voices is further implemented for the AL-to-Speech methods to effectively recover speaker individuality of each type of alaryngeal speech. These proposed systems are compared with each other from various perspectives. The experimental results demonstrate that our proposed systems are capable of effectively addressing the issues of alaryngeal speech, e.g., yielding significant improvements in speech quality of each type of alaryngeal speech.
international conference on acoustics, speech, and signal processing | 2011
Hironori Doi; Keigo Nakamura; Tomoki Toda; Hiroshi Saruwatari; Kiyohiro Shikano
In this study, we evaluate our proposed methods for enhancing alaryngeal speech based on statistical voice conversion techniques. Voice conversion based on a Gaussian mixture model has been applied to the conversion of alaryngeal speech into normal speech (AL-to-Speech). Moreover, one-to-many eigenvoice conversion (EVC) has also been applied to AL-to-Speech to enable the recovery of the original voice quality of laryngectomees even if only one arbitrary utterance of the original voice is available. VC/EVC-based AL-to-Speech systems have been developed for several types of alaryngeal speech, such as esophageal speech (ES), electrolaryngeal speech (EL), and body-conducted silent electrolaryngeal speech (silent EL). These proposed systems are compared with each other from various perspectives. The experimental results demonstrate that our proposed systems yield significant enhancement effects on each type of alaryngeal speech.
international conference on acoustics, speech, and signal processing | 2012
Kenzo Yamamoto; Tomoki Toda; Hironori Doi; Hiroshi Saruwatari; Kiyohiro Shikano
This paper describes a voice quality control method in statistical esophageal speech enhancement. Esophageal speech is produced by one of the alternative speaking methods for laryngectomees. Its naturalness and intelligibility are much lower than those of natural voices and its voice quality sounds similar even if uttered by different laryngectomees. These issues are alleviated by a statistical voice conversion method from esophageal speech into normal speech (ES-to-Speech) based on eigenvoices. This method is capable of determining converted voice quality using a few target voice samples. In this paper, we propose ES-to-Speech using regression techniques to make it possible to manually control the converted voice quality by manipulating a few intuitively controllable parameters even if no target voice sample is available. The effectiveness of the proposed method is confirmed by experimental evaluations.
IEICE Transactions on Information and Systems | 2010
Hironori Doi; Keigo Nakamura; Tomoki Toda; Hiroshi Saruwatari; Kiyohiro Shikano
asia pacific signal and information processing association annual summit and conference | 2012
Hironori Doi; Tomoki Toda; Tomoyasu Nakano; Masataka Goto; Satoshi Nakamura
international conference on acoustics, speech, and signal processing | 2010
Hironori Doi; Keigo Nakamura; Tomoki Toda; Hiroshi Saruwatari; Kiyohiro Shikano
IEICE Transactions on Information and Systems | 2014
Kazuhiro Kobayashi; Tomoki Toda; Hironori Doi; Tomoyasu Nakano; Masataka Goto; Graham Neubig; Sakriani Sakti; Satoshi Nakamura
conference of the international speech communication association | 2013
Kazuhiro Kobayashi; Hironori Doi; Tomoki Toda; Tomoyasu Nakano; Masataka Goto; Graham Neubig; Sakriani Sakti; Satoshi Nakamura
asia pacific signal and information processing association annual summit and conference | 2009
Hironori Doi; Keigo Nakamura; Tomoki Toda; Hiroshi Saruwatari; Kiyohiro Shikano
conference of the international speech communication association | 2013
Hironori Doi; Tomoki Toda; Tomoyasu Nakano; Masataka Goto; Satoshi Nakamura
Collaboration
Dive into the Hironori Doi's collaboration.
National Institute of Advanced Industrial Science and Technology
View shared research outputs