Network


Latest external collaboration on country level. Dive into details by clicking on the dots.

Hotspot


Dive into the research topics where Hironori Doi is active.

Publication


Featured researches published by Hironori Doi.


IEEE Transactions on Audio, Speech, and Language Processing | 2014

Alaryngeal Speech Enhancement Based on One-to-Many Eigenvoice Conversion

Hironori Doi; Tomoki Toda; Keigo Nakamura; Hiroshi Saruwatari; Kiyohiro Shikano

In this paper, we present novel speaking-aid systems based on one-to-many eigenvoice conversion (EVC) to enhance three types of alaryngeal speech: esophageal speech, electrolaryngeal speech, and body-conducted silent electrolaryngeal speech. Although alaryngeal speech allows laryngectomees to utter speech sounds, it suffers from the lack of speech quality and speaker individuality. To improve the speech quality of alaryngeal speech, alaryngeal-speech-to-speech (AL-to-Speech) methods based on statistical voice conversion have been proposed. In this paper, one-to-many EVC capable of flexibly controlling the converted voice quality by adapting the conversion model to given target natural voices is further implemented for the AL-to-Speech methods to effectively recover speaker individuality of each type of alaryngeal speech. These proposed systems are compared with each other from various perspectives. The experimental results demonstrate that our proposed systems are capable of effectively addressing the issues of alaryngeal speech, e.g., yielding significant improvements in speech quality of each type of alaryngeal speech.


international conference on acoustics, speech, and signal processing | 2011

An evaluation of alaryngeal speech enhancement methods based on voice conversion techniques

Hironori Doi; Keigo Nakamura; Tomoki Toda; Hiroshi Saruwatari; Kiyohiro Shikano

In this study, we evaluate our proposed methods for enhancing alaryngeal speech based on statistical voice conversion techniques. Voice conversion based on a Gaussian mixture model has been applied to the conversion of alaryngeal speech into normal speech (AL-to-Speech). Moreover, one-to-many eigenvoice conversion (EVC) has also been applied to AL-to-Speech to enable the recovery of the original voice quality of laryngectomees even if only one arbitrary utterance of the original voice is available. VC/EVC-based AL-to-Speech systems have been developed for several types of alaryngeal speech, such as esophageal speech (ES), electrolaryngeal speech (EL), and body-conducted silent electrolaryngeal speech (silent EL). These proposed systems are compared with each other from various perspectives. The experimental results demonstrate that our proposed systems yield significant enhancement effects on each type of alaryngeal speech.


international conference on acoustics, speech, and signal processing | 2012

Statistical approach to voice quality control in esophageal speech enhancement

Kenzo Yamamoto; Tomoki Toda; Hironori Doi; Hiroshi Saruwatari; Kiyohiro Shikano

This paper describes a voice quality control method in statistical esophageal speech enhancement. Esophageal speech is produced by one of the alternative speaking methods for laryngectomees. Its naturalness and intelligibility are much lower than those of natural voices and its voice quality sounds similar even if uttered by different laryngectomees. These issues are alleviated by a statistical voice conversion method from esophageal speech into normal speech (ES-to-Speech) based on eigenvoices. This method is capable of determining converted voice quality using a few target voice samples. In this paper, we propose ES-to-Speech using regression techniques to make it possible to manually control the converted voice quality by manipulating a few intuitively controllable parameters even if no target voice sample is available. The effectiveness of the proposed method is confirmed by experimental evaluations.


IEICE Transactions on Information and Systems | 2010

Esophageal Speech Enhancement Based on Statistical Voice Conversion with Gaussian Mixture Models

Hironori Doi; Keigo Nakamura; Tomoki Toda; Hiroshi Saruwatari; Kiyohiro Shikano


asia pacific signal and information processing association annual summit and conference | 2012

Singing voice conversion method based on many-to-many eigenvoice conversion and training data generation using a singing-to-singing synthesis system

Hironori Doi; Tomoki Toda; Tomoyasu Nakano; Masataka Goto; Satoshi Nakamura


international conference on acoustics, speech, and signal processing | 2010

Statistical approach to enhancing esophageal speech based on Gaussian mixture models

Hironori Doi; Keigo Nakamura; Tomoki Toda; Hiroshi Saruwatari; Kiyohiro Shikano


IEICE Transactions on Information and Systems | 2014

Voice Timbre Control Based on Perceived Age in Singing Voice Conversion

Kazuhiro Kobayashi; Tomoki Toda; Hironori Doi; Tomoyasu Nakano; Masataka Goto; Graham Neubig; Sakriani Sakti; Satoshi Nakamura


conference of the international speech communication association | 2013

An Investigation of Acoustic Features for Singing Voice Conversion based on Perceptual Age

Kazuhiro Kobayashi; Hironori Doi; Tomoki Toda; Tomoyasu Nakano; Masataka Goto; Graham Neubig; Sakriani Sakti; Satoshi Nakamura


asia pacific signal and information processing association annual summit and conference | 2009

Enhancement of esophageal speech using statistical voice conversion

Hironori Doi; Keigo Nakamura; Tomoki Toda; Hiroshi Saruwatari; Kiyohiro Shikano


conference of the international speech communication association | 2013

Evaluation of a Singing Voice Conversion Method Based on Many-to-Many Eigenvoice Conversion

Hironori Doi; Tomoki Toda; Tomoyasu Nakano; Masataka Goto; Satoshi Nakamura

Collaboration


Dive into the Hironori Doi's collaboration.

Top Co-Authors

Avatar
Top Co-Authors

Avatar
Top Co-Authors

Avatar

Kiyohiro Shikano

Nara Institute of Science and Technology

View shared research outputs
Top Co-Authors

Avatar

Keigo Nakamura

Nara Institute of Science and Technology

View shared research outputs
Top Co-Authors

Avatar

Satoshi Nakamura

Nara Institute of Science and Technology

View shared research outputs
Top Co-Authors

Avatar

Sakriani Sakti

Nara Institute of Science and Technology

View shared research outputs
Top Co-Authors

Avatar

Tomoyasu Nakano

National Institute of Advanced Industrial Science and Technology

View shared research outputs
Top Co-Authors

Avatar

Kazuhiro Kobayashi

Nara Institute of Science and Technology

View shared research outputs
Top Co-Authors

Avatar

Daisuke Deguchi

Nara Institute of Science and Technology

View shared research outputs
Researchain Logo
Decentralizing Knowledge