Hitoshi Honda
Sony Broadcast & Professional Research Laboratories
Network
Latest external collaboration on country level. Dive into details by clicking on the dots.
Publication
Featured researches published by Hitoshi Honda.
international conference on acoustics speech and signal processing | 1998
Naoto Iwahashi; Hongchang Pao; Hitoshi Honda; Katsuki Minamino; Masanori Omote
This paper describes a novel technique for noise robust speech recognition, which can incorporate the characteristics of noise distribution directly in features. The feature itself of each analysis frame has a stochastic form, which can represent the probability density function of the estimated speech component in the noisy speech. Using the sequence of the probability density functions of the estimated speech components and hidden Markov modelling of clean speech, the observation probability of the noisy speech is calculated. In the whole process of the technique, the explicit information on the SNR is not used. The technique is evaluated by large vocabulary isolated word recognition under car noise environment, and is found to have clearly outperformed nonlinear spectral subtraction (with between 13% and 44% reduction in recognition errors).
International Journal of Speech Technology | 2007
Xavier Menendez-Pidal; Ajay Patrikar; Lex Olorenshaw; Hitoshi Honda
In this paper we discuss two techniques to reduce the size of the acoustic model while maintaining or improving the accuracy of the recognition engine. The first technique, demiphone modeling, tries to reduce the redundancy existing in a context dependent state-clustered Hidden Markov Model (HMM). Three-state demiphones optimally designed from the triphone decision tree are introduced to drastically reduce the phone space of the acoustic model and to improve system accuracy. The second redundancy elimination technique is a more classical approach based on parameter tying. Similar vectors of variances in each HMM cluster are tied together to reduce the number of parameters. The closeness between the vectors of variances is measured using a Vector Quantizer (VQ) to maintain the information provided by the variances parameters. The paper also reports speech recognition improvements using assignment of variable number Gaussians per cluster and gender-based HMMs. The main motivation behind these techniques is to improve the acoustic model and at the same time lower its memory usage. These techniques may help in reducing memory and improving accuracy of an embedded Large Vocabulary Continuous Speech Recognition (LVCSR) application.
Journal of the Acoustical Society of America | 2008
Hitoshi Honda; Masanori Omote; Hiroaki Ogawa; Hongchang Pao
Archive | 2001
Yaeko Fujita; Lucke Helmut; Hitoshi Honda; Hiroaki Ogawa; Eiji Tamaru; Junichi Yamashita; ルッケ ヘルムート; 浩明 小川; 潤一 山下; 等 本田; 英司 田丸; 八重子 藤田
Archive | 2005
Naoto Iwahashi; Hongchang Bao; Hitoshi Honda
Archive | 2008
Motoki Nakade; Hiroaki Ogawa; Hitoshi Honda; Yoshinori Kurata; Daisuke Ishizuka
Archive | 2011
Satoshi Asakawa; Atsuo Hiroe; Hiroaki Ogawa; Hitoshi Honda; Tsutomu Sawada
Archive | 2010
Yoshinori Maeda; Hitoshi Honda; Katsuki Minamino
Archive | 1998
Koji Asano; Osamu Hamada; Hitoshi Honda; Katsuki Minamino; 活樹 南野; 等 本田; 康治 浅野; 修 浜田
Archive | 2010
Katsuki Minamino; Hitoshi Honda; Yoshinori Maeda; Hiroaki Ogawa
Collaboration
Dive into the Hitoshi Honda's collaboration.
National Institute of Information and Communications Technology
View shared research outputs