Circuits, Systems, and Signal Processing | 2019

Wavelet-Based Power Normalized Spectrum for Hindi Phoneme Classification

 
 

Abstract


This paper presents a wavelet-based power normalized spectrum for computing robust cepstral features, named WP-PNCC features. The proposed technique computes a wavelet packet-based short-time spectrum of the speech signal. A nonlinear function is defined that relates the power spectrum of clean speech to the power spectrum of speech corrupted with noise. The constants of this function are computed from a longer-duration speech spectrum, and the short-time spectrum of each frame is weighted with the power function. The weighted speech spectrum is then passed through a logarithm and a discrete cosine transform to obtain cepstral coefficients, which are further processed with a quantile-based cepstral dynamics normalization technique. The proposed features are evaluated with a hidden Markov model classifier on the TIFR database for Hindi phoneme classification and on the TIMIT database for English phoneme classification, alongside mel-frequency cepstral coefficients, power normalized cepstral coefficients and 24-band wavelet-based features, in clean and noisy environments. Different noise types from the NOISEX-92 database are used to prepare the noisy database, with SNR ranging from 20 dB to 0 dB. The results show enhanced performance of the proposed features in all the considered cases. The simulations are performed in MATLAB 2015b. The performance of the proposed features is also evaluated on a Hidden Markov Model Toolkit-based speech recognition system. The comparative results confirm the robustness of the proposed features, with sufficient improvement over the other features examined in this paper.
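The noisy test condition described in the abstract, speech mixed with noise at SNRs from 20 dB to 0 dB, can be illustrated with a minimal sketch. This is not the authors' code (their simulations were run in MATLAB). It only shows the standard way to scale a noise segment so that the speech-to-noise power ratio hits a target value. The function names `mean_power`, `mix_at_snr` and `measured_snr_db`, the synthetic sine "speech", the Gaussian noise, and the 5 dB step between conditions are all assumptions made for illustration. The abstract states only the 20 dB to 0 dB range.

```python
import math
import random


def mean_power(x):
    """Average power of a sequence: the mean of its squared samples."""
    return sum(v * v for v in x) / len(x)


def mix_at_snr(speech, noise, target_snr_db):
    """Scale `noise` and add it to `speech` so the mixture has the target SNR.

    Returns (mixed, scaled_noise). Hypothetical helper, not from the paper.
    """
    if len(noise) < len(speech):
        raise ValueError("noise must be at least as long as speech")
    noise = noise[:len(speech)]
    p_speech = mean_power(speech)
    p_noise = mean_power(noise)
    if p_speech == 0 or p_noise == 0:
        raise ValueError("speech and noise must both have non-zero power")
    # Choose gain g so that p_speech / (g**2 * p_noise) == 10**(SNR/10).
    gain = math.sqrt(p_speech / (p_noise * 10 ** (target_snr_db / 10)))
    scaled = [gain * n for n in noise]
    mixed = [s + n for s, n in zip(speech, scaled)]
    return mixed, scaled


def measured_snr_db(speech, noise):
    """SNR in dB between a speech signal and the noise added to it."""
    return 10 * math.log10(mean_power(speech) / mean_power(noise))


if __name__ == "__main__":
    rng = random.Random(0)
    # Stand-ins for real data: a 440 Hz tone at 16 kHz and white Gaussian noise.
    speech = [math.sin(2 * math.pi * 440 * t / 16000) for t in range(16000)]
    noise = [rng.gauss(0.0, 1.0) for _ in range(16000)]
    for target in (20, 15, 10, 5, 0):  # assumed 5 dB steps
        _, scaled = mix_at_snr(speech, noise, target)
        print(target, round(measured_snr_db(speech, scaled), 6))
```

In a real experiment the `speech` and `noise` lists would be replaced by samples read from the TIFR or TIMIT utterances and the NOISEX-92 recordings. Computing the power over the whole utterance, as done here, is one common convention; other conventions (for example, measuring power over active speech only) give different scaling.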

Pages 1-20
DOI 10.1007/S00034-019-01113-1
Language English
Journal Circuits, Systems, and Signal Processing
