Mohamed Anouar Ben Messaoud
Tunis El Manar University
Network
Latest external collaboration on country level. Dive into details by clicking on the dots.
Publication
Featured researches published by Mohamed Anouar Ben Messaoud.
Cognitive Computation | 2010
Mohamed Anouar Ben Messaoud; Aïcha Bouzid; Noureddine Ellouze
In this work, we present an algorithm for voiced/unvoiced decision and pitch estimation from speech signals. Our approach is based on classifying the peaks provided by the autocorrelation of the speech multi-scale product. The multi-scale product is based on making the product of the speech wavelet transform coefficients at three successive dyadic scales. The autocorrelation function of the multi-scale product is calculated over frames of a specific length. The experimental results show the robustness and the effectiveness of our approach. Besides, the proposed method outperforms some existing algorithms in a clean and noisy environment.
non-linear speech processing | 2009
Mohamed Anouar Ben Messaoud; Aicha Bouzid; Noureddine Ellouze
In this work, we present an algorithm for estimating the fundamental frequency in speech signals. Our approach is based on the spectral multi-scale product analysis. It consists of operating a short Fourier transform on the speech multi-scale product. The multi-scale product is based on making the product of the speech wavelet transform coefficients at three successive dyadic scales. The wavelet used is the quadratic spline function with a support of 0.8 ms. We estimate the pitch for each time frame based on its multi-scale product harmonic structure. We evaluate our approach on the Keele database. Experimental results show the effectiveness of our method presenting a good performance surpassing other algorithms. Besides, the proposed approach is robust for noisy speech.
international conference on advanced technologies for signal and image processing | 2016
Belhedi Wiem; Mohamed Anouar Ben Messaoud; Bouzid Aicha
We present an approach of single channel speech separation based on sinusoidal modeling and multi-scale product analysis. To construct the sinusoidal model of speech, we need to determine three parameters which are namely frequency, amplitude and phase. For fundamental frequency determination, we apply an effective method based on multi-scale product analysis. For amplitude estimation, we adopt an iterative peak picking algorithm. This approach effectively estimates multi-pitch of the desired speaker in a multiple speaker context.
International Journal of Speech Technology | 2016
Mohamed Anouar Ben Messaoud; Aicha Bouzid
The pitch is a crucial parameter in speech and music signals. However, due to severe noisy conditions, missing harmonics, unsuitable physical vibration, the determination of pitch presents a great challenge when desiring to get a good accuracy. In this paper, we propose a method for pitch estimation of speech and music sounds. Our method is based on the fast Fourier transform (FFT) of the multi-scale product (MP) provided by a feature auditory model of the sound signals. The auditory model simulates the spectral behaviour of the cochlea by a gammachirp filter-bank, and the out/middle ear filtering by a low-pass filter. For the two output channels, the FFT function of the MP is computed over frames. The MP is based on constituting the product of the speech and music wavelet transform coefficients at three scales. The experimental results show that our method estimates the pitch with high accuracy. Besides, our proposed method outperforms several other pitch detection algorithms in clean and noisy environments.
international conference on sciences of electronics technologies of information and telecommunications | 2012
Mohamed Anouar Ben Messaoud; Aicha Bouzid; Noureddine Ellouze
In this work, we present an algorithm for estimating the fundamental frequency in speech signals. Our approach is based on the spectral compression by the autocorrelation of the speech multi-scale product analysis. It consists of operating the product of compressed copies of the original spectrum on the multi-scale product autocorrelation. The multi-scale product is based on making the product of the speech wavelet transform coefficients at three successive dyadic scales. The wavelet used is the quadratic spline function with a support of 0.8 ms. We estimate the pitch for each time frame based on its multi-scale product autocorrelation of the harmonic product spectrum structure. We evaluate our approach on the Keele database. Experimental results show the effectiveness of our method presenting a good performance surpassing the other algorithms.
international conference on sciences of electronics technologies of information and telecommunications | 2016
Belhedi Wiem; Mohamed Anouar Ben Messaoud; Bouzid Aicha
In this paper we present a comparative analysis of different time-frequency (T-F) masking techniques used for single channel speech separation (SCSS). We survey T-F masking concept and compare different types of masks in different criteria. The comparison is conduct theoretically by mathematical study and numerically by objective and subjective assessment. Also, we study the effect of the masking techniques on the perceptual quality of speech and their ability to separate a target speech from monaural mixing.
Journal of The Audio Engineering Society | 2016
Mohamed Anouar Ben Messaoud; Aicha Bouzid
Advances in Electrical and Electronic Engineering | 2016
Mohamed Anouar Ben Messaoud; Aicha Bouzid
WSEAS Transactions on Signal Processing archive | 2017
Héla Khazri; Mohamed Anouar Ben Messaoud; Aicha Bouzid
Proceedings of the Joint Conference JEP-TALN-RECITAL 2012, volume 1: JEP | 2012
Mohamed Anouar Ben Messaoud; A"icha Bouzid; Noureddine Ellouze