Nicolas Sturmel
Centre national de la recherche scientifique
Network
Latest external collaboration on country level. Dive into details by clicking on the dots.
Publication
Featured researches published by Nicolas Sturmel.
IEEE Transactions on Audio, Speech, and Language Processing | 2013
Nicolas Sturmel; Laurent Daudet
This paper presents a technique for Informed Source Separation (ISS) of a single channel mixture, based on the Multiple Input Spectrogram Inversion (MISI) phase estimation method. The reconstruction of the source signals is iterative, alternating between a time-frequency consistency enforcement and a re-mixing constraint. A dual resolution technique is also proposed, for sharper transients reconstruction. The two algorithms are compared to a state-of-the-art Wiener-based ISS technique, on a database of fourteen monophonic mixtures, with standard source separation objective measures. Experimental results show that the proposed algorithms outperform both this reference technique and the oracle Wiener filter by up to 3 dB in distortion, at the cost of a significantly heavier computation.
international conference on acoustics, speech, and signal processing | 2009
Nicolas Sturmel; Christophe d'Alessandro; Francois Rigaud
The Lines Of Maximum Amplitude (LOMA) of the wavelet transform are used for glottal closure instant detection. Following Kadambe & al. (1992), the wavelet transform modulus maxima can be used for singularity detection. The LOMA method extends this idea. All the lines chaining maxima of a wavelet transform across scales are built. Then a back-tracking procedure allows for selection of the optimal line for each pitch period, the top of which indicates the GCI. The LOMA method is then evaluated by comparing its results to the DYPSA (Naylor & al.) algorithm, with the option of using inverse filtering as preprocessing. The LOMA method compares favorably to DYPSA, particularly on accuracy. One of the advantage of the LOMA method is its ability to deal with variations in the glottal source parameters.
international conference on acoustics, speech, and signal processing | 2012
Nicolas Sturmel; Laurent Daudet
This paper deals with phase estimation in the framework of underdetermined blind source separation, using an estimated spectrogram of the source and its associated Wiener filter. By thresholding the Wiener mask, two domains are defined on the spectrogram : a confidence domain where the phase is kept as the phase of the mixture, and its complement where the phase is updated with a projection similar to the widely-used Griffin and Lim technique. We show that with this simple technique, the choice of parameters results in a simple trade-off between distortion and interference. Experiments show that this technique brings significant improvements over the classical Wiener filter, while being much faster than other iterative methods.
non-linear speech processing | 2007
Christophe d'Alessandro; Baris Bozkurt; Boris Doval; Thierry Dutoit; Nathalie Henrich; Vu Ngoc Tuan; Nicolas Sturmel
Voice source analysis is an important but difficult issue for speech processing. In this talk, three aspects of voice source analysis recently developed at LIMSI (Orsay, France) and FPMs (Mons, Belgium) are discussed. In a first part, time domain and spectral domain modelling of glottal flow signals are presented. It is shown that the glottal flow can be modelled as an anticausal filter (maximum phase) before the glottal closing, and as a causal filter (minimum phase) after the glottal closing. In a second part, taking advantage of this phase structure, causal and anticausal components of the speech signal are separated according to the location in the Z-plane of the zeros of the Z-Transform (ZZT) of the windowed signal. This method is useful for voice source parameters analysis and source-tract deconvolution. Results of a comparative evaluation of the ZZT and linear prediction for source/tract separation are reported. In a third part, glottal closing instant detection using the phase of the wavelet transform is discussed. A method based on the lines of maximum phase in the time-scale plane is proposed. This method is compared to EGG for robust glottal closing instant analysis.
Journal of The Audio Engineering Society | 2012
Nicolas Sturmel; Antoine Liutkus; Jonathan Pinel; Laurent Girin; Sylvain Marchand; Gaël Richard; Roland Badeau; Laurent Daudet
Sadhana-academy Proceedings in Engineering Sciences | 2011
Christophe d’Alessandro; Nicolas Sturmel
conference of the international speech communication association | 2007
Nicolas Sturmel; Christophe d'Alessandro; Boris Doval
european signal processing conference | 2012
Antoine Liutkus; Stanislaw Gorlow; Nicolas Sturmel; Shuhua Zhang; Laurent Girin; Roland Badeau; Laurent Daudet; Sylvain Marchand; Gaël Richard
Journal of The Audio Engineering Society | 2012
Sylvain Marchand; Roland Badeau; Cléo Baras; Laurent Daudet; Dominique Fourer; Laurent Girin; Stanislaw Gorlow; Antoine Liutkus; Jonathan Pinel; Gaël Richard; Nicolas Sturmel; Shuhua Zhang
15th International Conference on Digital Audio Effects (DAFx 2012) | 2012
Nicolas Sturmel; Laurent Daudet; Laurent Girin