Is this you? Create Your Porfile

Nicolas Sturmel

Centre national de la recherche scientifique

Archive Network Publication Hotspot Collaboration

Network

Latest external collaboration on country level. Dive into details by clicking on the dots.

Explore More

Hotspot

Dive into the research topics where Nicolas Sturmel is active.

Explore More

Publication

Featured researches published by Nicolas Sturmel.

IEEE Transactions on Audio, Speech, and Language Processing | 2013

Informed Source Separation Using Iterative Reconstruction

Nicolas Sturmel; Laurent Daudet

This paper presents a technique for Informed Source Separation (ISS) of a single channel mixture, based on the Multiple Input Spectrogram Inversion (MISI) phase estimation method. The reconstruction of the source signals is iterative, alternating between a time-frequency consistency enforcement and a re-mixing constraint. A dual resolution technique is also proposed, for sharper transients reconstruction. The two algorithms are compared to a state-of-the-art Wiener-based ISS technique, on a database of fourteen monophonic mixtures, with standard source separation objective measures. Experimental results show that the proposed algorithms outperform both this reference technique and the oracle Wiener filter by up to 3 dB in distortion, at the cost of a significantly heavier computation.

international conference on acoustics, speech, and signal processing | 2009

Glottal closure instant detection using Lines of Maximum Amplitudes (LOMA) of thewavelet transform

Nicolas Sturmel; Christophe d'Alessandro; Francois Rigaud

The Lines Of Maximum Amplitude (LOMA) of the wavelet transform are used for glottal closure instant detection. Following Kadambe & al. (1992), the wavelet transform modulus maxima can be used for singularity detection. The LOMA method extends this idea. All the lines chaining maxima of a wavelet transform across scales are built. Then a back-tracking procedure allows for selection of the optimal line for each pitch period, the top of which indicates the GCI. The LOMA method is then evaluated by comparing its results to the DYPSA (Naylor & al.) algorithm, with the option of using inverse filtering as preprocessing. The LOMA method compares favorably to DYPSA, particularly on accuracy. One of the advantage of the LOMA method is its ability to deal with variations in the glottal source parameters.

international conference on acoustics, speech, and signal processing | 2012

Iterative phase reconstruction of wiener filtered signals

Nicolas Sturmel; Laurent Daudet

This paper deals with phase estimation in the framework of underdetermined blind source separation, using an estimated spectrogram of the source and its associated Wiener filter. By thresholding the Wiener mask, two domains are defined on the spectrogram : a confidence domain where the phase is kept as the phase of the mixture, and its complement where the phase is updated with a projection similar to the widely-used Griffin and Lim technique. We show that with this simple technique, the choice of parameters results in a simple trade-off between distortion and interference. Experiments show that this technique brings significant improvements over the classical Wiener filter, while being much faster than other iterative methods.

non-linear speech processing | 2007

Phase-based methods for voice source analysis

Christophe d'Alessandro; Baris Bozkurt; Boris Doval; Thierry Dutoit; Nathalie Henrich; Vu Ngoc Tuan; Nicolas Sturmel

Voice source analysis is an important but difficult issue for speech processing. In this talk, three aspects of voice source analysis recently developed at LIMSI (Orsay, France) and FPMs (Mons, Belgium) are discussed. In a first part, time domain and spectral domain modelling of glottal flow signals are presented. It is shown that the glottal flow can be modelled as an anticausal filter (maximum phase) before the glottal closing, and as a causal filter (minimum phase) after the glottal closing. In a second part, taking advantage of this phase structure, causal and anticausal components of the speech signal are separated according to the location in the Z-plane of the zeros of the Z-Transform (ZZT) of the windowed signal. This method is useful for voice source parameters analysis and source-tract deconvolution. Results of a comparative evaluation of the ZZT and linear prediction for source/tract separation are reported. In a third part, glottal closing instant detection using the phase of the wavelet transform is discussed. A method based on the lines of maximum phase in the time-scale plane is proposed. This method is compared to EGG for robust glottal closing instant analysis.

Journal of The Audio Engineering Society | 2012

Linear Mixing Models for Active Listening of Music Productions in Realistic Studio Conditions

Nicolas Sturmel; Antoine Liutkus; Jonathan Pinel; Laurent Girin; Sylvain Marchand; Gaël Richard; Roland Badeau; Laurent Daudet

Sadhana-academy Proceedings in Engineering Sciences | 2011

Glottal closure instant and voice source analysis using time-scale lines of maximum amplitude

Christophe d’Alessandro; Nicolas Sturmel

conference of the international speech communication association | 2007

A comparative evaluation of the zeros of z transform representation for voice source estimation.

Nicolas Sturmel; Christophe d'Alessandro; Boris Doval

european signal processing conference | 2012

Informed audio source separation: A comparative study

Antoine Liutkus; Stanislaw Gorlow; Nicolas Sturmel; Shuhua Zhang; Laurent Girin; Roland Badeau; Laurent Daudet; Sylvain Marchand; Gaël Richard

Journal of The Audio Engineering Society | 2012

DReaM: A Novel System for Joint Source Separation and Multi-Track Coding

Sylvain Marchand; Roland Badeau; Cléo Baras; Laurent Daudet; Dominique Fourer; Laurent Girin; Stanislaw Gorlow; Antoine Liutkus; Jonathan Pinel; Gaël Richard; Nicolas Sturmel; Shuhua Zhang

15th International Conference on Digital Audio Effects (DAFx 2012) | 2012