Network


Latest external collaboration on country level. Dive into details by clicking on the dots.

Hotspot


Dive into the research topics where Réda Dehak is active.

Publication


Featured researches published by Réda Dehak.


IEEE Transactions on Audio, Speech, and Language Processing | 2011

Front-End Factor Analysis for Speaker Verification

Najim Dehak; Patrick Kenny; Réda Dehak; Pierre Dumouchel; Pierre Ouellet

This paper presents an extension of our previous work which proposes a new speaker representation for speaker verification. In this modeling, a new low-dimensional speaker- and channel-dependent space is defined using a simple factor analysis. This space is named the total variability space because it models both speaker and channel variabilities. Two speaker verification systems are proposed which use this new representation. The first system is a support vector machine-based system that uses the cosine kernel to estimate the similarity between the input data. The second system directly uses the cosine similarity as the final decision score. We tested three channel compensation techniques in the total variability space, which are within-class covariance normalization (WCCN), linear discriminate analysis (LDA), and nuisance attribute projection (NAP). We found that the best results are obtained when LDA is followed by WCCN. We achieved an equal error rate (EER) of 1.12% and MinDCF of 0.0094 using the cosine distance scoring on the male English trials of the core condition of the NIST 2008 Speaker Recognition Evaluation dataset. We also obtained 4% absolute EER improvement for both-gender trials on the 10 s-10 s condition compared to the classical joint factor analysis scoring.


international conference on acoustics, speech, and signal processing | 2009

Support vector machines and Joint Factor Analysis for speaker verification

Najim Dehak; Patrick Kenny; Réda Dehak; Ondrej Glembek; Pierre Dumouchel; Lukas Burget; Valiantsina Hubeika; Fabio Castaldo

This article presents several techniques to combine between Support vector machines (SVM) and Joint Factor Analysis (JFA) model for speaker verification. In this combination, the SVMs are applied to different sources of information produced by the JFA. These informations are the Gaussian Mixture Model supervectors and speakers and Common factors. We found that using SVM in JFA factors gave the best results especially when within class covariance normalization method is applied in order to compensate for the channel effect. The new combination results are comparable to other classical JFA scoring techniques.


IEEE Transactions on Audio, Speech, and Language Processing | 2013

Unsupervised Methods for Speaker Diarization: An Integrated and Iterative Approach

Stephen Shum; Najim Dehak; Réda Dehak; James R. Glass

In speaker diarization, standard approaches typically perform speaker clustering on some initial segmentation before refining the segment boundaries in a re-segmentation step to obtain a final diarization hypothesis. In this paper, we integrate an improved clustering method with an existing re-segmentation algorithm and, in iterative fashion, optimize both speaker cluster assignments and segmentation boundaries jointly. For clustering, we extend our previous research using factor analysis for speaker modeling. In continuing to take advantage of the effectiveness of factor analysis as a front-end for extracting speaker-specific features (i.e., i-vectors), we develop a probabilistic approach to speaker clustering by applying a Bayesian Gaussian Mixture Model (GMM) to principal component analysis (PCA)-processed i-vectors. We then utilize information at different temporal resolutions to arrive at an iterative optimization scheme that, in alternating between clustering and re-segmentation steps, demonstrates the ability to improve both speaker cluster assignments and segmentation boundaries in an unsupervised manner. Our proposed methods attain results that are comparable to those of a state-of-the-art benchmark set on the multi-speaker CallHome telephone corpus. We further compare our system with a Bayesian nonparametric approach to diarization and attempt to reconcile their differences in both methodology and performance.


international conference on acoustics, speech, and signal processing | 2011

A channel-blind system for speaker verification

Najim Dehak; Zahi N. Karam; Douglas A. Reynolds; Réda Dehak; William M. Campbell; James R. Glass

The majority of speaker verification systems proposed in the NIST speaker recognition evaluation are conditioned on the type of data to be processed: telephone or microphone. In this paper, we propose a new speaker verification system that can be applied to both types of data. This system, named blind system, is based on an extension of the total variability framework. Recognition results with the proposed channel-independent system are comparable to state of the art systems that require conditioning on the channel type. Another advantage of our proposed system is that it allows for combining data from multiple channels in the same visualization in order to explore the effects of different microphones and collection environments.


conference of the international speech communication association | 2009

Support vector machines versus fast scoring in the low-dimensional total variability space for speaker verification

Najim Dehak; Réda Dehak; Patrick Kenny; Niko Brümmer; Pierre Ouellet; Pierre Dumouchel


conference of the international speech communication association | 2011

Language Recognition via i-vectors and Dimensionality Reduction.

Najim Dehak; Pedro A. Torres-Carrasquillo; Douglas A. Reynolds; Réda Dehak


conference of the international speech communication association | 2009

Cepstral and long-term features for emotion recognition

Pierre Dumouchel; Najim Dehak; Yazid Attabi; Réda Dehak; Narjès Boufaden


Archive | 2011

Language Recognition via Ivectors and Dimensionality Reduction

Najim Dehak; Pedro A. Torres-Carrasquillo; Douglas A. Reynolds; Réda Dehak


conference of the international speech communication association | 2006

Linear and Non Linear Kernel GMM SuperVector Machines for Speaker Verification

Réda Dehak; Najim Dehak; Patrick Kenny; Pierre Dumouchel


Odyssey | 2010

Unsupervised Speaker Adaptation based on the Cosine Similarity for Text-Independent Speaker Verification.

Stephen Shum; Najim Dehak; Réda Dehak; James R. Glass

Collaboration


Dive into the Réda Dehak's collaboration.

Top Co-Authors

Avatar

Najim Dehak

Massachusetts Institute of Technology

View shared research outputs
Top Co-Authors

Avatar

Pierre Dumouchel

École de technologie supérieure

View shared research outputs
Top Co-Authors

Avatar
Top Co-Authors

Avatar

Pedro A. Torres-Carrasquillo

Massachusetts Institute of Technology

View shared research outputs
Top Co-Authors

Avatar

Douglas A. Reynolds

Massachusetts Institute of Technology

View shared research outputs
Top Co-Authors

Avatar

Douglas E. Sturim

Massachusetts Institute of Technology

View shared research outputs
Top Co-Authors

Avatar

Fred Richardson

Massachusetts Institute of Technology

View shared research outputs
Top Co-Authors

Avatar

James R. Glass

Massachusetts Institute of Technology

View shared research outputs
Top Co-Authors

Avatar

Stephen Shum

Massachusetts Institute of Technology

View shared research outputs
Top Co-Authors

Avatar

William M. Campbell

Massachusetts Institute of Technology

View shared research outputs
Researchain Logo
Decentralizing Knowledge