Publication


Featured research published by Evandro Gouvea.


2008 Hands-Free Speech Communication and Microphone Arrays | 2008

Binaural and Multiple-Microphone Signal Processing Motivated by Auditory Perception

Richard M. Stern; Evandro Gouvea; Chanwoo Kim; Kshitiz Kumar; Hyung-Min Park

It is well known that binaural processing is very useful for separating incoming sound sources as well as for improving the intelligibility of speech in reverberant environments. This paper describes and compares a number of ways in which the classic model of interaural cross-correlation proposed by Jeffress, quantified by Colburn, and further elaborated by Blauert, Lindemann, and others, can be applied to improving the accuracy of automatic speech recognition systems operating in cluttered, noisy, and reverberant environments. Typical implementations begin with an abstraction of cross-correlation of the incoming signals after nonlinear monaural bandpass processing, but there are many alternative implementation choices that can be considered. Typical implementations differ in the ways in which an enhanced version of the desired signal is developed using binaural principles, in the extent to which specific processing mechanisms are used to impose suppression motivated by the precedence effect, and in the precise mechanism used to extract interaural time differences.
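
The interaural time difference extraction mentioned in this abstract can be illustrated with a minimal sketch. It assumes the simplest possible mechanism: pick the lag at which the cross-correlation of the left and right signals peaks. The function name `estimate_itd` and its parameters are hypothetical, and the bandpass filtering and nonlinear monaural processing described in the abstract are omitted, so this is not the paper's implementation.

```python
def estimate_itd(left, right, max_lag):
    """Return the lag (in samples) that maximizes the cross-correlation.

    A positive result means the right signal lags the left signal.
    Assumption: both inputs are equal-length lists of floats for one band.
    """
    n = len(left)
    best_lag, best_score = 0, float("-inf")
    for lag in range(-max_lag, max_lag + 1):
        score = 0.0
        for t in range(n):
            u = t + lag
            if 0 <= u < n:
                # Accumulate the product of left[t] and the shifted right sample.
                score += left[t] * right[u]
        if score > best_score:
            best_lag, best_score = lag, score
    return best_lag


# Hypothetical usage: the right channel is the left channel delayed by 3 samples.
left = [0.0] * 10 + [1.0, 2.0, 1.0] + [0.0] * 10
right = [0.0] * 3 + left[:-3]
lag = estimate_itd(left, right, max_lag=8)
```

Dividing the returned lag by the sampling rate gives the delay in seconds.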


Journal of the Acoustical Society of America | 2008

Polyaural array processing for robust automatic speech recognition in noisy and reverberant environments

Richard M. Stern; Evandro Gouvea; Kshitiz Kumar

It is well known that human binaural processing is very useful for separating incoming sound sources as well as for improving the intelligibility of speech in reverberant environments. In this paper we present a new method of signal processing for robust speech recognition using multiple microphones. The method, loosely based on the human binaural hearing system, consists of passing the speech signals detected by multiple microphones through bandpass filtering and nonlinear halfwave rectification operations, and then cross‐correlating the outputs from each channel within each frequency band. These operations provide rejection of off‐axis interfering signals. These operations are repeated (in a non‐physiological fashion) for the negative of the signal, and an estimate of the desired signal is obtained by combining the positive and negative outputs. We demonstrate that the use of this approach provides substantially better recognition accuracy than delay‐and‐sum beamforming using the same sensors for target...
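
The processing chain in this abstract (halfwave rectification, combination across channels, a second pass on the negated signal, and a final positive/negative combination) can be sketched for a single frequency band as follows. Several things here are assumptions rather than the authors' method: the input is taken to be already bandpass filtered and time aligned, the cross-correlation step is replaced by a zero-lag product across channels followed by an N-th root, and the names `half_wave` and `combine_band` are hypothetical.

```python
import math


def half_wave(x):
    """Half-wave rectification: keep positive samples, zero everything else."""
    return [s if s > 0.0 else 0.0 for s in x]


def combine_band(channels):
    """Combine time-aligned microphone signals for one frequency band.

    Assumption: a zero-lag product across channels, followed by an N-th root
    to restore the amplitude scale, stands in for the cross-correlation step.
    """
    n = len(channels)
    length = len(channels[0])

    def branch(sign):
        # Rectify each channel (or its negative), then multiply across channels.
        rectified = [half_wave([sign * s for s in ch]) for ch in channels]
        return [
            math.prod(r[t] for r in rectified) ** (1.0 / n)
            for t in range(length)
        ]

    positive = branch(1.0)
    negative = branch(-1.0)
    # The negative branch is subtracted so both signal polarities are restored.
    return [p - q for p, q in zip(positive, negative)]


# Hypothetical usage: three microphones receiving the same on-axis signal.
x = [math.sin(2 * math.pi * 0.05 * t) for t in range(40)]
on_axis = combine_band([x, x, x])
off_axis = combine_band([x, [-s for s in x]])
```

In this simplified form, a signal that is identical in every channel passes through unchanged, while a component that arrives with opposite polarity in two channels is suppressed, which is one way to picture the rejection of off-axis interference.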


Archive | 2004

Sphinx-4: a flexible open source framework for speech recognition

Willie Walker; Paul Lamere; Philip Kwok; Bhiksha Raj; Rita Singh; Evandro Gouvea; Peter Wolf; Joe Woelfel


International Conference on Acoustics, Speech, and Signal Processing | 1995

Multivariate-Gaussian-based cepstral normalization for robust speech recognition

Pedro J. Moreno; Bhiksha Raj; Evandro Gouvea; Richard M. Stern


Conference of the International Speech Communication Association | 1997

Speaker normalization through formant-based warping of the frequency scale

Evandro Gouvea; Richard M. Stern


International Conference on Spoken Language Processing | 1996

Cepstral compensation by polynomial approximation for environment-independent speech recognition

Bhiksha Raj; Evandro Gouvea; Pedro J. Moreno; Richard M. Stern


Archive | 1995

Recognition of continuous broadcast news with multiple unknown speakers and environments

Uday K. Jain; Matthew Siegler; Sam-Joo Doh; Evandro Gouvea; Juan M. Huerta; Pedro J. Moreno; Bhiksha Raj; Richard M. Stern


Conference of the International Speech Communication Association | 2007

Polyaural array processing for automatic speech recognition in degraded environments

Richard M. Stern; Evandro Gouvea; Govindarajan Thattai


Artificial Intelligence in Education | 2005

A Generic Tool to Browse Tutor-Student Interactions: Time Will Tell!

Jack Mostow; Joseph E. Beck; Andrew Cuneo; Evandro Gouvea; Cecily Heiner


Archive | 2009

Lessons from Project LISTEN’s Session Browser

Jack Mostow; Joseph E. Beck; Andrew Cuneo; Evandro Gouvea; Cecily Heiner; Octavio Juarez

Collaboration


Dive into Evandro Gouvea's collaborations.

Top Co-Authors

Richard M. Stern

Carnegie Mellon University

Bhiksha Raj

Carnegie Mellon University

Jack Mostow

Carnegie Mellon University

Joseph E. Beck

Worcester Polytechnic Institute

Kshitiz Kumar

Carnegie Mellon University

Pedro J. Moreno

Carnegie Mellon University

Andrew Cuneo

Carnegie Mellon University

Chanwoo Kim

Carnegie Mellon University

Rita Singh

Carnegie Mellon University
