Erik Visser
University of California, San Diego
Network
Latest external collaboration on country level. Dive into details by clicking on the dots.
Publication
Featured researches published by Erik Visser.
Speech Communication | 2003
Erik Visser; Manabu Otsuka; Te-Won Lee
Abstract A new speech enhancement scheme is presented integrating spatial and temporal signal processing methods for robust speech recognition in noisy environments. The scheme first separates spatially localized point sources from noisy speech signals recorded by two microphones. Blind source separation algorithms assuming no a priori knowledge about the sources involved are applied in this spatial processing stage. Then denoising of distributed background noise is achieved in a combined spatial/temporal processing approach. The desired speaker signal is first processed along with an artificially constructed noise signal in a supplementary blind source separation step. It is further denoised by exploiting differences in temporal speech and noise statistics in a wavelet filterbank. The scheme’s performance is illustrated by speech recognition experiments on real recordings in a noisy car environment. In comparison to a common multi-microphone technique like beamforming with spectral subtraction, the scheme is shown to enable more accurate speech recognition in the presence of a highly interfering point source and strong background noise.
international conference on acoustics, speech, and signal processing | 2004
Erik Visser; Te-Won Lee
A speech enhancement scheme including blind source separation and background denoising based on minimum statistics is studied in mobile environments. To accommodate the dependence of the separated output signals on the spatial properties of the recorded source signals, these blind signal processing steps are complemented by an adaptive separated output channel selection stage using prior knowledge about the desired speaker speech content. The resulting scheme performance is illustrated by speech recognition experiments on real recordings corrupted by various noise sources and shown to outperform conventional beamforming and single channel denoising techniques as well as an equivalent scheme with fixed output channel selection.
international conference on acoustics, speech, and signal processing | 2007
Erik Visser
Traditional passive broadband source localization techniques like maximum likelihood estimation and MUSIC have shown difficulties in situations where multiple correlating source signals are interfering with each other. Blind source separation (BSS) algorithms on the other hand have demonstrated good performance in separating correlated mixture signals into independent sources. In this paper it will be shown that the performance of traditional source localization algorithms can be improved by using a permutation-free frequency domain BSS algorithm as a front end. In addition a source localization method based solely on information gained from the separated BSS solution and sensor array architecture is presented. The methodologies are illustrated in an undercomplete acoustic scenario involving 3 speech sources and a 6 element microphone array.
Archive | 2005
Erik Visser; Te-Won Lee
Archive | 2005
Erik Visser; Jeremy Toman; Tom Davis; Brian Momeyer
Archive | 2005
Erik Visser; Jeremy Toman; Kwokleung Chan
Archive | 2004
Erik Visser; Tzyy-Ping Jung; Kwokleung Chan
Chemometrics and Intelligent Laboratory Systems | 2004
Erik Visser; Te-Won Lee
Archive | 2008
Erik Visser; Kwokleung Chan; Hyun Jin Park
conference of the international speech communication association | 2003
Kaisheng Yao; Erik Visser; Oh-Wook Kwon; Te-Won Lee