Scott Otterson | Researchain

Publication

Featured research published by Scott Otterson.

IEEE Transactions on Speech and Audio Processing | 2002

Graceful degradation of speech recognition performance over packet-erasure networks

Constantinos Boulis; Mari Ostendorf; Eve A. Riskin; Scott Otterson

This paper explores packet loss recovery for automatic speech recognition (ASR) in spoken dialog systems, assuming an architecture in which a lightweight client communicates with a remote ASR server. Speech is transmitted with source and channel codes optimized for the ASR application, i.e., to minimize word error rate. Unequal amounts of forward error correction, depending on the data's effect on ASR performance, are assigned to protect against packet loss. Experiments with simulated packet loss in a range of loss conditions are conducted on the DARPA Communicator (air travel information) task. Results show that the approach provides robust ASR performance that degrades gracefully as packet loss rates increase. Transmitting at 5.2 Kbps with up to 200 ms added delay leads to only a 7% relative degradation in word error rate, even under extremely adverse network conditions.

IEEE Automatic Speech Recognition and Understanding Workshop | 2007

Efficient use of overlap information in speaker diarization

Scott Otterson; Mari Ostendorf

Speaker overlap in meetings is thought to be a significant contributor to error in speaker diarization, but it is not clear if overlaps are problematic for speaker clustering and/or if errors could be addressed by assigning multiple labels in overlap regions. In this paper, we look at these issues experimentally, assuming perfect detection of overlaps, to assess the relative importance of these problems and the potential impact of overlap detection. With our best features, we find that detecting overlaps could potentially improve diarization accuracy by 15% relative, using a simple strategy of assigning speaker labels in overlap regions according to the labels of the neighboring segments. In addition, the use of cross-correlation features with MFCCs reduces the performance gap due to overlaps, so that there is little gain from removing overlapped regions before clustering.

International Conference on Machine Learning | 2004

The 2004 ICSI-SRI-UW meeting recognition system

Chuck Wooters; Nikki Mirghafori; Andreas Stolcke; Tuomo W. Pirinen; Ivan Bulyko; David Gelbart; Martin Graciarena; Scott Otterson; Barbara Peskin; Mari Ostendorf

The paper describes our system for recognizing speech in meetings, which was an entry in the NIST Spring 2004 Meeting Recognition Evaluation. This system was developed as a collaborative effort between ICSI, SRI, and UW and was based on SRI's 5xRT Conversational Telephone Speech (CTS) recognizer. The CTS system was adapted to the meeting domain by adapting its acoustic and language models, adding noise reduction and delay-sum array processing for far-field recognition, and adding postprocessing for cross-talk suppression for close-talking microphones. A modified MAP adaptation procedure was developed to make best use of discriminatively trained (MMIE) prior models. These meeting-specific changes yielded an overall 9% and 22% relative improvement as compared to the original CTS system, and 16% and 29% relative improvement as compared to our 2002 Meeting Evaluation system, for the individual-headset and multiple-distant-microphone conditions, respectively.

International Conference on Acoustics, Speech, and Signal Processing | 2008

Progress in meeting recognition: the ICSI-SRI-UW Spring 2004 evaluation system

Andreas Stolcke; Chuck Wooters; Nikki Mirghafori; Tuomo W. Pirinen; Ivan Bulyko; Dave Gelbart; Martin Graciarena; Scott Otterson; Barbara Peskin; Mari Ostendorf

Archive | 1996

Method and system for Doppler ultrasound audio dealiasing

Ja-Il Koo; Scott Otterson

Conference of the International Speech Communication Association | 2007

Improved Location Features for Meeting Speaker Diarization

Scott Otterson

Archive | 1997

Method and apparatus for estimation and display of spectral broadening error margin for Doppler time-velocity waveforms

Larry Y. L. Mo; Scott Otterson

Conference of the International Speech Communication Association | 2004

From Switchboard to meetings: development of the 2004 ICSI-SRI-UW meeting recognition system

Andreas Stolcke; Chuck Wooters; Ivan Bulyko; Martin Graciarena; Scott Otterson; Barbara Peskin; Mari Ostendorf; David Gelbart; Nikki Mirghafori; Tuomo W. Pirinen

Archive | 1996