

Spoken Language Technology Workshop | 2012

Context-dependent Deep Neural Networks for audio indexing of real-life data

Gang Li; Huifeng Zhu; Gong Cheng; Kit Thambiratnam; Behrooz Chitsaz; Dong Yu; Frank Seide

We apply Context-Dependent Deep-Neural-Network HMMs, or CD-DNN-HMMs, to the real-life problem of audio indexing of data across various sources. We recently showed that on the Switchboard benchmark for speaker-independent transcription of phone calls, CD-DNN-HMMs with 7 hidden layers reduce the word error rate (WER) by as much as one-third compared to discriminatively trained Gaussian-mixture HMMs (GMM-HMMs), and by one-fourth if the GMM-HMM also uses fMPE features. This paper takes CD-DNN-HMM-based recognition into a real-life deployment for audio indexing. We find that for our best speaker-independent CD-DNN-HMM, with 32k senones trained on 2000h of data, the one-fourth reduction does carry over to inhomogeneous field data (video podcasts and talks). Compared to a speaker-adaptive GMM system, the relative improvement is 18%, at very similar end-to-end runtime. In system building, we find that DNNs can benefit from a larger number of senones than the GMM-HMM, and that DNN likelihood evaluation is a sizeable runtime factor even in our wide-beam context of generating rich lattices: cutting the model size by 60% reduces runtime by one-third at a 5% relative WER loss.


Archive | 2007

CONTENT SEARCH SERVICE, FINDING CONTENT, AND PREFETCHING FOR THIN CLIENT

Curtis G. Wong; Dale A. Sather; Kenneth Reneris; Thaddeus C. Pritchett; Behrooz Chitsaz; Talal A. Batrouny


Archive | 2007

INTELLIGENT NETWORK ADDRESS LOOKUP SERVICE

Sharad Agarwal; Najam Ahmad; Behrooz Chitsaz; Manuel Costa; Albert G. Greenberg; Parantap Lahiri; Venkata N. Padmanabhan


Archive | 2006

User presence detection for altering operation of a computing system

Behrooz Chitsaz; Darko Kirovski


Archive | 2007

VIRTUAL PERSONAL VIDEO RECORDER

Curtis G. Wong; Dale A. Sather; Kenneth Reneris; Thaddeus C. Pritchett; Behrooz Chitsaz; Talal A. Batrouny


Archive | 2006

Watchdog processors in multicore systems

Behrooz Chitsaz; Darko Kirovski


Archive | 2008

Resource equalization for inter- and intra- data center operations

James R. Hamilton; Rebecca A. Norlander; Michael J. Manos; Feng Zhao; David R. Treadwell; Behrooz Chitsaz


Archive | 2010

Managing Shared Sessions in a Shared Resource Computing Environment

Paul C. Sutton; Shahram Izadi; Behrooz Chitsaz


Archive | 2010

Dual-Mode, Dual-Display Shared Resource Computing

Shahram Izadi; Behrooz Chitsaz


Archive | 2010

Natural User Interaction in Shared Resource Computing Environment

Paul C. Sutton; Shahram Izadi; Behrooz Chitsaz
