Jon A. Arrowood
Georgia Institute of Technology
Network
Latest external collaboration on country level. Dive into details by clicking on the dots.
Publication
Featured researches published by Jon A. Arrowood.
international conference on acoustics, speech, and signal processing | 2005
Robert W. Morris; Jon A. Arrowood; Mark A. Clements
Noise mitigation systems for speech coding and recognition have primarily focused on spectral subtraction techniques due to their well understood behavior and computational simplicity. As computation complexity becomes a smaller constraint, understanding the characteristics of different estimation schemes becomes more important. The merits of two algorithms based on direct estimation of the linear prediction spectrum of a speech signal are explored. These algorithms are maximum likelihood (ML) and minimum mean square error estimation (MMSE) of the autoregressive speech spectrum. The MMSE algorithm is able to improve objective quality effectively at low SNRs while also improving the speech recognition accuracy by 20-30% on the Aurora2 test set at the cost of requiring two orders of magnitude more operations than the ML method. Because of these improvements, autoregressive based algorithms should be considered in the future for noise robust speech processing tasks.
international conference on acoustics, speech, and signal processing | 2004
Jon A. Arrowood; Mark A. Clements
This paper presents a novel extension to vector quantization referred to as extended cluster information (ECI). In this method the decoder retains more general statistics about the VQ clusters found during codebook training than the single prototypical point of conventional VQ systems. Typically this information is unnecessary, however if the items being compressed are feature space vectors used as input to a statistical pattern classification system, the extra probabilistic information can be used during the classification as in Bayes predictive classification (BPC) to improve recognition results. To demonstrate ECI-VQ, a simple experiment is described where the Aurora2 distributed speech recognition front end is altered to provide more aggressive mel frequency cepstral coefficient (MFCC) compression. As the bit-rate drops, the corresponding recognition performance suffers. It is then shown that using ECI-VQ as the input to an uncertain observation (UO) speech recognizer, a number of errors due to compression can be corrected with no extra cost in bit-rate.
north american chapter of the association for computational linguistics | 2004
Robert W. Morris; Jon A. Arrowood; Peter S. Cardillo; Mark A. Clements
When evaluating wordspotting systems, one normally compares receiver operating characteristic curves and different measures of accuracy. However, there are many other factors that are relevant to the systems usability for searching speech. In this paper, we discuss both measures of quality for confidence scores and propose algorithms for producing scores that are optimal with respect to these criteria.
conference of the international speech communication association | 2002
Jon A. Arrowood; Mark A. Clements
Archive | 2008
Jon A. Arrowood; Robert W. Morris; Mark Finlay; Scott A. Judy
Archive | 2009
Robert W. Morris; Jon A. Arrowood; Mark A. Clements; Kenneth King Griggs; Peter S. Cardillo; Marsal Gavalda
Archive | 2003
Jon A. Arrowood; Mark A. Clements
Archive | 2009
Robert W. Morris; Marsal Gavalda; Peter S. Cardillo; Jon A. Arrowood
Archive | 2011
Jacob Benjamin Garland; Jon A. Arrowood; Drew Lanham; Marsal Gavalda
conference of the international speech communication association | 2003
Robert W. Morris; Jon A. Arrowood; Mark A. Clements