Stan McClellan
University of Birmingham
Network
Latest external collaboration on country level. Dive into details by clicking on the dots.
Publication
Featured researches published by Stan McClellan.
IEEE Transactions on Speech and Audio Processing | 1997
Stan McClellan; Jerry D. Gibson
Code-excited linear prediction (CELP) is the predominant methodology for communications quality speech coding below 8 kbps, and several variable-rate CELP schemes have been discussed in the literature, including QCELP, the variable-rate wideband digital cellular mobile radio speech coding standard specified in IS-95. A key component of these speech coders is the detection and classification of speech activity, and several cues for rate variation have been studied, such as measuring the short-term speech energy, deciding whether the speech is voiced or unvoiced, or making more sophisticated phonetic classifications. We present a new method for rate variation based on a measure of subband spectral flatness, called spectral entropy. Spectral entropy is a normalized indicator of the texture of the input spectrum and is thus less dependent on speech and background noise energy variations. We present some results on the use of spectral entropy for voice activity detection across subbands and then evaluate using spectral entropy for deriving mode and rate allocation cues for a variable-rate CELP coder operating at an average rate of 2 kbps. To achieve communications quality speech at this rate, we develop a new split-band vector quantization (VQ) technique for representing the line spectral pairs and a multiple codebook approach for efficiently quantizing the coefficients of a three-tap pitch predictor, called lag-indexed VQ.
international conference on acoustics, speech, and signal processing | 1994
Stan McClellan; Jerry D. Gibson
We introduce an approach to speech segment classification that differs from the usual energy, correlation, and zero-crossing criteria. Instead, we measure the gross shape of the short-term speech spectrum using spectral entropy to derive some indication of effective bandwidth. We propose to lower the required encoding rate by compensating for dynamic variations in signal bandwidth. We show that the spectral entropy can be used effectively to determine regions of voicing activity even in extreme background noise.<<ETX>>
asilomar conference on signals, systems and computers | 1993
Jerry D. Gibson; Steven P. Stanners; Stan McClellan
Variable rate speech coding is well-suited for network and wireless communications and is necessary to maintain good speech quality and intelligibility at ever-lower rates. In 1960 Campbell used a version of the asymptotic equipartition property (AEP) to derive a relationship between the entropy of the source power spectral density and the minimum coefficient rate required to encode the source. We analyze Campbells coefficient rate expression and investigate its properties for autoregressive (AR) processes and for speech. We compare the coefficient rate to the familiar entropy rate power, and to the AIC model order criterion of Akaike (1971), and consider these quantities as rate indicators for dynamically varying the rate of speech coders.<<ETX>>
IEEE Transactions on Speech and Audio Processing | 1999
Stan McClellan; J.D. Gibson; B.K. Rutherford
Analysis-by-synthesis techniques are used in a wide variety of speech coding standards and applications for rates below 16 kbps. The presence of a long-term predictor, commonly known as the adaptive codebook, is critical to coder performance at the lower rates. Unfortunately, the encoding rate and computational requirements for high-quality encoding of pitch filter parameters can be excessive. Several popular approaches explore the trade-off between predictor order, allocated bit rate, and computational requirements for long-term predictor optimization. We investigate the relative performance of several long-term predictor structures and present a new approach to vector quantization of the pitch filter coefficients having a subjective quality equivalent to other schemes, but at a lower coding rate and requiring significantly less closed-loop computation. The performance is evaluated in a variable-rate CELP coder at an average rate of 2 kbps and in Federal Standard 1016 CELP.
international conference on communications | 1995
Stan McClellan; Jerry D. Gibson
With the standardization of Qualcomms QCELP and the deployment of various digital multiple-access networks, the implementation of variable-rate speech coding schemes has become an area of significant interest. Code-excited linear prediction (CELP) is the predominant coding methodology for communications quality speech coding below 8 kbps, and several variable-rate CELP schemes have been discussed in the literature. We propose a variable-rate CELP architecture wherein cues for rate variation are derived from subband measures of spectral flatness using the entropy functional. We also discuss a variable-rate coding scheme for multiple-tap pitch filters. Using reasonable assumptions about voice activity and instantaneous speech bandwidths, our coder can achieve an average rate of below 2000 bits/sec while maintaining communications quality in the encoded speech.
international conference on acoustics speech and signal processing | 1996
Stan McClellan; Jerry D. Gibson
The presence of a pitch predictor is critical to low-rate performance of CELP coders. Unfortunately the rate required for high-quality encoding of pitch filter parameters is often a large fraction of the available bandwidth. Moreover, the application of analysis-by-synthesis techniques to pitch filter optimization can require excessive computation. We present a vector quantization (VQ) approach for coding pitch filter parameters which maintains a subjective quality equivalent to other coding schemes while requiring lower (variable) rate with less closed-loop computation than other VQ techniques.
southeastcon | 1996
Stan McClellan; Jerry D. Gibson
Effective rate variation during active speech is a necessary component of sophisticated variable rate speech compression schemes. Here, we use open-loop estimates of spectral shape to roughly determine signal bandwidth and bit allocations for variable rate encoding of spectral parameters. We analyze the application of the relative entropy functional to sets of line-spectrum pairs (LSPs) and transform-based generalized spectral distributions of Gibson et al. (1993). We present experimental results demonstrating that the relative entropy of these quantities can be used to good advantage in developing variable rate vector quantization schemes for the spectral envelope of speech signals.
international symposium on information theory | 1995
Stan McClellan; Jerry D. Gibson
We apply the relative entropy functional to sets of line-spectrum pairs (LSPs) and transform-based generalized spectral pmfs of Gibson et al. (1993) and present experimental results for sequence segmentation and vector quantization which show that the relative entropy of these quantities is a useful indicator for variable-rate speech coding.
southeastcon | 1997
Stan McClellan; Thomas S. Winokur
international conference of the ieee engineering in medicine and biology society | 2004
Mark Barr; Stan McClellan; Thomas S. Winokur; Gregg Vaughn