Network


Latest external collaboration on country level. Dive into details by clicking on the dots.

Hotspot


Dive into the research topics where Stan McClellan is active.

Publication


Featured researches published by Stan McClellan.


IEEE Transactions on Speech and Audio Processing | 1997

Variable-rate CELP based on subband flatness

Stan McClellan; Jerry D. Gibson

Code-excited linear prediction (CELP) is the predominant methodology for communications quality speech coding below 8 kbps, and several variable-rate CELP schemes have been discussed in the literature, including QCELP, the variable-rate wideband digital cellular mobile radio speech coding standard specified in IS-95. A key component of these speech coders is the detection and classification of speech activity, and several cues for rate variation have been studied, such as measuring the short-term speech energy, deciding whether the speech is voiced or unvoiced, or making more sophisticated phonetic classifications. We present a new method for rate variation based on a measure of subband spectral flatness, called spectral entropy. Spectral entropy is a normalized indicator of the texture of the input spectrum and is thus less dependent on speech and background noise energy variations. We present some results on the use of spectral entropy for voice activity detection across subbands and then evaluate using spectral entropy for deriving mode and rate allocation cues for a variable-rate CELP coder operating at an average rate of 2 kbps. To achieve communications quality speech at this rate, we develop a new split-band vector quantization (VQ) technique for representing the line spectral pairs and a multiple codebook approach for efficiently quantizing the coefficients of a three-tap pitch predictor, called lag-indexed VQ.


international conference on acoustics, speech, and signal processing | 1994

Spectral entropy: an alternative indicator for rate allocation?

Stan McClellan; Jerry D. Gibson

We introduce an approach to speech segment classification that differs from the usual energy, correlation, and zero-crossing criteria. Instead, we measure the gross shape of the short-term speech spectrum using spectral entropy to derive some indication of effective bandwidth. We propose to lower the required encoding rate by compensating for dynamic variations in signal bandwidth. We show that the spectral entropy can be used effectively to determine regions of voicing activity even in extreme background noise.<<ETX>>


asilomar conference on signals, systems and computers | 1993

Spectral entropy and coefficient rate for speech coding

Jerry D. Gibson; Steven P. Stanners; Stan McClellan

Variable rate speech coding is well-suited for network and wireless communications and is necessary to maintain good speech quality and intelligibility at ever-lower rates. In 1960 Campbell used a version of the asymptotic equipartition property (AEP) to derive a relationship between the entropy of the source power spectral density and the minimum coefficient rate required to encode the source. We analyze Campbells coefficient rate expression and investigate its properties for autoregressive (AR) processes and for speech. We compare the coefficient rate to the familiar entropy rate power, and to the AIC model order criterion of Akaike (1971), and consider these quantities as rate indicators for dynamically varying the rate of speech coders.<<ETX>>


IEEE Transactions on Speech and Audio Processing | 1999

Efficient pitch filter encoding for variable rate speech processing

Stan McClellan; J.D. Gibson; B.K. Rutherford

Analysis-by-synthesis techniques are used in a wide variety of speech coding standards and applications for rates below 16 kbps. The presence of a long-term predictor, commonly known as the adaptive codebook, is critical to coder performance at the lower rates. Unfortunately, the encoding rate and computational requirements for high-quality encoding of pitch filter parameters can be excessive. Several popular approaches explore the trade-off between predictor order, allocated bit rate, and computational requirements for long-term predictor optimization. We investigate the relative performance of several long-term predictor structures and present a new approach to vector quantization of the pitch filter coefficients having a subjective quality equivalent to other schemes, but at a lower coding rate and requiring significantly less closed-loop computation. The performance is evaluated in a variable-rate CELP coder at an average rate of 2 kbps and in Federal Standard 1016 CELP.


international conference on communications | 1995

Variable rate CELP based on subband flatness

Stan McClellan; Jerry D. Gibson

With the standardization of Qualcomms QCELP and the deployment of various digital multiple-access networks, the implementation of variable-rate speech coding schemes has become an area of significant interest. Code-excited linear prediction (CELP) is the predominant coding methodology for communications quality speech coding below 8 kbps, and several variable-rate CELP schemes have been discussed in the literature. We propose a variable-rate CELP architecture wherein cues for rate variation are derived from subband measures of spectral flatness using the entropy functional. We also discuss a variable-rate coding scheme for multiple-tap pitch filters. Using reasonable assumptions about voice activity and instantaneous speech bandwidths, our coder can achieve an average rate of below 2000 bits/sec while maintaining communications quality in the encoded speech.


international conference on acoustics speech and signal processing | 1996

Lag-indexed VQ for pitch filter coding

Stan McClellan; Jerry D. Gibson

The presence of a pitch predictor is critical to low-rate performance of CELP coders. Unfortunately the rate required for high-quality encoding of pitch filter parameters is often a large fraction of the available bandwidth. Moreover, the application of analysis-by-synthesis techniques to pitch filter optimization can require excessive computation. We present a vector quantization (VQ) approach for coding pitch filter parameters which maintains a subjective quality equivalent to other coding schemes while requiring lower (variable) rate with less closed-loop computation than other VQ techniques.


southeastcon | 1996

Variable rate vector quantization of the speech spectral envelope

Stan McClellan; Jerry D. Gibson

Effective rate variation during active speech is a necessary component of sophisticated variable rate speech compression schemes. Here, we use open-loop estimates of spectral shape to roughly determine signal bandwidth and bit allocations for variable rate encoding of spectral parameters. We analyze the application of the relative entropy functional to sets of line-spectrum pairs (LSPs) and transform-based generalized spectral distributions of Gibson et al. (1993). We present experimental results demonstrating that the relative entropy of these quantities can be used to good advantage in developing variable rate vector quantization schemes for the spectral envelope of speech signals.


international symposium on information theory | 1995

Distortion measures for variable rate coding

Stan McClellan; Jerry D. Gibson

We apply the relative entropy functional to sets of line-spectrum pairs (LSPs) and transform-based generalized spectral pmfs of Gibson et al. (1993) and present experimental results for sequence segmentation and vector quantization which show that the relative entropy of these quantities is a useful indicator for variable-rate speech coding.


southeastcon | 1997

Telepath: real-time remote pathology

Stan McClellan; Thomas S. Winokur


international conference of the ieee engineering in medicine and biology society | 2004

An automated tissue preclassification approach for telepathology: implementation and performance analysis

Mark Barr; Stan McClellan; Thomas S. Winokur; Gregg Vaughn

Collaboration


Dive into the Stan McClellan's collaboration.

Top Co-Authors

Avatar
Top Co-Authors

Avatar

Thomas S. Winokur

University of Alabama at Birmingham

View shared research outputs
Top Co-Authors

Avatar

Gregg Vaughn

University of Alabama at Birmingham

View shared research outputs
Researchain Logo
Decentralizing Knowledge