William R. Gardner | Researchain

Archive Network Publication Hotspot Collaboration

Network

Latest external collaboration on country level. Dive into details by clicking on the dots.

Explore More

Hotspot

Dive into the research topics where William R. Gardner is active.

Explore More

Publication

Featured researches published by William R. Gardner.

Journal of the Acoustical Society of America | 1998

Variable rate vocoder

Paul E. Jacobs; William R. Gardner; Chong U. Lee; Klein S. Gilhousen; S. Katherine Lam; Ming-Chang Tsai

A method of speech signal compression, by variable rate coding of frames of digitized speech samples, comprising the steps of: determining a level of speech activity for a frame of digitized speech samples; selecting an encoding rate from a set of rates based upon said determined level of speech activity for said frame; coding said frame according to a coding format of a set of coding formats for said selected rate wherein each rate has a corresponding different coding format and wherein each coding format provides for a different plurality of parameter signals representing said digitized speech samples in accordance with a speech model; and generating for said frame a data packet of said parameter signals.

Archive | 1993

QCELP: A Variable Rate Speech Coder for CDMA Digital Cellular

William R. Gardner; Paul E. Jacobs; Chong Lee

Digital cellular telephone systems require efficient encoding of speech to achieve capacity improvements required of the next generation of cellular systems. The use of a variable rate speech coder can reduce the average data rate required to transmit conversational speech by a factor of two or more, while providing many other advantages. This reduction in average data rate leads to a factor of two increase in the capacity of a Code Division Multiple Access, or CDMA, based digital cellular telephone system by decreasing the mutual interference among users. This chapter describes “QCELP,” a variable rate speech coder which has been selected as the speech coding algorithm for the TIA North American digital cellular standard based on CDMA technology.

EURASIP Journal on Advances in Signal Processing | 2001

Techniques for the regeneration of wideband speech from narrowband speech

Jason A. Fuemmeler; Russell C. Hardie; William R. Gardner

This paper addresses the problem of reconstructing wideband speech signals from observed narrowband speech signals. The goal of this work is to improve the perceived quality of speech signals which have been transmitted through narrowband channels or degraded during acquisition. We describe a system, based on linear predictive coding, for estimating wideband speech from narrowband. This system employs both previously identified and novel techniques. Experimental results are provided in order to illustrate the systems ability to improve speech quality. Both objective and subjective criteria are used to evaluate the quality of the processed speech signals.

IEEE Transactions on Audio, Speech, and Language Processing | 2006

Low-Complexity Source Coding Using Gaussian Mixture Models, Lattice Vector Quantization, and Recursive Coding with Application to Speech Spectrum Quantization

Anand D. Subramaniam; William R. Gardner; Bhaskar D. Rao

In this paper, we use the Gaussian mixture model (GMM) based multidimensional companding quantization framework to develop two important quantization schemes. In the first scheme, the scalar quantization in the companding framework is replaced by more efficient lattice vector quantization. Low-complexity lattice pruning and quantization schemes are provided for the

IEEE Transactions on Speech and Audio Processing | 1997

Noncausal all-pole modeling of voiced speech

William R. Gardner; Bhaskar D. Rao

E_8

international conference on acoustics, speech, and signal processing | 2003

Joint source-channel decoding of speech spectrum parameters over erasure channels using Gaussian mixture models

Anand D. Subramaniam; William R. Gardner; Bhaskar D. Rao

Gossett lattice. At moderate to high bit rates, the proposed scheme recovers much of the space-filling loss due to the product vector quantizers (PVQ) employed in earlier work, and thereby, provides improved performance with a marginal increase in complexity. In the second scheme, we generalize the compression framework to accommodate recursive coding. In this approach, the joint probability density function (PDF) of the parameter vectors of successive source frames is modeled using a GMM. The conditional density of the parameter vector of the current source frame based on the quantized values of the parameter vector of the previous source frames is used to generate a new codebook for every current source frame. We demonstrate the efficacy of the proposed schemes in the application of speech spectrum quantization. The proposed scheme is shown to provide superior performance with moderate increase in complexity when compared with conventional one-step linear prediction based compression schemes for both narrow-band and wide-band speech.

international conference on acoustics, speech, and signal processing | 2002

Low complexity recursive coding of spectrum parameters

Anand D. Subramaniam; William R. Gardner; Bhaskar D. Rao

This paper introduces noncausal all-pole models that are capable of efficiently capturing both the magnitude and phase information of voiced speech. It is shown that noncausal all-pole filter models are better able to match both magnitude and phase information and are particularly appropriate for voiced speech due to the nature of the glottal excitation. By modeling speech in the frequency domain, the standard difficulties that occur when using noncausal all-pole filters are avoided. Several algorithms for determining the model parameters based on frequency-domain information and the masking effects of the ear are described. Our work suggests that high-quality voiced speech can be produced using a 14th-order noncausal all-pole model.

IEEE Transactions on Audio, Speech, and Language Processing | 2006

Iterative joint source-channel decoding of speech spectrum parameters over an additive white Gaussian noise channel

Anand D. Subramaniam; William R. Gardner; Bhaskar D. Rao

A joint source-channel decoding scheme that improves the performance of conventional channel decoders over erasure channels by exploiting the cross-correlation between successive speech frames is presented. Speech spectrum parameters are quantized using the scheme presented in Subramaniam and Rao (2001). The joint probability density function (PDF) of the spectrum parameters of successive speech frames is modelled using a Gaussian mixture model (GMM). This model is then used to process the channel decoder output over erasure channels. The performance of two decoding strategies, namely, maximum likelihood decoding (ML) and minimum mean squared error decoding (MMSE) is shown to provide significantly better performance than prediction based schemes.

asilomar conference on signals, systems and computers | 1992

Non-causal linear prediction of voiced speech

William R. Gardner; Bhaskar D. Rao

A low complexity vector quantization scheme for the recursive coding of speech spectrum parameters is proposed. In this scheme, the joint probability density function (pdf) of the spectrum parameters of successive speech frames is modelled using a Gaussian mixture model whose parameters are estimated using the Expectation Maximization (EM) algorithm. The conditional density of the spectrum parameters of the current speech frame based on the quantized values of the spectrum parameters of previous speech frames is used to generate a new codebook for every current speech frame. An efficient quantization scheme using transform coding and bit allocation techniques which allows easy and computationally efficient mapping from observation to quantized value is developed. Transparent quality speech for the first order fixed rate recursive coding case is achieved at 20 bits per frame.

international conference on acoustics, speech, and signal processing | 1994

Mixed-phase AR models for voiced speech and perceptual cost functions

William R. Gardner; Bhaskar D. Rao

In this paper, we show how the Gaussian mixture modeling framework used to develop efficient source encoding schemes can be further exploited to model source statistics during channel decoding in an iterative framework to develop an effective joint source-channel decoding scheme. The joint probability density function (PDF) of successive source frames is modeled as a Gaussian mixture model (GMM). Based on previous work, the marginal source statistics provided by the GMM is used at the encoder to design a low-complexity memoryless source encoding scheme. The source encoding scheme has the specific advantage of providing good estimates to the probability of occurrence of a given source code-point based on the GMM. The proposed iterative decoding procedure works with any channel code whose decoder can implement the soft-output Viterbi algorithm that uses a priori information (APRI-SOVA) or the BCJR algorithm to provide extrinsic information on each source encoded bit. The source decoder uses the GMM model and the channel decoder output to provide a priori information back to the channel decoder. Decoding is done in an iterative manner by trading extrinsic information between the source and channel decoders. Experimental results showing improved decoding performance are provided in the application of speech spectrum parameter compression and communication.

Explore More