Network


Latest external collaboration on country level. Dive into details by clicking on the dots.

Hotspot


Dive into the research topics where Roar Hagen is active.

Publication


Featured researches published by Roar Hagen.


IEEE Transactions on Information Theory | 1999

Robust vector quantization by a linear mapping of a block code

Roar Hagen; Per Hedelin

In this paper we propose a novel technique for vector quantizer design where the reconstruction vectors are given by a linear mapping of a binary block code (LMBC). The LMBC framework provides a relation between the index bits and the reconstruction vectors through mapping properties. We define a framework, show its flexibility, and give optimality conditions. We consider source optimized vector quantization (VQ), where the objective is to directly obtain a VQ with inherent good channel robustness properties. Several instructive theoretical results and properties of the distortion experienced due to channel noise are demonstrated. These results are used to guide the design process. Both optimization algorithms and a block code selection procedure are devised. Experimental results for Gauss-Markov sources show that quantization performance close to an unconstrained VQ is obtained with a short block code which implies a constrained VQ. The resulting VQs have better channel noise robustness than conventional VQs designed with the generalized Lloyd algorithm (GLA) and splitting initialization, even when a post-processing index assignment algorithm is applied to the GLA-based VQ. We have, thus, demonstrated a unique method for direct design resulting in an inherent good index assignment combined with small losses in quantization performance.


international conference on acoustics speech and signal processing | 1998

Removal of sparse-excitation artifacts in CELP

Roar Hagen; E. Ekudden; B. Johansson; W.B. Kleijn

In CELP, the use of codebooks with entries with only a few non-zero samples provides high speech quality and facilitates fast computation. With decreasing bit-rate, the intervals between the pulses increase, and the quality of the reconstructed signal begins to suffer from a particular type of artifact, which is strongest for noise-like segments. In this paper we describe experiments which show that the perceived artifacts are mainly concentrated at frequencies above 3 kHz, and this is consistent with our understanding of auditory theory. Our analysis leads to simple strategies to eliminate the artifacts, even at lower bit rates. We describe both a non-adaptive and an adaptive post-processing method to remove the artifacts. The methods are demonstrated to be efficient when used in the ACELP algorithm. A closed-loop method for ACELP is also described.


IEEE Transactions on Speech and Audio Processing | 1999

Voicing-specific LPC quantization for variable-rate speech coding

Roar Hagen; Erdal Paksoy; Allen Gersho

Phonetic classification of speech frames allows distinctive quantization and bit allocation schemes suited to the particular class. Separate quantization of the linear predictive coding (LPC) parameters for voiced and unvoiced speech frames is shown to offer useful gains for representing the synthesis filter commonly used in code-excited linear prediction (CELP) and other coders. Subjective test results are reported that determine the required bit rate and accuracy in the two classes of voiced and unvoiced LPC spectra for CELP coding with phonetic classification. It was found, in this context, that unvoiced spectra need 9 b/frame or more whereas voiced spectra need 25 b/frame or more with the quantization schemes used. New spectral distortion criteria needed to assure transparent LPC spectral quantization for each voicing class in CELP coders are presented. Similar subjective test results for speech synthesized from the true residual signal are also presented, leading to some interesting observations on the role of the analysis-by-synthesis structure of CELP. Objective performance assessments based on the spectral distortion measure are also presented. The theoretical distortion-rate function for the spectral distortion measure is estimated for voiced and unvoiced LPC parameters and compared with experimental results obtained with unstructured vector quantization (VQ). These results show a saving of at least 2 b/frame for unvoiced spectra compared to voiced spectra to achieve the same spectral distortion performance.


international conference on acoustics, speech, and signal processing | 1993

Robust vector quantization in spectral coding

Roar Hagen; Per Hedelin

A mapping of a binary block code is used to generate the reconstruction vectors of a vector quantizer. The aim is to secure channel robustness while allowing for efficient design, storage, and handling of the vector quantizer. The general procedure is exemplified by the task of spectral coding for speech transmission. Using an LSP (line spectrum pair)-representation for the spectrum, it is demonstrated that short block codes are feasible, thus allowing for compact storage of the code-book. These short block codes combine low quantization noise in clear channel conditions with robustness to bit errors introduced by a noisy channel. A reduced length code offers the possibility of obtaining a vector quantizer. The block code structure of the code vector generation may also be exploited for efficient search of the code-book.<<ETX>>


international conference on acoustics, speech, and signal processing | 1994

Spectral quantization of cepstral coefficients

Roar Hagen

Studies the cepstral coefficients as a suitable representation of the linear prediction filter for spectral coding purposes. Spectral coding methods in predictive speech coders are usually evaluated using the spectral distance measure. The average spectral distance combined with a measure of the percentage of spectra with high distortion are used to predict the perceptual quality when quantizing the prediction filter. The authors show that the spectral distance is equivalent to a squared error in the cepstral domain. Methods for spectral quantization using vector quantization of cepstral coefficients are analyzed. Better results than for quantization of line spectrum frequencies are reported for both single-stage VQ at 11-14 bits as well as 2-stage VQ at 18-22 bits. It is concluded that the cepstral coefficients are the right representation for LPC spectral coding purposes.<<ETX>>


international conference on acoustics, speech, and signal processing | 1990

Low bit-rate spectral coding in CELP, a new LSP-method

Roar Hagen; Per Hedelin

The use of line spectral pair (LSP)-frequencies for approaching the performance of a full vector quantizer for speech coding is discussed. An algorithm is presented that attains close to optimal performance if an accumulative constraint is introduced in the parameter encoding. Numerical results are presented from the training of code-books as well as from the evaluation of the quantizer. Some listening tests are discussed.<<ETX>>


international conference on acoustics, speech, and signal processing | 1995

Variable rate spectral quantization for phonetically classified CELP coding

Roar Hagen; Erdal Paksoy; Allen Gersho

Variable rate quantization of the linear predictive coding (LPC) parameters based on phonetic classification of the speech frame results in substantial performance gain. Speech frames are classified as unvoiced or voiced and are separately quantized with VQ codebooks designed for each class. Performance results, including listening tests, show that for transparent quality roughly 9 bits is sufficient for unvoiced frames and 24 bits for voiced frames. Test results of LPC quantization are described for a variable rate phonetically segmented CELP coder and for the synthesis of speech from the prediction residual.


international symposium on information theory | 1994

Design methods for VQ by linear mappings of block codes

Roar Hagen; Per Hedelin

Vector quantization (VQ) is one of the most important building blocks in modern source coders. Both speech and image compression systems are strongly dependant on VQ. When increasingly exploiting the compression capabilities of VQ, we normally have to pay a price in terms of problems with channel robustness and complexity. We have proposed linear mappings of block codes (LMBC) as a technique well suited to deal with some of these problems. As the scheme includes both a block code and a mapping, it is not obvious how to do the design. We demonstrate several design methods that work well and discuss their properties.<<ETX>>


Archive | 2008

Voice over IP: Speech Transmission over Packet Networks

Jan Skoglund; Ermin Kozica; Jan Linden; Roar Hagen; W. Bastiaan Kleijn

The emergence of packet networks for both data and voice traffic has introduced new challenges for speech transmission designs that differ significantly from those encountered and handled in traditional circuit-switched telephone networks, such as the public switched telephone network (PSTN). In this chapter, we present the many aspects that affect speech quality in a voice over IP (VoIP) conversation. We also present design techniques for coding systems that aim to overcome the deficiencies of the packet channel. By properly utilizing speech codecs tailored for packet networks, VoIP can in fact produce a quality higher than that possible with PSTN.


IEEE Transactions on Speech and Audio Processing | 1996

Robust LPC spectrum quantization-vector quantization by a linear mapping of a block code

Roar Hagen

In this paper, we address the use of a vector quantizer (VQ) defined as a linear mapping of a block code (LMBC-VQ) in robust LPC spectrum quantization. Split VQ and multistage VQ of LSFs are considered. We demonstrate several properties for the LMBC-VQ including design methods and channel robustness theory. The LMBC-VQ codebooks are, guided by the robustness theory, directly designed with inherent robustness against channel errors. We demonstrate results where the resulting LMBC-VQs outperform conventional codebooks designed with the generalized Lloyd algorithm in the presence of channel errors even when postprocessing index assignment is applied to the latter. Listening tests confirmed that the increased robustness is significant for the perceptual quality. We show that short block codes can be used to build the codebooks with no or small losses in quantization performance. This is advantageous for channel error robustness, lower memory requirement, and lower complexity. We also describe the structure of split VQ and multistage VQ using the LMBC framework. This allows us to assess the structural constraints and to generalize these VQ schemes, resulting in improved performance.

Collaboration


Dive into the Roar Hagen's collaboration.

Top Co-Authors

Avatar

Per Hedelin

Chalmers University of Technology

View shared research outputs
Top Co-Authors

Avatar

Jan Linden

Chalmers University of Technology

View shared research outputs
Top Co-Authors

Avatar

Allen Gersho

University of California

View shared research outputs
Top Co-Authors

Avatar

Erdal Paksoy

University of California

View shared research outputs
Top Co-Authors

Avatar

Ermin Kozica

Royal Institute of Technology

View shared research outputs
Top Co-Authors

Avatar

Mikael Skoglund

Royal Institute of Technology

View shared research outputs
Top Co-Authors

Avatar

W. Bastiaan Kleijn

Victoria University of Wellington

View shared research outputs
Researchain Logo
Decentralizing Knowledge