Network


Latest external collaboration on country level. Dive into details by clicking on the dots.

Hotspot


Dive into the research topics where Sakari Himanen is active.

Publication


Featured researches published by Sakari Himanen.


international conference on acoustics, speech, and signal processing | 2002

A 4 kb/s hybrid MELP/CELP speech coding candidate for ITU standardization

Alan V. McCree; Jacek Stachurski; Takahiro Unno; Erdem Ertan; Erdal Paksoy; Vishu R. Viswanathan; Ari Heikkinen; Anssi Rämö; Sakari Himanen; Peter Blöcher; Oliver Dressler

This paper presents an improved 4 kb/s hybrid MELP/CELP speech coder submitted as a candidate for ITU standardization. The coder uses three modes: a high-quality MELP coder for strongly voiced speech frames, an ACELP coder with pitch prediction for weakly voiced frames, and a stochastic CELP coder for unvoiced frames. We present recent enhancements to this coder, both to improve speech quality and to reduce coder complexity. Previous lTU Selection Testing results on an earlier version of this coder showed that it met nearly all requirements for toll-quality speech, more than any other candidate. Our internal testing shows that the current reduced-complexity fixed-point coder maintains this high performance.


international conference on acoustics, speech, and signal processing | 2003

Hybrid MELP/CELP coding at bit rates from 6.4 to 2.4 kb/s

Jacek Stachurski; Alan V. McCree; Vishu R. Viswanathan; Ari Heikkinen; Anssi Rämö; Sakari Himanen; Peter Blöcher

This paper describes extensions of the 4 kb/s hybrid MELP/CELP coder, up to 6.4 kb/s and down to 2.4 kb/s. The baseline 4 kb/s coder uses three coding modes: MELP in strongly voiced speech frames, CELP with pitch prediction in weakly voiced frames, and CELP with stochastic excitation in unvoiced frames. To minimize switching artifacts between parametric MELP and waveform CELP coding, an alignment phase is encoded in MELP and zero-phase equalization is applied to the CELP target signal. The 6.4 kb/s extension uses the same three modes as the 4 kb/s coder, with improved MELP and CELP coders. The 2.4 kb/s extension uses only two modes: MELP for voiced frames and CELP synthesis with random excitation for unvoiced frames. The alignment phase is encoded in MELP frames for all bit rates so that time synchrony with input speech is always maintained. Alignment phase and zero-phase equalization enable smooth switching between coders at different bit rates. The hybrid MELP/CELP coding structure leads to coders that perform better at a given bit rate than MELP or CELP separately, and better than or equivalent to higher bit-rate ITU standards. Formal subjective tests show that for all-but-one tested conditions, the 6.4 kb/s hybrid coder is better than 8 kb/s G.729 and the 2.4 kb/s coder is equivalent to, or better than, 6.4 kb/s G.729 Annex D.


Archive | 2006

Method for inputting characters in electronic device

Janne Vainio; Hannu Mikkola; Hannu Korhonen; Sakari Himanen; Toni P. Nieminen; Tuomas Vaittinen; Juha Marila


Archive | 2008

Method and system for pitch contour quantization in audio coding

Anssi Rämö; Jani Nurminen; Sakari Himanen; Ari Heikkinen


Archive | 2004

Method and system for speech coding

Anssi Rämö; Jani Nurminen; Sakari Himanen; Ari Heikkinen


Archive | 2005

Supporting a concatenative text-to-speech synthesis

Jani Nurminen; Sakari Himanen; Anssi Rämö; Janne Vainio


Archive | 2004

Reusing codebooks in parameter quantization

Anssi Rämö; Sakari Himanen; Jani Nurminen


Archive | 2006

Complexity Adjustment for a Signal Encoder

Jari Mäkinen; Juha Marila; Hannu Mikkola; Janne Vainio; Tuomas Vaittinen; Sakari Himanen; Kai Samposalo


Archive | 2003

Speech coding method, device, coding module, system and software program product for pre-processing the phase structure of a to be encoded speech signal to match the phase structure of the decoded signal

Ari Heikkinen; Sakari Himanen; Anssi Rämö


conference of the international speech communication association | 2004

Segmental speech coding model for storage applications.

Anssi Rämö; Jani Nurminen; Sakari Himanen; Ari Heikkinen

Collaboration


Dive into the Sakari Himanen's collaboration.

Researchain Logo
Decentralizing Knowledge