Juha Iso-Sipilä
Nokia
Network
Latest external collaboration on country level. Dive into details by clicking on the dots.
Publication
Featured researches published by Juha Iso-Sipilä.
international conference on acoustics, speech, and signal processing | 2006
Juha Iso-Sipilä; Marko Moberg; Olli Viikki
This paper presents a multi-lingual speaker-independent voice user interface (UI) that has been implemented for Nokia S60 mobile phones. The paper concentrates on discussing the specific approach used for achieving a multi-lingual and configurable speech recognition and speech synthesis system. The main applications are speaker-independent name dialing and voice commands. The novelty of the applications is that the user does not need to train the voice dialing system but the application reads the users phonebook and generates the required voice tags automatically. The speaker-independent voice dialing has already been introduced in regions where the language diversity is not so great. The system presented in this paper is the first of its kind to support both speech recognition and speech synthesis in more than 40 languages in embedded devices with strict memory and performance requirements
international conference on acoustics, speech, and signal processing | 2004
Marcel Vasilache; Juha Iso-Sipilä; Olli Viikki
We outline the main design features of a low complexity speech recognition engine targeted for mobile devices. Although major parts have already been presented, new features and important refinements of the original ideas, which were omitted, are now described. We also show how these techniques can be successfully combined in order to achieve various design targets with minimized impact on the recognition performance.
international conference on acoustics, speech, and signal processing | 2003
Guo-Hong Ding; Bo Xu; Juha Iso-Sipilä; Yang Cao
This paper proposes two fast and effective adaptation algorithms, which are called SATD and SASBD respectively. The two algorithms are implemented in the MLLR frame and the transform matrices have constrained forms. SATD uses triple diagonal matrices to describe the mismatch between speakers and the acoustic model in the log-spectral domain and the matrices can be transformed into the cepstral domain to adjust the acoustic model. SASBD is different from the traditional block-diagonal MLLR and shares the three transformations of basic MFCC and dynamic features with one matrix. Moreover, both algorithms provide multiple choices for the biases. Experiments are extensively implemented and the results prove the advantages of SATD and SASBD over traditional MLLR.
conference of the international speech communication association | 2004
Marko Moberg; Kimmo Pärssinen; Juha Iso-Sipilä
Archive | 2006
Bogdan Barliga; Mikko Antero Harju; Juha Iso-Sipilä
Archive | 2004
Juha Iso-Sipilä
Archive | 2006
Juha Iso-Sipilä
Archive | 2004
Juha Iso-Sipilä; Janne Suontausta; Jilei Tian
Journal of the Acoustical Society of America | 2007
Juha Iso-Sipilä
Archive | 2000
Juha Iso-Sipilä; Kari Laurila