Network


Latest external collaboration on country level. Dive into details by clicking on the dots.

Hotspot


Dive into the research topics where André Mansikkaniemi is active.

Publication


Featured researches published by André Mansikkaniemi.


language resources and evaluation | 2017

Modeling under-resourced languages for speech recognition

Mikko Kurimo; Seppo Enarvi; Ottokar Tilk; Matti Varjokallio; André Mansikkaniemi; Tanel Alumäe

One particular problem in large vocabulary continuous speech recognition for low-resourced languages is finding relevant training data for the statistical language models. Large amount of data is required, because models should estimate the probability for all possible word sequences. For Finnish, Estonian and the other fenno-ugric languages a special problem with the data is the huge amount of different word forms that are common in normal speech. The same problem exists also in other language technology applications such as machine translation, information retrieval, and in some extent also in other morphologically rich languages. In this paper we present methods and evaluations in four recent language modeling topics: selecting conversational data from the Internet, adapting models for foreign words, multi-domain and adapted neural network language modeling, and decoding with subword units. Our evaluations show that the same methods work in more than one language and that they scale down to smaller data resources.


SLSP 2015 Proceedings of the Third International Conference on Statistical Language and Speech Processing - Volume 9449 | 2015

Unsupervised and User Feedback Based Lexicon Adaptation for Foreign Names and Acronyms

André Mansikkaniemi; Mikko Kurimo

In this work we evaluate a set of lexicon adaptation methods for improving the recognition of foreign names and acronyms in automatic speech recognition ASR. The most likely foreign names and acronyms are selected from the LM training corpus based on typographic information and letter-ngram perplexity. Adapted pronunciation rules are generated for the selected foreign name candidates using a statistical grapheme-to-phoneme G2P model. A rule-based method is used for pronunciation adaptation of acronym candidates. In addition to unsupervised lexicon adaptation, we also evaluate an adaptation method based on speech data and user corrected ASR transcripts. Pronunciation variants for foreign name candidates are retrieved using forced alignment and second-pass decoding over partial audio segments. Optimal pronunciation variants are collected and used for future pronunciation adaptation of foreign names.


IEEE Transactions on Audio, Speech, and Language Processing | 2015

Adaptation of morph-based speech recognition for foreign names and acronyms

André Mansikkaniemi; Mikko Kurimo

In this paper, we improve morph-based speech recognition system by focusing adaptation efforts on acronyms (ACRs) and foreign proper names (FPNs). An unsupervised language model (LM) adaptation framework based on two-pass decoding is used. Vocabulary adaptation is applied alongside unsupervised LM adaptation. The aim is to improve both language and pronunciation modeling for FPNs and ACRs. A smart selection algorithm is used to find the most likely topically related foreign words and acronyms from in-domain text. New pronunciation rules are generated for the selected words. Different kinds of morpheme adaptation operations are also evaluated on the ACR and FPN candidate words, to ensure optimal results are gained from pronunciation adaptation. Statistically significant improvements in average word error rate (WER), and term error rate (TER), are achieved using a combination of unsupervised LM adaptation with vocabulary adaptation focused on ACRs and FPNs.


european conference on technology enhanced learning | 2017

ASR in Classroom Today: Automatic Visualization of Conceptual Network in Science Classrooms

Daniela Caballero; Roberto Araya; Hanna Kronholm; Jouni Viiri; André Mansikkaniemi; Sami Lehesvuori; Tuomas Virtanen; Mikko Kurimo

Automatic Speech Recognition (ASR) field has improved substantially in the last years. We are in a point never saw before, where we can apply such algorithms in non-ideal conditions such as real classrooms. In these scenarios it is still not possible to reach perfect recognition rates, however we can already take advantage of these improvements. This paper shows preliminary results using ASR in Chilean and Finnish middle and high school to automatically provide teachers a visualization of the structure of concepts present in their discourse in science classrooms. These visualizations are conceptual networks that relate key concepts used by the teacher. This is an interesting tool that gives feedback to the teacher about his/her pedagogical practice in classes. The result of initial comparisons shows great similarity between conceptual networks generated in a manual way with those generated automatically.


international conference on speech and computer | 2016

In-Document Adaptation for a Human Guided Automatic Transcription Service

André Mansikkaniemi; Mikko Kurimo; Krister Lindén

In this work, the task is to assist human transcribers to produce, for example, interview or parliament speech transcriptions. The system will perform in-document adaptation based on a small amount of manually corrected automatic speech recognition results. The corrected segments of the spoken document are used to adapt the speech recognizer’s acoustic and language model. The updated models are used in second-pass recognition to produce a more accurate automatic transcription for the remaining uncorrected parts of the spoken document. In this work we evaluate two common adaptation methods for speech data in settings that represent typical transcription tasks. For adapting the acoustic model we use the Maximum A Posteriori adaptation method. For adapting the language model we use linear interpolation. We compare results of supervised adaptation to unsupervised adaptation, and evaluate the total benefit of using human corrected segments for in-document adaptation for typical transcription tasks.


conference of the international speech communication association | 2013

Unsupervised Topic Adaptation for Morph-based Speech Recognition

André Mansikkaniemi; Mikko Kurimo


north american chapter of the association for computational linguistics | 2012

Unsupervised Vocabulary Adaptation for Morph-based Language Models

André Mansikkaniemi; Mikko Kurimo


workshop on statistical machine translation | 2010

Applying Morphological Decompositions to Statistical Machine Translation

Sami Virpioja; Jaakko J. Väyrynen; André Mansikkaniemi; Mikko Kurimo


Archive | 2012

Adaptation of Morpheme-based Speech Recognition for Foreign Entity Names, HLT 2012

André Mansikkaniemi; Mikko Kurimo


Archive | 2010

Acoustic model and language model adaptation for a mobile dictation service

André Mansikkaniemi

Collaboration


Dive into the André Mansikkaniemi's collaboration.

Top Co-Authors

Avatar
Top Co-Authors

Avatar

Hanna Kronholm

University of Jyväskylä

View shared research outputs
Top Co-Authors

Avatar

Jouni Viiri

University of Jyväskylä

View shared research outputs
Top Co-Authors

Avatar
Top Co-Authors

Avatar

Sami Lehesvuori

University of Jyväskylä

View shared research outputs
Top Co-Authors

Avatar

Tuomas Virtanen

Tampere University of Technology

View shared research outputs
Top Co-Authors

Avatar
Top Co-Authors

Avatar
Top Co-Authors

Avatar
Top Co-Authors

Avatar
Researchain Logo
Decentralizing Knowledge