Publication
Featured research published by Siegfried Kunzmann.
international conference on acoustics, speech, and signal processing | 2004
Siegfried Kunzmann; Volker Fischer; Jorge Gonzalez; Ossama Emam; Carsten Günther; Eric Janke
In this paper, we review the design of a common phone alphabet for up to fifteen languages and describe its application in two important components of a seamless multilingual conversational system, namely speech recognition and synthesis. We report on experiments that demonstrate the advantages of multilingual acoustic models both for the recognition of foreign names and non-native speech, and describe the usefulness of a common phone alphabet for the construction of unit selection based mono- and bilingual speech synthesis systems.
international conference on multimedia and expo | 2000
Marion Mast; Carsten Günther; Siegfried Kunzmann; Thomas Ross
The IBM hotel information system was designed to make information stored on Internet or enterprise backend servers available in a natural dialog over the telephone. The system combines robust speech recognition with natural language and dialog components to allow the user to request the information in a human-human-like dialog. A summary of the information provided in the conversation, such as the address and phone number of a hotel, is additionally sent to the user's mobile phone as an SMS message. The system design combines spoken and textual output.
text speech and dialogue | 2005
Jozef Ivanecký; Volker Fischer; Siegfried Kunzmann
Multilingual access to information and services is a key requirement in any pervasive or ubiquitous computing environment. In this paper we describe our efforts towards multilingual speech recognition with a focus on applications that are designed to run on embedded devices, such as a commercially available PDA. We give an overview of speech recognition techniques suited to the special requirements of the expected phonetic and acoustic environments and explore the ability to create multilingual acoustic models and applications that are able to run on embedded devices in real time.
dagm conference on pattern recognition | 2005
Jozef Ivanecký; Julia Fischer; Marion Mast; Siegfried Kunzmann; Thomas Ross; Volker Fischer
Over the last decade, voice technologies for telephony and embedded solutions have become much more mature, resulting in applications providing mobile access to digital information from anywhere. Both a growing demand for voice-driven applications in many languages and the need for improved usability and user experience now drive the exploration of multi-lingual speech processing techniques for recognition, synthesis and conversational dialog management. In this overview article we discuss our recent activities on multi-lingual voice technologies and describe the benefits of multi-lingual modeling for the creation of multi-modal mobile and telephony applications.
Archive | 2007
Volker Fischer; Markus Klehr; Siegfried Kunzmann
In the preceding chapter, we first outlined the growing use of speech processing technologies for location-independent interaction between humans and computers, and briefly sketched the mathematical and technical foundations of today's speech recognition and synthesis systems. The development of their core components presented itself to us as a machine learning process that is largely uniform across many languages, in which the free parameters of a stochastic model of human speech production are estimated from large amounts of language- and application-specific training data.
text speech and dialogue | 2003
Jozef Ivanecký; Markus Klehr; Volker Fischer; Siegfried Kunzmann
Seamless access to information and services is a key requirement for any pervasive or ubiquitous computing environment, and access via any client is becoming an increasingly important feature within this scenario. The paper describes our efforts towards a multi-client voice application with a focus on an embedded client, such as a commercially available PDA. We give an overview of speech recognition techniques suited to the special requirements of the expected acoustic environments and explore the ability to design applications that are able to run on different voice- and GUI-capable devices.
Journal of the Acoustical Society of America | 2006
Volker Fischer; Siegfried Kunzmann; Eric-W. Janke; A. Jon Tyrrell
Journal of the Acoustical Society of America | 2000
Upali Bandara; Siegfried Kunzmann; Karlheinz Mohr; Burn L. Lewis
Journal of the Acoustical Society of America | 2002
Ossama Emam; Siegfried Kunzmann
Archive | 2006
Volker Fischer; Siegfried Kunzmann