Network


Latest external collaboration on country level. Dive into details by clicking on the dots.

Hotspot


Dive into the research topics where Michael Galler is active.

Publication


Featured researches published by Michael Galler.


international conference on acoustics, speech, and signal processing | 1997

Enhancement of esophageal speech by injection noise rejection

Hector R. Javkin; Michael Galler; Nancy Niedzielski

Esophageal speakers, who produce a voice source by bringing about a vibration of the esophageal superior sphincter, must insufflate the esophagus with an air injection gesture before every utterance, thus creating an air reservoir to drive the vibration. The resulting noise is generally undesired by the speakers. This paper describes a method for the automatic recognition and rejection of the injection noise which occurs in esophageal speech.


Journal of the Acoustical Society of America | 2000

Esophageal speech injection noise detection and rejection

Hector R. Javkin; Michael Galler; Nancy Niedzielski; Robert Boman

The present invention eliminates injection noise in speech produced by esophageal speakers. A speech input signal is digitized. One copy of the digitized signal is used for analysis and the other is passed through a gain switch to an amplifier as output. A Fast Fourier Transform and a mean value of the digitized speech input signal is calculated. The Fast Fourier Transform (FFT) is passed through a morphological filter to produce a filtered spectrum. An occurrence of injection noise is detected by calculating a derivative of the filtered spectrum and determining from the mean value and the derivative a location and value of a largest peak and a second largest peak in the filtered spectrum. If the largest peak is lower in frequency than the second largest peak, and if all points above 2 KHz are less than the mean, then an occurrence of injection noise has been detected. An occurrence of silence is detected by center-clipping the filtered spectrum and determining whether there is any energy within a sliding 10 millisecond window for a predetermined amount of time. If no energy is detected within a sliding 10 millisecond window for a predetermined amount time, then an occurrence of silence has been detected. The output speech signal is passed after the occurrence of injection noise has been detected; and is blocked following an occurrence of silence.


international conference on acoustics, speech, and signal processing | 1997

Robustness improvements in continuously spelled names over the telephone

Michael Galler; Jean-Claude Junqua

A speaker-independent speech recognizer for continuously spelled names, implemented for a switchboard call-routing task, is analyzed for sources of error. Results indicate most errors are due to extraneous speech and end-point detection errors. Strategies are proposed for improving the robustness of recognition, including tolerance for speech with pauses, and a letter-spotting strategy to handle extraneous speech. Experimental results on laboratory data indicate that with the letter-spotting method, name retrieval error rate is reduced on noisy signals or signals with extraneous speech 60.1%, while it is increased on clean signals from 4.5% to 5.5%. On data collected during a telephone field trial, name retrieval error is reduced 54.1% in offline tests by introducing the letter-spotting algorithm.


Journal of the Acoustical Society of America | 1999

Method for goal-oriented speech translation in hand-held devices using meaning extraction and dialogue

Jean-Claude Junqua; Roland Kuhn; Matteo Contolini; Murat Karaorman; Ken Field; Michael Galler; Yi Zhao


Journal of the Acoustical Society of America | 1998

Method and apparatus using probabilistic language model based on confusable sets for speech recognition

Luca Rigazio; Jean-Claude Junqua; Michael Galler


Archive | 1996

Call routing device employing continuous speech

Jean-Claude Junqua; Michael Galler


Journal of the Acoustical Society of America | 2000

Speech recognition system employing multiple grammar networks

Michael Galler; Jean-Claude Junqua


Archive | 2000

Method for goal-oriented speech translation using meaning extraction and dialogue

Matteo Contolini; Ken Field; Michael Galler; Jean-Claude Junqua; Murat Karaorman; Roland Kuhn; Yi Zhao


international conference on acoustics speech and signal processing | 1998

Multilevel discriminative training for spelled word recognition

Luca Rigazio; Jean-Claude Junqua; Michael Galler


Archive | 2000

Procédé de traduction de la parole utilisant une extraction de la signification et un dialogue

Matteo Contolini; Ken Field; Michael Galler; Jean-Claude Junqua; Murat Karaorman; Roland Kuhn; Yi Zhao

Collaboration


Dive into the Michael Galler's collaboration.

Top Co-Authors

Avatar
Top Co-Authors

Avatar
Top Co-Authors

Avatar
Top Co-Authors

Avatar

Roland Kuhn

State Street Corporation

View shared research outputs
Top Co-Authors

Avatar
Researchain Logo
Decentralizing Knowledge