Kerry A. Ortega
IBM
Network
Latest external collaboration on country level. Dive into details by clicking on the dots.
Publication
Featured researches published by Kerry A. Ortega.
Journal of the Acoustical Society of America | 2002
Kerry A. Ortega; James R. Lewis; Ronald VanBuskirk; Huifang Wang; Stephane Herman Maes
A method and apparatus for transcribing text from multiple speakers in a computer system having a speech recognition application. The system receives speech from one of a plurality of speakers through a single channel, assigns a speaker ID to the speaker, transcribes the speech into text, and associates the speaker ID with the speech and text. In order to detect a speaker change, the system monitors the speech input through the channel for a speaker change.
Journal of the Acoustical Society of America | 2002
James R. Lewis; Kerry A. Ortega
A method and a system for use in a computer speech recognition system for adding new vocabulary by using language model statistics corresponding to an existing vocabulary word. The method involves a series of steps including receiving a user input identifying a first word for which no language model statistics exist in the speech recognition system. The first word is for inclusion within the existing vocabulary of the speech recognition system. In response to a second user input identifying a second word for which language model statistics exist in the speech recognition system, recalling from a computer memory the language model statistics for the second word. The speech recognition system then automatically creates language model statistics for the first word by duplicating the language model statistics of the second word and replacing each occurrence of the second word in the duplicated language model statistics with the first word.
Journal of the Acoustical Society of America | 2007
James R. Lewis; Kerry A. Ortega; Huifang Wang
A method for guiding text-to-speech output timing with speech recognition markers can include the following steps. First, tokens can be retrieved in a TTS system. The tokens can include words, phrase markers, punctuation marks and meta-tags. Second, phrase markers can be identified among the retrieved tokens. Third, words can be identified among the retrieved tokens. Fourth, the TTS system can TTS play back the identified words. Finally, during the TTS playback of the words, the TTS system can pause in response to the identification of the phrase markers.
Archive | 2003
James R. Lewis; Barbara E. Ballard; Gary R. Hanson; Kerry A. Ortega; Ronald VanBuskirk; Arthur Keller
Archive | 2000
James R. Lewis; Kerry A. Ortega
Archive | 2001
Vincent C. Conzola; Aaron R. Cox; Kerry A. Ortega; Thomas J. Sluchak
Archive | 2000
Linda M. Boyer; James R. Lewis; Kerry A. Ortega; Ji Wee Tan
Journal of the Acoustical Society of America | 2005
James R. Lewis; Kerry A. Ortega
Archive | 1999
Amado Nassiff; Kerry A. Ortega
Archive | 2001
Kerry A. Ortega; Hans Egger; Arthur Keller; Ronald VanBuskirk; Huifang Wang; James R. Lewis