Wael Hamza
IBM
Network
Latest external collaboration on country level. Dive into details by clicking on the dots.
Publication
Featured researches published by Wael Hamza.
international conference on acoustics, speech, and signal processing | 2003
Ellen Eide; Andrew Aaron; Raimo Bakis; R. Cohen; Robert E. Donovan; Wael Hamza; T. Mathes; Michael Picheny; M. Polkosky; M. Smith; M. Viswanathan
In this paper we describe the current status of the trainable text-to-speech system at IBM. Recent algorithmic and database changes to the system have led to significant gains in the output quality. On the algorithms side, we have introduced statistical models for predicting pitch and duration targets which replace the rule-based target generation previously employed. Additionally, we have changed the cost function and the search strategy, introduced a post-search pitch smoothing algorithm, and improved our method of preselection. Through the combined data and algorithmic contributions, we have been able to significantly improve (p < 0.0001) the mean opinion score (MOS) of our female voice, from 3.68 to 4.85 when heard over loudspeakers and to 5.42 when heard over the telephone (seven point scale).
international conference on acoustics, speech, and signal processing | 2001
Wael Hamza; Mohsen A. Rashwan; Mohamed Afify
Modeling phonetic context is one of the key points to get natural sounding in concatenativc speech synthesis. In this paper, a new quantitative method to model context is proposed. In the proposed method, the context is measured as the distance between leafs of the top-down likelihood-based decision trees that have been grown during the construction of acoustic inventory. Unlike other context modeling methods, this method allows the unit selection algorithm to borrow unit occurrences from other contexts when their context distances are close. This is done by incorporating the measured distance as an element in the unit selection cost function. The motivation behind this method is that it reduces the required speech modification by using better unit occurrences from near context. This method also makes it easy to use long synthesis units, e.g. syllables or words, in the same unit selection framework.
IEEE Transactions on Audio, Speech, and Language Processing | 2006
John F. Pitrelli; Raimo Bakis; Ellen Eide; Raul Fernandez; Wael Hamza; Michael Picheny
SSW | 2004
Ellen Eide; Andrew Aaron; Raimo Bakis; Wael Hamza; Michael Picheny; John F. Pitrelli
Archive | 2004
Andrew Aaron; Raimo Bakis; Ellen Eide; Wael Hamza
Archive | 2003
Andy Aaron; Raimo Bakis; Ellen Eide; Wael Hamza
conference of the international speech communication association | 2004
Wael Hamza; Ellen Eide; Raimo Bakis; Michael Picheny; John F. Pitrelli
SSW | 2001
Robert E. Donovan; Abraham Ittycheriah; Martin Franz; Bhuvana Ramabhadran; Ellen Eide; Mahesh Viswanathan; Raimo Bakis; Wael Hamza; Michael Picheny; Philip Gleason; T. Rutherfoord; P. Cox; D. Green; Eric Janke; S. Revelin; Claire Waast; B. Zeller; C. Guenther; J. Kunzmann
Archive | 2002
Wael Hamza; Michael Picheny
Archive | 2005
Ellen Eide; Wael Hamza; Michael Picheny