Wael Hamza | Researchain

Archive Network Publication Hotspot Collaboration

Network

Latest external collaboration on country level. Dive into details by clicking on the dots.

Explore More

Hotspot

Dive into the research topics where Wael Hamza is active.

Explore More

Publication

Featured researches published by Wael Hamza.

international conference on acoustics, speech, and signal processing | 2003

Recent improvements to the IBM trainable speech synthesis system

Ellen Eide; Andrew Aaron; Raimo Bakis; R. Cohen; Robert E. Donovan; Wael Hamza; T. Mathes; Michael Picheny; M. Polkosky; M. Smith; M. Viswanathan

In this paper we describe the current status of the trainable text-to-speech system at IBM. Recent algorithmic and database changes to the system have led to significant gains in the output quality. On the algorithms side, we have introduced statistical models for predicting pitch and duration targets which replace the rule-based target generation previously employed. Additionally, we have changed the cost function and the search strategy, introduced a post-search pitch smoothing algorithm, and improved our method of preselection. Through the combined data and algorithmic contributions, we have been able to significantly improve (p < 0.0001) the mean opinion score (MOS) of our female voice, from 3.68 to 4.85 when heard over loudspeakers and to 5.42 when heard over the telephone (seven point scale).

international conference on acoustics, speech, and signal processing | 2001

Quantitative method for modeling context in concatenative synthesis using large speech database

Wael Hamza; Mohsen A. Rashwan; Mohamed Afify

Modeling phonetic context is one of the key points to get natural sounding in concatenativc speech synthesis. In this paper, a new quantitative method to model context is proposed. In the proposed method, the context is measured as the distance between leafs of the top-down likelihood-based decision trees that have been grown during the construction of acoustic inventory. Unlike other context modeling methods, this method allows the unit selection algorithm to borrow unit occurrences from other contexts when their context distances are close. This is done by incorporating the measured distance as an element in the unit selection cost function. The motivation behind this method is that it reduces the required speech modification by using better unit occurrences from near context. This method also makes it easy to use long synthesis units, e.g. syllables or words, in the same unit selection framework.

IEEE Transactions on Audio, Speech, and Language Processing | 2006

The IBM expressive text-to-speech synthesis system for American English

John F. Pitrelli; Raimo Bakis; Ellen Eide; Raul Fernandez; Wael Hamza; Michael Picheny

SSW | 2004

A corpus-based approach to expressive speech synthesis.

Ellen Eide; Andrew Aaron; Raimo Bakis; Wael Hamza; Michael Picheny; John F. Pitrelli

Archive | 2004

Generating paralinguistic phenomena via markup in text-to-speech synthesis

Andrew Aaron; Raimo Bakis; Ellen Eide; Wael Hamza

Archive | 2003

Systems and methods for text-to-speech synthesis using spoken example

Andy Aaron; Raimo Bakis; Ellen Eide; Wael Hamza

conference of the international speech communication association | 2004

The IBM expressive speech synthesis system.

Wael Hamza; Ellen Eide; Raimo Bakis; Michael Picheny; John F. Pitrelli

SSW | 2001

Current status of the IBM Trainable Speech Synthesis System.

Robert E. Donovan; Abraham Ittycheriah; Martin Franz; Bhuvana Ramabhadran; Ellen Eide; Mahesh Viswanathan; Raimo Bakis; Wael Hamza; Michael Picheny; Philip Gleason; T. Rutherfoord; P. Cox; D. Green; Eric Janke; S. Revelin; Claire Waast; B. Zeller; C. Guenther; J. Kunzmann

Archive | 2002