Zhiwei Shuang
IBM
Publication
Featured research published by Zhiwei Shuang.
International Conference on Pattern Recognition | 2010
Shilei Zhang; Zhiwei Shuang; Qin Shi; Yong Qin
This paper presents an improved acoustic keyword spotting (KWS) algorithm for Mandarin conversational speech that uses a novel confusion garbage model. In the KWS corpus, we observed many words whose pronunciation is similar to that of the predefined keywords even though they are written with different Chinese characters and have different meanings; such words easily lead to a high false alarm rate. The proposed method uses confusion garbage models to absorb words whose pronunciation is confusable with specific keywords in a given task. One clear advantage of this method is that it provides a flexible framework for implementing the selection procedure and effectively reduces the false alarm rate for a specific task. The effectiveness of the proposed architecture was evaluated with HMM-based confidence measure (CM) methods and demonstrated on a conversational telephone dataset.
International Conference on Pattern Recognition | 2010
Quansheng Duan; Shiyin Kang; Zhiyong Wu; Lianhong Cai; Zhiwei Shuang; Yong Qin
The performance of an HMM-based text-to-speech (TTS) system is affected by the choice of basic modeling unit and the size of the training data. This paper compares two HMM-based Mandarin TTS systems, one using the syllable and the other the phone as the basic unit, each trained on 1000, 3000, and 5000 sentences. Corpora from two female speakers are used as training data for the evaluation. For both corpora, the syllable-based system outperforms the phone-based system when trained on 3000 and 5000 sentences.
International Conference on Pattern Recognition | 2010
Shilei Zhang; Zhiwei Shuang; Yong Qin
This paper presents an automatic pronunciation transliteration method with acoustic and contextual analysis for a Chinese-English mixed-language keyword spotting (KWS) system. Robust Chinese-English mixed-language spoken language technology often has to be developed without Chinese-accented English acoustic data. We exploit a pronunciation conversion method, based on syllable-based characteristic analysis of pronunciation and data-driven phoneme pair mappings, to solve the mixed-language problem using only well-trained Chinese models. One clear advantage of this method is that it provides a flexible framework for automatically converting the pronunciation of English keywords to Chinese. The effectiveness of the proposed method was demonstrated on a KWS task with a mixed-language database.
Archive | 2010
Yong Qin; Qin Shi; Zhiwei Shuang; Shi Lei Zhang; Jie Zhou
Archive | 2009
Fan Ping Meng; Yong Qin; Qin Shi; Zhiwei Shuang
Archive | 2011
Yong Qin; Qin Shi; Zhiwei Shuang; Shi Lei Zhang
Archive | 2011
Shenghua Bao; Jian Chen; Yong Qin; Qin Shi; Zhiwei Shuang; Zhong Su; Liu Wen; Shi Lei Zhang
Conference of the International Speech Communication Association | 2009
Shiyin Kang; Zhiwei Shuang; Quansheng Duan; Yong Qin; Lianhong Cai
Archive | 2009
Qin Shi; Yong Qin; Yi Liu; Zhiwei Shuang
Conference of the International Speech Communication Association | 2009
Zhiwei Shuang; Shiyin Kang; Qin Shi; Yong Qin; Lianhong Cai