Changhuai You
Agency for Science, Technology and Research
Publication
Featured research published by Changhuai You.
international conference on acoustics, speech, and signal processing | 2009
Haizhou Li; Bin Ma; Kong-Aik Lee; Hanwu Sun; Donglai Zhu; Khe Chai Sim; Changhuai You; Rong Tong; Ismo Kärkkäinen; Chien-Lin Huang; Vladimir Pervouchine; Wu Guo; Yijie Li; Li-Rong Dai; Mohaddeseh Nosratighods; Thiruvaran Tharmarajah; Julien Epps; Eliathamby Ambikairajah; Eng Siong Chng; Tanja Schultz; Qin Jin
This paper describes the performance of the I4U speaker recognition system in the NIST 2008 Speaker Recognition Evaluation. The system consists of seven subsystems, each with different cepstral features and classifiers. We describe the I4U Primary system and report its core test results as submitted; these were among the best-performing submissions. The I4U effort was led by the Institute for Infocomm Research, Singapore (IIR), with contributions from the University of Science and Technology of China (USTC), the University of New South Wales, Australia (UNSW), Nanyang Technological University, Singapore (NTU) and Carnegie Mellon University, USA (CMU).
international symposium on chinese spoken language processing | 2006
Rong Tong; Bin Ma; Kong-Aik Lee; Changhuai You; Donglai Zhu; Tomi Kinnunen; Hanwu Sun; Minghui Dong; Eng Siong Chng; Haizhou Li
This paper describes our recent efforts in exploring effective discriminative features for speaker recognition. Recent research has indicated that the appropriate fusion of features is critical to improving the performance of speaker recognition systems. In this paper we describe our approaches for the NIST 2006 Speaker Recognition Evaluation. Our system integrates cepstral GMM modeling, cepstral SVM modeling and tokenization at both the phone level and the frame level. Experimental results on both the NIST 2005 SRE corpus and the NIST 2006 SRE corpus are presented. The fused system achieved an 8.14% equal error rate on the 1conv4w-1conv4w test condition of the NIST 2006 SRE.
international conference on acoustics, speech, and signal processing | 2008
Kong-Aik Lee; Changhuai You; Haizhou Li
This paper introduces a spoken language recognition system with a generative front-end and a discriminative back-end. The generative front-end is built upon an ensemble of Gaussian densities. These Gaussian densities are trained to represent elementary speech sound units characterizing a wide variety of languages. We formulate the generative front-end in the form of a sequence kernel. This sequence kernel transforms a spoken utterance into a feature vector whose attributes represent the occurrence statistics of the speech sound units. A discriminative support vector machine (SVM) then operates on the feature vectors to make the classification decision. The proposed language recognition system demonstrates competitive performance on the NIST 1996, 2003 and 2005 LRE corpora.
international symposium on chinese spoken language processing | 2006
Kong-Aik Lee; Hanwu Sun; Rong Tong; Bin Ma; Minghui Dong; Changhuai You; Donglai Zhu; Chin-Wei Eugene Koh; Lei Wang; Tomi Kinnunen; Eng Siong Chng; Haizhou Li
This paper describes the design and implementation of a practical automatic speaker recognition system for the CSLP speaker recognition evaluation (SRE). The speaker recognition system is built upon four subsystems using speaker information from acoustic spectral features. In addition to the conventional spectral features, a novel temporal discrete cosine transform (TDCT) feature is introduced in order to capture long-term speech dynamics. The speaker information is modeled using two complementary speaker modeling techniques, namely the Gaussian mixture model (GMM) and the support vector machine (SVM). The resulting subsystems are then integrated at the score level through a multilayer perceptron (MLP) neural network. Evaluation results confirm that the feature selection, classifier design, and fusion strategy are successful, giving rise to an effective speaker recognition system.
conference of the international speech communication association | 2007
Kong-Aik Lee; Changhuai You; Haizhou Li; Tomi Kinnunen
conference of the international speech communication association | 2008
Kong-Aik Lee; Changhuai You; Haizhou Li; Tomi Kinnunen; Donglai Zhu
conference of the international speech communication association | 2012
Changhuai You; Haizhou Li; Bin Ma; Kong-Aik Lee
Archive | 2006
Rong Tong; Bin Ma; Kong-Aik Lee; Changhuai You; Donglai Zhu; Hanwu Sun; Minghui Dong; Eng Siong Chng
MediaEval | 2016
Lei Wang; Chongjia Ni; Cheung-Chi Leung; Changhuai You; Lei Xie; Haihua Xu; Xiong Xiao; Tin Lay Nwe; Eng Siong Chng; Bin Ma
NIST Speaker Recognition Conference | 2012
Kong Aik Lee; Rahim Saedi; Tawfik Hasan; Tomi Kinnunen; Benoit G. B. Fauve; Pierre-Michel Bousquet; Elie Khoury; Pablo Luis Sordo Martinez; Tharmarajah Thiruvaran; Changhuai You; Padmanabhan Rajan; David A. van Leeuwen; Seyed Omid Sadjadi; Driss Matrouf; Laurent El Shafey; John S. D. Mason; Eliathamby Ambikairajah; Hanwu Sun; Anthony Larcher; Bin Ma; Ville Hautamäki; Cemal Hanilçi; Billy Braithwaite; Rosa Gonzalez-Hautamäki; Gang Liu; Hynek Boril; Navid Shokouhi; John H. L. Hansen; Jean-François Bonastre; Sébastien Marcel