Changchun Bao
Beijing University of Technology
Network
Latest external collaboration on country level. Dive into details by clicking on the dots.
Publication
Featured researches published by Changchun Bao.
international conference on signal processing | 2010
Xin Liu; Changchun Bao; Mao-shen Jia; Yong-tao Sha
For conventional bandwidth extension, the spectral patching methods, such as spectral folding, spectral translation and non-linear processing, are employed to reconstruct high frequency signal, yet it leads to the spectral shifting between reconstructed and original signal, and does not retain the original harmonic relations. In this paper, a blind harmonic bandwidth extension method from wideband to super-wideband was proposed by estimating the energy of high frequency spectral envelope with Gaussian mixture model (GMM). Both the objective and subjective test results show that proposed algorithm performs better than conventional blind bandwidth extension algorithms.
international conference on signal processing | 2008
Ruwei Li; Changchun Bao; Huijing Dou
Most of the current pitch detection algorithms can not work well under the high noise environment. For this reason, a pitch detection algorithm for noisy speech signal based on pre-filtering and weighted wavelet coefficients is proposed. Firstly, the noisy speech signals are pre-filtered. Secondly, the speech pre-filtered is decomposed by the quadratic spline wavelet. Thirdly, the wavelet coefficients of three consecutive scales are weighted to emphasize the sharp change points. Fourthly, three candidate pitch periods are extracted from the weighted signals. Finally, the pitch period is calculated by autocorrelation function. Experiments show that this algorithm can increase the performance of pitch detection in noisy environment and decreases computational complexity compared with DWT-NCCF method.
international conference on signal processing | 2008
Huijing Dou; Changchun Bao; Ruwei Li
In this paper, by using the cyclostationary properties of speech signal, a voice activity detection (VAD) algorithm based on cyclic cumulant is proposed. The proposed scheme employs the third-order cyclic cumulant of the LPC residual of a speech signal. Analytical expressions for the third-order cyclic cumulant of the LPC residual of short-term speech are derived assuming a sinusoidal model. Matrix pencil method (MP) are adopted to estimate the frequencies of harmonic signal contained in LPC residual of speech signal, which are used as the cyclic frequencies of cyclic cumulant. Then the third-order cyclic cumulant is defined and used to construct the VAD detection variation. The test results show that the proposed algorithm gives better results than G.729B VAD.
international conference on signal processing | 2012
Rui Rui; Changchun Bao
In this paper, the projective non-negative matrix factorization (PNMF) with Bregman divergence is applied into the musical instrument classification. A novel supervised learning algorithm for automatic classification of individual musical instrument sounds is addressed inspiring from PNMF with several versions of Bregman divergence. Moreover, the orthogonality of basis matrices between PNMF and conventional non-negative matrix factorization (NMF) is compared. In addition, three classifiers based on nearest neighbors (NN), Gaussian mixture model (GMM) and radial basis function (RBF) are added to evaluate the performance of PNMF classifier. The results indicate that the classification accuracy of the proposed PNMF classifier outperforms the classifiers derived from conventional NMF and machine learning.
international conference on signal processing | 2008
Er-juan Xue; Changchun Bao
A new 1kb/s waveform interpolation (WI) speech coding algorithm based on non-negative matrix factorization (NMF) is proposed and implemented in this paper. Multi-frame parameter joint vector or matrix quantization, parameter prediction and discrete cosine transform (DCT) are used to reduce the bit rate and to improve the quality of speech. The results of informal subjective listening test shows that the intelligibility and articulation of the algorithm are close to that of 2kb/s NMF-based WI coder (called ldquoNMF-WIrdquo as convenience).
international conference on signal processing | 2008
Ji-hua Niu; Changchun Bao
This paper proposes an efficient method for frame erasure concealment in G.722.1 coding algorithm, which can mitigate the adverse impact of frame erasure on the reconstruction quality. The lost frame is reconstructed in the modulated lapped transform (MLT) domain in terms of magnitude and sign of the coefficients. The method of interpolation is employed for the magnitude estimation. Some sign information is recovered by extra information from encoder. Listening test results indicate that improved quality is produced by the proposed algorithm.
international conference on signal processing | 2008
Mao-shen Jia; Changchun Bao; Rui Li
This paper describes an embedded speech and audio codec which is based on ITU-T Recommendation G.722.1; it can process 7 kHz bandwidth speech and audio signal at scalable bit rates. Based on the G.722.1 of ITU-T, this algorithm adds two modules: the energy ordering of sub-band and the processing of bit-stream truncation. Furthermore, it does some modification on the categorization and noise-fill modules. It makes sure that the codec could produce embedded bit-stream, so this codec had more robustness in the transmission. The test results by ITU-T PESQ show that this codec has good performance as G.722.1 at the same bit-rates.
Archive | 2008
Changchun Bao; Zexin Liu; Rui Fan; Heng Zhu; Haiting Li; Mao-shen Jia; Rui Li; Lixiong Li
Archive | 2010
Haiting Li; Rui Fan; Changchun Bao; Lixiong Li; Zexin Liu; Heng Zhu
Archive | 2008
Changchun Bao; Haiting Li; Heng Zhu; Zexin Liu; Rui Fan; Mao-shen Jia; Rui Li; Lixiong Li