Sang-Ick Kang
Inha University
Network
Latest external collaboration on country level. Dive into details by clicking on the dots.
Publication
Featured researches published by Sang-Ick Kang.
IEICE Transactions on Communications | 2008
Kye-Hwan Lee; Sang-Ick Kang; Deok-Hwan Kim; Joon-Hyuk Chang
We propose an effective voice-based gender identification method using a support vector machine (SVM). The SVM is a binary classification algorithm that classifies two groups by finding the voluntary nonlinear boundary in a feature space and is known to yield high classification performance. In the present work, we compare the identification performance of the SVM with that of a Gaussian mixture model (GMM)-based method using the mel frequency cepstral coefficients (MFCC). A novel approach of incorporating a features fusion scheme based on a combination of the MFCC and the fundamental frequency is proposed with the aim of improving the performance of gender identification. Experimental results demonstrate that the gender identification performance using the SVM is significantly better than that of the GMM-based scheme. Moreover, the performance is substantially improved when the proposed features fusion technique is applied.
Circuits Systems and Signal Processing | 2010
Sang-Ick Kang; Joon-Hyuk Chang
In this paper, we present an approach to incorporate discriminative weight training into a statistical model-based voice activity detection (VAD) method. In our approach, the VAD decision rule is derived from the optimally weighted likelihood ratios (LRs) using a minimum classification error (MCE) method. An adaptive on-line means of selecting two kinds of weights based on a power spectral flatness measure (PSFM) is devised for performance improvement. The proposed approach is compared to conventional schemes under various noise conditions, and shows better performance.
IEICE Electronics Express | 2009
Sang-Ick Kang; Joon-Hyuk Chang
In this paper, we apply a discriminative weight training to a support vector machine (SVM) based gender identification. In our approach, the gender decision rule is derived by the SVM incorporating the optimally weighted mel-frequency cepstral coefficient (MFCC) based on a minimum classification error (MCE) method which is different from the previous works in that optimal weights are differently assigned to each MFCC which is considered more realistic. According to the experimental results, the proposed approach is found to be effective for gender identification based on the SVM.
international conference on acoustics, speech, and signal processing | 2010
Yun-Sik Park; Ji-Hyun Song; Sang-Ick Kang; Woojung Lee; Joon-Hyuk Chang
In this paper, we propose a novel double-talk detection (DTD) technique based on a soft decision in the frequency domain. The proposed method provides an efficient procedure to detect the double-talk situation by the use of the global near-end speech presence probability (GNSPP) and voice activity detection (VAD) of the near-end and far-end signal. Specifically, the GNSPP is derived based on a statistical method of speech and is employed to determine the double-talk presence in a given frame. The performance of our approach is evaluated by objective tests under different environments, and it is found that the suggested method yields better results compared with the conventional scheme.
Symmetry | 2016
Sang-Kyun Kim; Sang-Ick Kang; Young-Jin Park; Sanghyuk Lee; Sang-Min Lee
In this paper, we propose a robust voice activity detection (VAD) algorithm to effectively distinguish speech from non-speech in various noisy environments. The proposed VAD utilizes power spectral deviation (PSD), using Teager energy (TE) to provide a better representation of the PSD, resulting in improved decision performance for speech segments. In addition, the TE-based likelihood ratio and speech absence probability are derived in each frame to modify the PSD for further VAD. We evaluate the performance of the proposed VAD algorithm by objective testing in various environments and obtain better results that those attained by of the conventional methods.
conference of the international speech communication association | 2010
Sang-Kyun Kim; Jae-Hun Choi; Sang-Ick Kang; Ji-Hyun Song; Joon-Hyuk Chang
The Journal of Korean Institute of Communications and Information Sciences | 2010
Sang-Ick Kang; Joon-Hyuk Chang
conference of the international speech communication association | 2008
Sang-Ick Kang; Ji-Hyun Song; Kye-Hwan Lee; Yun-Sik Park; Joon-Hyuk Chang
conference of the international speech communication association | 2010
Ji-Hyun Song; Kyu-Ho Lee; Yun-Sik Park; Sang-Ick Kang; Joon-Hyuk Chang
conference of the international speech communication association | 2008
Kye-Hwan Lee; Sang-Ick Kang; Ji-Hyun Song; Joon-Hyuk Chang