Network


Latest external collaboration on country level. Dive into details by clicking on the dots.

Hotspot


Dive into the research topics where Sang-Ick Kang is active.

Publication


Featured researches published by Sang-Ick Kang.


IEICE Transactions on Communications | 2008

A Support Vector Machine-Based Gender Identification Using Speech Signal

Kye-Hwan Lee; Sang-Ick Kang; Deok-Hwan Kim; Joon-Hyuk Chang

We propose an effective voice-based gender identification method using a support vector machine (SVM). The SVM is a binary classification algorithm that classifies two groups by finding the voluntary nonlinear boundary in a feature space and is known to yield high classification performance. In the present work, we compare the identification performance of the SVM with that of a Gaussian mixture model (GMM)-based method using the mel frequency cepstral coefficients (MFCC). A novel approach of incorporating a features fusion scheme based on a combination of the MFCC and the fundamental frequency is proposed with the aim of improving the performance of gender identification. Experimental results demonstrate that the gender identification performance using the SVM is significantly better than that of the GMM-based scheme. Moreover, the performance is substantially improved when the proposed features fusion technique is applied.


Circuits Systems and Signal Processing | 2010

Voice Activity Detection Based on Discriminative Weight Training Incorporating a Spectral Flatness Measure

Sang-Ick Kang; Joon-Hyuk Chang

In this paper, we present an approach to incorporate discriminative weight training into a statistical model-based voice activity detection (VAD) method. In our approach, the VAD decision rule is derived from the optimally weighted likelihood ratios (LRs) using a minimum classification error (MCE) method. An adaptive on-line means of selecting two kinds of weights based on a power spectral flatness measure (PSFM) is devised for performance improvement. The proposed approach is compared to conventional schemes under various noise conditions, and shows better performance.


IEICE Electronics Express | 2009

Discriminative weight training-based optimally weighted MFCC for gender identification

Sang-Ick Kang; Joon-Hyuk Chang

In this paper, we apply a discriminative weight training to a support vector machine (SVM) based gender identification. In our approach, the gender decision rule is derived by the SVM incorporating the optimally weighted mel-frequency cepstral coefficient (MFCC) based on a minimum classification error (MCE) method which is different from the previous works in that optimal weights are differently assigned to each MFCC which is considered more realistic. According to the experimental results, the proposed approach is found to be effective for gender identification based on the SVM.


international conference on acoustics, speech, and signal processing | 2010

A statistical model-based double-talk detection incorporating soft decision

Yun-Sik Park; Ji-Hyun Song; Sang-Ick Kang; Woojung Lee; Joon-Hyuk Chang

In this paper, we propose a novel double-talk detection (DTD) technique based on a soft decision in the frequency domain. The proposed method provides an efficient procedure to detect the double-talk situation by the use of the global near-end speech presence probability (GNSPP) and voice activity detection (VAD) of the near-end and far-end signal. Specifically, the GNSPP is derived based on a statistical method of speech and is employed to determine the double-talk presence in a given frame. The performance of our approach is evaluated by objective tests under different environments, and it is found that the suggested method yields better results compared with the conventional scheme.


Symmetry | 2016

Power Spectral Deviation-Based Voice Activity Detection Incorporating Teager Energy for Speech Enhancement

Sang-Kyun Kim; Sang-Ick Kang; Young-Jin Park; Sanghyuk Lee; Sang-Min Lee

In this paper, we propose a robust voice activity detection (VAD) algorithm to effectively distinguish speech from non-speech in various noisy environments. The proposed VAD utilizes power spectral deviation (PSD), using Teager energy (TE) to provide a better representation of the PSD, resulting in improved decision performance for speech segments. In addition, the TE-based likelihood ratio and speech absence probability are derived in each frame to modify the PSD for further VAD. We evaluate the performance of the proposed VAD algorithm by objective testing in various environments and obtain better results that those attained by of the conventional methods.


conference of the international speech communication association | 2010

Toward Detecting Voice Activity Employing Soft Decision in Second-order Conditional MAP

Sang-Kyun Kim; Jae-Hun Choi; Sang-Ick Kang; Ji-Hyun Song; Joon-Hyuk Chang


The Journal of Korean Institute of Communications and Information Sciences | 2010

Voice Activity Detection Based on Non-negative Matrix Factorization

Sang-Ick Kang; Joon-Hyuk Chang


conference of the international speech communication association | 2008

A statistical model-based voice activity detection employing minimum classification error technique.

Sang-Ick Kang; Ji-Hyun Song; Kye-Hwan Lee; Yun-Sik Park; Joon-Hyuk Chang


conference of the international speech communication association | 2010

On Using Gaussian Mixture Model for Double-Talk Detection in Acoustic Echo Suppression

Ji-Hyun Song; Kyu-Ho Lee; Yun-Sik Park; Sang-Ick Kang; Joon-Hyuk Chang


conference of the international speech communication association | 2008

Group delay function for improved gender identification.

Kye-Hwan Lee; Sang-Ick Kang; Ji-Hyun Song; Joon-Hyuk Chang

Collaboration


Dive into the Sang-Ick Kang's collaboration.

Top Co-Authors

Avatar
Top Co-Authors

Avatar
Top Co-Authors

Avatar
Top Co-Authors

Avatar
Top Co-Authors

Avatar
Top Co-Authors

Avatar
Top Co-Authors

Avatar
Top Co-Authors

Avatar
Top Co-Authors

Avatar
Researchain Logo
Decentralizing Knowledge