Network


Latest external collaboration on country level. Dive into details by clicking on the dots.

Hotspot


Dive into the research topics where Yu Gwang Jin is active.

Publication


Featured researches published by Yu Gwang Jin.


IEEE Signal Processing Letters | 2014

Spectro-Temporal Filtering for Multichannel Speech Enhancement in Short-Time Fourier Transform Domain

Yu Gwang Jin; Jong Won Shin; Nam Soo Kim

In this letter, we propose a spectro-temporal filtering algorithm for multichannel speech enhancement in the short-time Fourier transform (STFT) domain. Compared with the traditional multiplicative filtering technique, the proposed method takes account of interdependencies between components in adjacent frames and frequency bins. For spectro-temporal filtering, speech and noise power spectral density (PSD) matrices are estimated based on an extended formulation utilizing temporal and spectral correlations, and the parametric noise reduction filter based on these PSD matrices is applied to the input microphone array signal. Moreover, multichannel speech presence probabilities are also estimated within a unified framework. A number of experimental results show that the proposed spectro-temporal filtering method improves the performance of multichannel speech enhancement.


intelligent information hiding and multimedia signal processing | 2012

Quality Enhancement of Audio Watermarking for Data Transmission in Aerial Space Based on Segmental SNR Adjustment

Kiho Cho; Jae Choi; Yu Gwang Jin; Nam Soo Kim

Audio watermarking techniques can be used not only as a copyright protect system but also as a short-range wireless data transmission system between a loudspeaker and a microphone. To send data through aerial space using acoustic signal, the length of frequency analysis window should be longer than the reverberation time, which, however, can degrade the quality of watermarked audio signal. In this paper, we propose an audio quality enhancement technique for acoustic data transmission application based on segmental SNR adjustment. The acoustic data transmission system which employs the proposed audio watermarking technique with long frequency analysis window showed better transmission performance than the previous system.


international conference on acoustics, speech, and signal processing | 2009

Speech reinforcement based on partial masking effect

Jong Won Shin; Yu Gwang Jin; Seung Seop Park; Nam Soo Kim

Perceived quality of the speech signal deteriorates significantly in the presence of ambient noise. In this paper, based on the analysis that the partial masking effect is a main source of the quality degradation when interfering signals are present, we propose a novel approach to enhance the perceived quality of speech signal when the ambient noise cannot be directly controlled by reinforcing it so that it can be heard more clearly. To find a suitable reinforcement rule, the loudness perception model proposed by Moore et al. [1] is adopted with the consideration on the prevention of the hearing damage. Experimental results show that the perceived quality and intelligibility can be enhanced under various noise environments.


IEEE Signal Processing Letters | 2016

Dual Microphone Voice Activity Detection Exploiting Interchannel Time and Level Differences

Jaehoon Park; Yu Gwang Jin; Soojoong Hwang; Jong Won Shin

The two most important spatial cues in human auditory system may be the interaural time difference and the interaural level difference. There have been many attempts to utilize the time difference of arrival (TDoA) and level difference between two microphone signals for voice activity detection (VAD). In this letter, we propose a dual microphone VAD algorithm based on a support vector machine for which the input vector consists of both TDoA-based and level difference-based features. Several candidates for the feature combination have been compared using various TDoA-related and level difference-related features. Experimental results showed that the proposed VAD algorithm outperformed a standardized single microphone VAD, VADs based on the TDoA or level difference, and logical combination of them in various noise environments.


international conference on acoustics, speech, and signal processing | 2014

PARAMETRIC MULTICHANNEL NOISE REDUCTION ALGORITHM UTILIZING TEMPORAL CORRELATIONS IN REVERBERANT ENVIRONMENT

Yu Gwang Jin; Jong Won Shin; Chul Min Lee; Soo Hyun Bae; Nam Soo Kim

In this paper, we propose a parametric multichannel noise reduction algorithm utilizing temporal correlations in a noisy and reverberant environment. Under the reverberant condition, the received acoustic signal becomes highly correlated in the time domain and it makes successful noise reduction quite difficult. The proposed parametric noise reduction method takes account of interdependencies between components observed from different frames. Extended speech and noise power spectral density (PSD) matrices are estimated containing additional temporal information, and the parametric multichannel noise reduction filter based on these PSD matrices is applied to the input microphone array signal. According to the experimental results, the proposed algorithm has been found to show better performances compared with the conventional multiplicative filtering technique which considers the current input signals only.


international conference on acoustics, speech, and signal processing | 2011

A data-driven residual gain approach for two-stage speech enhancement

Yu Gwang Jin; Chul Min Lee; Kiho Cho; Nam Soo Kim

In this paper, we propose a novel speech enhancement algorithm based on data-driven residual gain estimation. The system consists of two stages. A noisy input signal is processed at the first stage by a conventional speech enhancement module from which both the enhanced signal and several signal-to-noise ratio (SNR)-related parameters are obtained. At the second stage, the residual gain, which is estimated by a data-driven method, is applied to the enhanced signal to further adjust it. According to the experimental results, the proposed algorithm has been found to show better performances compared with the conventional speech enhancement technique based on soft decision as well as the data-driven approach using the SNR grid look-up table.


Signal Processing | 2017

Multichannel speech reinforcement based on binaural unmasking

Junhyeong Pak; Inyong Choi; Yu Gwang Jin; Jong Won Shin

Multichannel speech reinforcement exploiting DoA information is proposed.An empirical evidence of the binaural unmasking for monaural speech is provided.Proposed reinforcement restores perceived loudness considering binaural unmasking.The performance of the algorithm is verified through subjective listening tests. Speech reinforcement or near-end listening enhancement is a technique that modifies the far-end signal to mitigate the effect of the near-end noise, usually based on the power spectra of the far-end signal and the near-end noise. Psychoacoustic experiments have shown that the location of a noise source with respect to that of a signal source affects the amount of masking. Since conventional speech reinforcement methods obtain spectral gain based only on the power spectra, this psychoacoustic phenomenon called binaural unmasking has not been considered in those approaches. In this paper, we propose a novel speech reinforcement algorithm that modifies the far-end speech signal based on both the power spectrum and the direction-of-arrival (DoA) of the noise. Specifically, we have computed the equivalent frontal noise level from the observed noise level and the estimated DoA, and used it to compute spectral gains as in conventional partial loudness restoration-based speech reinforcement. Experimental results showed that the proposed method outperformed the conventional methods based on partial loudness restoration and speech intelligibility index (SII) optimization in terms of the overall perceived quality through subjective listening tests.


Journal of the Acoustical Society of America | 2017

Decision-directed speech power spectral density matrix estimation for multichannel speech enhancement

Yu Gwang Jin; Jong Won Shin; Nam Soo Kim

In this letter, a multichannel decision-directed approach to estimate the speech power spectral density (PSD) matrix for multichannel speech enhancement is proposed. There have been attempts to build multichannel speech enhancement filters which depend only on the speech and noise PSD matrices, for which the accurate estimate of the clean speech PSD matrix is crucial for a successful noise reduction. In contrast to the maximum likelihood estimator which has been applied conventionally, the proposed decision-directed method is capable of tracking the time-varying speech characteristics more robustly and improves the noise reduction performance under various noise environments.


international conference on acoustics, speech, and signal processing | 2014

CROSSBAND FILTERING FOR STEREOPHONIC ACOUSTIC ECHO SUPPRESSION

Chul Min Lee; Jong Won Shin; Yu Gwang Jin; Jeoung Hun Kim; Nam Soo Kim

In this paper, we propose a novel stereophonic acoustic echo suppression (SAES) technique based on crossband filtering in the short-time Fourier transform (STFT) domain. The proposed algorithm considers spectral correlations among components in adjacent frequency bins, and estimates the extended power spectral density (PSD) matrices and cross PSD vectors from the signal statistics for more precise echo estimation. In the STFT domain, the echo spectra are estimated by performing the technique without any distinguishable double-talk detector. According to the experimental results, the proposed algorithm has been found to show better performances compared with the conventional SAES method.


Applied Acoustics | 2015

Estimation of speech absence uncertainty based on multiple linear regression analysis for speech enhancement

Jihwan Park; Jong-Woong Kim; Joon-Hyuk Chang; Yu Gwang Jin; Nam Soo Kim

Collaboration


Dive into the Yu Gwang Jin's collaboration.

Top Co-Authors

Avatar
Top Co-Authors

Avatar

Jong Won Shin

Gwangju Institute of Science and Technology

View shared research outputs
Top Co-Authors

Avatar

Chul Min Lee

Seoul National University

View shared research outputs
Top Co-Authors

Avatar
Top Co-Authors

Avatar

Kiho Cho

Seoul National University

View shared research outputs
Top Co-Authors

Avatar

Soo Hyun Bae

Seoul National University

View shared research outputs
Top Co-Authors

Avatar

Jae Choi

Seoul National University

View shared research outputs
Top Co-Authors

Avatar

Jeoung Hun Kim

Seoul National University

View shared research outputs
Top Co-Authors

Avatar
Top Co-Authors

Avatar
Researchain Logo
Decentralizing Knowledge