Dae-Lim Choi
Wonkwang University
Network
Latest external collaboration on country level. Dive into details by clicking on the dots.
Publication
Featured researches published by Dae-Lim Choi.
2011 International Conference on Speech Database and Assessments (Oriental COCOSDA) | 2011
Dae-Lim Choi; Bong-Wan Kim; Yong-Ju Lee; Yongnam Um; Minhwa Chung
In this paper we will introduce the work of creation of a speech database to develop speech technology for disabled persons, which has been done as part of a national program to help better life for Korean people. We will report about the creation of speech database of a total of 160 persons: prompting items, designs, etc. for the creation of a database which is needed to develop an embedded key-word spotting speech recognition system tailored for the persons disabled in articulation. The created database is being used by the technology development team in the national program to study the phonetic characteristics of the different types of disabled persons, develop the automatic method to assess degrees of disability, investigate the phonetic features of speech of the disabled, and design and implement the software prototype for personal embedded speech recognition systems adapted to the disabled persons.
text speech and dialogue | 2007
Bong-Wan Kim; Dae-Lim Choi; Yong-Ju Lee
In this paper, we propose Mel-cepstrum modulation energy (MCME) as an extension of modulation energy (ME) for a feature to discriminate speech and music data. MCME is extracted from the time trajectory of Mel-frequency cepstral coefficients (MFCC), while ME is based on the spectrum. As cepstral coefficients are mutually uncorrelated, we expect MCME to perform better than ME. To find out the best modulation frequency for MCME, we make experiments with 4 Hz to 20 Hz modulation frequency, and we compare the results with those obtained from the ME and the MFCC based cepstral flux. In the experiments, 8 Hz MCME shows the best discrimination performance, and it yields a discrimination error reduction rate of 71% compared with 4 Hz ME. Compared with the cepstral flux (CF), it shows an error reduction rate of 53%.
international conference on acoustics, speech, and signal processing | 2012
Bong-Wan Kim; Dae-Lim Choi; JaeDeok Lim; Seung-Wan Han; Yong-Ju Lee
The segmental two-dimensional Mel-frequency cepstral coefficient (STDMFCC) feature has been successfully used in recent studies to detect objectionable sounds, which implicitly represent both static and dynamic characteristics of signal. This study now proposes a new normalized STDMFCC to improve the content recognition performance in diverse noisy environments. Two tests were conducted to verify the performance of the proposed feature: First, an objectionable sound recognition test was conducted with 10-second clips to which white noises with diverse signal-to-noise ratios (SNRs) were added. The proposed feature in the test had an average error reduction rate (ERR) of 24.69% with respect to the STDMFCC. Second, a test was conducted based on the soundtrack that contained diverse channel environments and noises. The equal error rate (EER) of the proposed feature was 4.00% compared with 10.33% of STDMFCC, and the ERR was 61.29%.
The Journal of the Korea Contents Association | 2012
Tae-Guon Kim; Bong-Wan Kim; Dae-Lim Choi; Yong-Ju Lee
Though Android-based smart phones are being released in Korea, Korean TTS engine is not built on them and Google has not announced service or software developer`s kit related to Korean TTS officially. Thus, application developers who want to include Korean TTS capability in their application have difficulties. In this paper, we design and implement Android OS-based Korean TTS system and service. For speed, text preprocessing and synthesis libraries are implemented using Android NDK. By using Java`s thread mechanism and the AudioTrack class, the response time of TTS is minimized. For the test of implemented service, an application that reads incoming SMS is developed. The test shows that synthesized speech are generated in real-time for random sentences. By using the implemented Korean TTS service, Android application developers can transmit information easily through voice. Korean TTS service proposed and implemented in this paper overcomes shortcomings of the existing restrictive synthesis methods and provides the benefit for application developers and users.
language resources and evaluation | 2004
Yong-Ju Lee; Bong-Wan Kim; Young-Il Kim; Dae-Lim Choi; Kwang-Hyun Lee; Yongnam Um
language resources and evaluation | 2012
Dae-Lim Choi; Bong-Wan Kim; YeonWhoa Kim; Yong-Ju Lee; Yongnam Um; Minhwa Chung
Phonetics and Speech Sciences | 2014
YeonWhoa Kim; Dae-Lim Choi; Sook-Hyang Lee; Yong-Ju Lee
conference of the international speech communication association | 2006
Bong-Wan Kim; Dae-Lim Choi; Yongnam Um; Yong-Ju Lee
2017 20th Conference of the Oriental Chapter of the International Coordinating Committee on Speech Databases and Speech I/O Systems and Assessment (O-COCOSDA) | 2017
Youngjoo Suh; Younggwan Kim; Hyungjun Lim; Jahyun Goo; Youngmoon Jung; Yeonjoo Choi; Hoirin Kim; Dae-Lim Choi; Yong-Ju Lee
2016 Conference of The Oriental Chapter of International Committee for Coordination and Standardization of Speech Databases and Assessment Techniques (O-COCOSDA) | 2016
YongWun Kim; Tae-Guon Kim; LagHwan Ko; Dae-Lim Choi; Yong-Ju Lee