Network


Latest external collaboration on country level. Dive into details by clicking on the dots.

Hotspot


Dive into the research topics where Soojong Lim is active.

Publication


Featured researches published by Soojong Lim.


asia information retrieval symposium | 2006

Fine-Grained named entity recognition using conditional random fields for question answering

Changki Lee; Yi-Gyu Hwang; Hyo-Jung Oh; Soojong Lim; Jeong Heo; Chung-Hee Lee; Hyeon-Jin Kim; Ji-Hyun Wang; Myung-Gil Jang

In many QA systems, fine-grained named entities are extracted by coarse-grained named entity recognizer and fine-grained named entity dictionary. In this paper, we describe a fine-grained Named Entity Recognition using Conditional Random Fields (CRFs) for question answering. We used CRFs to detect boundary of named entities and Maximum Entropy (ME) to classify named entity classes. Using the proposed approach, we could achieve an 83.2% precision, a 74.5% recall, and a 78.6% Fl for 147 fined-grained named entity types. Moreover, we reduced the training time to 27% without loss of performance compared to a baseline model. In the question answering, The QA system with passage retrieval and AIU archived about 26% improvement over QA with passage retrieval. The result demonstrated that our approach is effective for QA.


Pattern Recognition Letters | 2013

Dependency-based semantic role labeling using sequence labeling with a structural SVM

Soojong Lim; Changki Lee; Dong-Yul Ra

Semantic Role Labeling (SRL) systems aim at determining the semantic role labels of the arguments of the predicates in natural language text. SRL systems can usually be built to work upon the result of constitient analysis (constituent-based), or dependency parsing (dependency-based). SRL systems can use either classification or sequence labeling as the main processing mechanism. In this paper, we show that a dependency-based SRL system using sequence labeling can achieve state-of-the-art performance when a new structural SVM adapted from the Pegasos algorithm is exploited for performing sequence labeling.


Journal of KIISE | 2015

Korean Semantic Role Labeling Using Structured SVM

Changki Lee; Soojong Lim; Hyunki Kim

Semantic role labeling (SRL) systems determine the semantic role labels of the arguments of predicates in natural language text. An SRL system usually needs to perform four tasks in sequence: Predicate Identification (PI), Predicate Classification (PC), Argument Identification (AI), and Argument Classification (AC). In this paper, we use the Korean Propbank to develop our Korean semantic role labeling system. We describe our Korean semantic role labeling system that uses sequence labeling with structured Support Vector Machine (SVM). The results of our experiments on the Korean Propbank dataset reveal that our method obtains a 97.13% F1 score on Predicate Identification and Classification (PIC), and a 76.96% F1 score on Argument Identification and Classification (AIC).


Journal of KIISE | 2015

Korean Semantic Role Labeling Using Domain Adaptation Technique

Soojong Lim; Yongjin Bae; Hyunki Kim; Dong-Yul Ra

Developing a high-performance Semantic Role Labeling (SRL) system for a domain requires manually annotated training data of large size in the same domain. However, such SRL training data of sufficient size is available only for a few domains. Performances of Korean SRL are degraded by almost 15% or more, when it is directly applied to another domain with relatively small training data. This paper proposes two techniques to minimize performance degradation in the domain transfer. First, a domain adaptation algorithm for Korean SRL is proposed which is based on the prior model that is one of domain adaptation paradigms. Secondly, we proposed to use simplified features related to morphological and syntactic tags, when using small-sized target domain data to suppress the problem of data sparseness. Other domain adaptation techniques were experimentally compared to our techniques in this paper, where news and Wikipedia were used as the sources and target domains, respectively. It was observed that the highest performance is achieved when our two techniques were applied together. In our systems performance, F1 score of 64.3% was considered to be 2.4~3.1% higher than the methods from other research.


Journal of KIISE | 2016

Syllable-based Korean POS Tagging Based on Combining a Pre-analyzed Dictionary with Machine Learning

Chung-Hee Lee; Joon-Ho Lim; Soojong Lim; Hyunki Kim

This study is directed toward the design of a hybrid algorithm for syllable-based Korean POS tagging. Previous syllable-based works on Korean POS tagging have relied on a sequence labeling method and mostly used only a machine learning method. We present a new algorithm integrating a machine learning method and a pre-analyzed dictionary. We used a Sejong tagged corpus for training and evaluation. While the machine learning engine achieved eojeol precision of 0.964, the proposed hybrid engine achieved eojeol precision of 0.990. In a Quiz domain test, the machine learning engine and the proposed hybrid engine obtained 0.961 and 0.972, respectively. This result indicates our method to be effective for Korean POS tagging.


Journal of KIISE | 2016

Korean Semantic Role Labeling Using Semantic Frames and Synonym Clusters

Soojong Lim; Joon-Ho Lim; Chung-Hee Lee; Hyunki Kim

Semantic information and features are very important for Semantic Role Labeling(SRL) though many SRL systems based on machine learning mainly adopt lexical and syntactic features. Previous SRL research based on semantic information is very few because using semantic information is very restricted. We proposed the SRL system which adopts semantic information, such as named entity, word sense disambiguation, filtering adjunct role based on sense, synonym cluster, frame extension based on synonym dictionary and joint rule of syntactic-semantic information, and modified verb-specific numbered roles, etc. According to our experimentations, the proposed present method outperforms those of lexical-syntactic based research works by about 3.77 (Korean Propbank) to 8.05 (Exobrain Corpus) F1-scores.


Lecture Notes in Computer Science | 2006

Fine-grained named entity recognition using conditional random fields for question answering

Changki Lee; Yi-Gyu Hwang; Hyo-Jung Oh; Soojong Lim; Jeong Heo; Chung-Hee Lee; Hyeon-Jin Kim; Ji-Hyun Wang; Myung-Gil Jang


Archive | 2010

Apparatus and method for knowledge graph stabilization

Pum Mo Ryu; Myung Gil Jang; Hyunki Kim; Yi-Gyu Hwang; Soojong Lim; Jeong Heo; Chung Hee Lee; Hyo-Jung Oh; Changki Lee; Miran Choi; Yeo Chan Yoon


Archive | 2010

Apparatus for question answering based on answer trustworthiness and method thereof

Hyo-Jung Oh; Chung-Hee Lee; Soojong Lim; Jeong Heo; Hyunki Kim; Miran Choi; Yeo-Chan Yoon; Changki Lee; Yi-Gyu Hwang; Myung-Gil Jang


Archive | 2009

Topic map based indexing and searching apparatus

Chung Hee Lee; Hyo-Jung Oh; Jeong Heo; Yi Gyu Hwang; Yeo Chan Yoon; Miran Choi; Chang Ki Lee; Soojong Lim; Hyunki Kim; Myung Gil Jang

Collaboration


Dive into the Soojong Lim's collaboration.

Top Co-Authors

Avatar

Hyunki Kim

Electronics and Telecommunications Research Institute

View shared research outputs
Top Co-Authors

Avatar

Jeong Heo

Electronics and Telecommunications Research Institute

View shared research outputs
Top Co-Authors

Avatar

Changki Lee

Electronics and Telecommunications Research Institute

View shared research outputs
Top Co-Authors

Avatar

Miran Choi

Electronics and Telecommunications Research Institute

View shared research outputs
Top Co-Authors

Avatar

Hyo-Jung Oh

Electronics and Telecommunications Research Institute

View shared research outputs
Top Co-Authors

Avatar

Myung-Gil Jang

Electronics and Telecommunications Research Institute

View shared research outputs
Top Co-Authors

Avatar

Chung Hee Lee

Electronics and Telecommunications Research Institute

View shared research outputs
Top Co-Authors

Avatar

Myung Gil Jang

Electronics and Telecommunications Research Institute

View shared research outputs
Top Co-Authors

Avatar

Yeo Chan Yoon

Electronics and Telecommunications Research Institute

View shared research outputs
Top Co-Authors

Avatar

Yi-Gyu Hwang

Electronics and Telecommunications Research Institute

View shared research outputs
Researchain Logo
Decentralizing Knowledge