Jeong Seop Sim
Inha University
Network
Latest external collaboration on country level. Dive into details by clicking on the dots.
Publication
Featured researches published by Jeong Seop Sim.
combinatorial pattern matching | 2003
Dong Kyue Kim; Jeong Seop Sim; Heejin Park; Kunsoo Park
The time complexity of suffix tree construction has been shown to be equivalent to that of sorting: O(n) for a constant-size alphabet or an integer alphabet and O(n log n) for a general alphabet. However, previous algorithms for constructing suffix arrays have the time complexity of O(n log n) even for a constant-size alphabet. In this paper we present a linear-time algorithm to construct suffix arrays for integer alphabets, which do not use suffix trees as intermediate data structures during its construction. Since the case of a constant-size alphabet can be subsumed in that of an integer alphabet, our result implies that the time complexity of directly constructing suffix arrays matches that of constructing suffix trees.
Journal of Discrete Algorithms | 2005
Dong Kyue Kim; Jeong Seop Sim; Heejin Park; Kunsoo Park
Abstract The time complexity of suffix tree construction has been shown to be equivalent to that of sorting: O ( n ) for a constant-size alphabet or an integer alphabet and O ( n log n ) for a general alphabet. However, previous algorithms for constructing suffix arrays have the time complexity of O ( n log n ) even for a constant-size alphabet. In this paper we present a linear-time algorithm to construct suffix arrays for integer alphabets, which do not use suffix trees as intermediate data structures during its construction. Since the case of a constant-size alphabet can be subsumed in that of an integer alphabet, our result implies that the time complexity of directly constructing suffix arrays matches that of constructing suffix trees.
Journal of Discrete Algorithms | 2003
Jeong Seop Sim; Kunsoo Park
Given a set S of strings, a consensus string of S based on consensus error is a string w that minimizes the sum of the distances between w and all the strings in S. In this paper, we show that the problem of finding a consensus string based on consensus error is NP-complete when the penalty matrix is a metric.
Theoretical Computer Science | 2001
Jeong Seop Sim; Costas S. Iliopoulos; Kunsoo Park; William F. Smyth
The study of approximately periodic strings is relevant to diverse applications such as molecular biology, data compression, and computer-assisted music analysis. Here we study different forms of approximate periodicity under a variety of distance functions. We consider three related problems, for two of which we derive polynomial-time algorithms; we then show that the third problem is NP-complete.
Information Processing Letters | 2015
Sukhyeun Cho; Joong Chae Na; Kunsoo Park; Jeong Seop Sim
We present a new method of deciding the order-isomorphism between two strings.We show that the bad character rule can be applied to the OPPM problem.We present a space-efficient algorithm computing the shift table for text search.We present a linear-time algorithm for an integer alphabet in the worst case. Given a text T and a pattern P, the order-preserving pattern matching (OPPM) problem is to find all substrings in T which have the same relative orders as P. The OPPM has been studied in the fields of finding some patterns affected by relative orders, not by their absolute values. In this paper, we present a method of deciding the order-isomorphism between two strings even when there are same characters. Then, we show that the bad character rule of the Horspool algorithm for generic pattern matching problems can be applied to the OPPM problem and we present a space-efficient algorithm for computing shift tables for text search. Finally, we combine our bad character rule with the KMP-based algorithm to improve the worst-case running time. We give experimental results to show that our algorithm is about 2 to 6 times faster than the KMP-based algorithm in reasonable cases.
conference on combinatorial optimization and applications | 2013
Sukhyeun Cho; Joong Chae Na; Kunsoo Park; Jeong Seop Sim
Given a text T and a pattern P, the order-preserving pattern matching (OPPM) problem is to find all substrings in T which have the same relative orders as P. The OPPM has been studied in the fields of finding some patterns affected by relative orders, not by their absolute values. For example, it can be applied to time series analysis like share prices on stock markets and to musical melody matching of two musical scores. In this paper, we present a new method of deciding the order-isomorphism between two strings even when there are same characters. Then, we show that the bad character rule of the Horspool algorithm for generic pattern matching problems can be applied to the OPPM problem. Finally, we present a fast algorithm for the OPPM problem and give experimental results to show that our algorithm is about 2 to 5 times faster than the KMP-based algorithm in reasonable cases.
string processing and information retrieval | 2009
Amihood Amir; Gad M. Landau; Joong Chae Na; Heejin Park; Kunsoo Park; Jeong Seop Sim
The consensus string problem is finding a representative string (consensus) of a given set
combinatorial pattern matching | 2010
Taehyung Lee; Joong Chae Na; Heejin Park; Kunsoo Park; Jeong Seop Sim
\mathbb{S}
Theoretical Computer Science | 2013
Taehyung Lee; Joong Chae Na; Heejin Park; Kunsoo Park; Jeong Seop Sim
of strings. In this paper we deal with the consensus string problems optimizing both distance sum and radius, where the distance sum is the sum of (Hamming) distances from the strings in
combinatorial pattern matching | 1999
Jeong Seop Sim; Costas S. Iliopoulos; Kunsoo Park; William F. Smyth
\mathbb{S}