Network


Latest external collaboration on country level. Dive into details by clicking on the dots.

Hotspot


Dive into the research topics where Siwoon Son is active.

Publication


Featured researches published by Siwoon Son.


international conference on big data and smart computing | 2017

Anomaly detection for big log data using a Hadoop ecosystem

Siwoon Son; Myeong-Seon Gil; Yang-Sae Moon

In this paper, we address a novel method to efficiently manage and analyze a large amount of log data. First, we present a new Apache Hive-based data storage and analysis architecture to process a large volume of Hadoop log data, which rapidly occur in multiple nodes. Second, we design and implement three simple but efficient anomaly detection methods. These methods use moving average and 3-sigma techniques to detect anomalies in log data. Finally, we show that all the three methods detect abnormal intervals properly, and the weighted anomaly detection methods are more precise than the basic one. These results indicate that our research is an excellent and simple approach in detecting anomalies of log data on a Hadoop ecosystem.


The Journal of Supercomputing | 2017

Prefetching-based metadata management in Advanced Multitenant Hadoop

Minh Chau Nguyen; Hee-Sun Won; Siwoon Son; Myeong-Seon Gil; Yang-Sae Moon

Metadata management is an essential part in Apache Hadoop. Performing optimization of metadata accesses enhances big data storing, processing and analyzing, especially in multitenant environments. Nevertheless, as environmental complexity increases, metadata management is becoming more challenging and costly because of the heavy performance issues. In this paper, we propose a novel approach to improve the performance of metadata management for Hadoop in the multitenant environment based on the prefetching mechanism. We create metadata access graphs based on historical access values, define access patterns and then perform prefetching potential items for the near-future requests to minimize the latency. We present a formal algorithm to apply the prefetching mechanism into the Hadoop system and perform the actual implementation on a recent Hadoop system. Experimental results show that the proposed approach can enable the high performance for metadata management as well as maintain advanced multitenancy features.


Archive | 2016

Hive-Based Anomaly Detection in Hadoop Log Data Management

Siwoon Son; Myeong-Seon Gil; Seokwoo Yang; Yang-Sae Moon

In this paper, we address how to manage and analyze a large volume of log data, which have been difficult to be handled in the traditional computing environment. To handle a large volume of Hadoop log data, which rapidly occur in multiple servers, we present new data storage architecture to efficiently analyze those big log data through Apache Hive. We then design and implement a simple but efficient anomaly detection method, which identifies abnormal status of servers from log data, based on moving average and 3-sigma techniques. We also show effectiveness of the proposed detection method by demonstrating that it properly detects anomalies from Hadoop log data.


database systems for advanced applications | 2015

Performance Analysis of Hadoop-Based SQL and NoSQL for Processing Log Data

Siwoon Son; Myeong-Seon Gil; Yang-Sae Moon; Hee-Sun Won

Recently, many companies and research organizations are seeking scalable solutions by using Hadoop ecosystems. The log data management with large-scale and real-time properties is one of the appropriate application on top of Hadoop. In this paper, we focus on SQL and NoSQL choices for building Hadoop-based log data management system. For this purpose, we first select major products supporting SQL and NoSQL, and we then present an appropriate scheme for each product by considering its own characteristics. All the schema are for real-time monitoring and analyzing the log data. For each product, we implement insertion and selection operations of log data in Hadoop, and we analyze the performance of these operation. Analysis results show that MariaDB and MongoDB are fast in the insertion, and PostgreSQL and HBase are fast in the selection. We believe that our evaluation results will be very helpful for users to choose Hadoop SQL and NoSQL products for handling large-scale and real-time log data.


network operations and management symposium | 2018

Design and implementation of a load shedding engine for solving starvation problems in Apache Kafka

Jiwon Bang; Siwoon Son; Hajin Kim; Yang-Sae Moon; Mi-Jung Choi


international conference on big data and smart computing | 2018

Locality Aware Traffic Distribution in Apache Storm for Energy Analytics Platform

Siwoon Son; Sanghun Lee; Myeong-Seon Gil; Mi-Jung Choi; Yang-Sae Moon


international conference on big data security on cloud | 2017

A Storm-Based Tag Cloud Platform for Multiple SNS Users

Siwoon Son; Dasol Kim; Myeong-Seon Gil; Yang-Sae Moon


KIPS Transactions on Software and Data Engineering | 2017

Storm-Based Dynamic Tag Cloud for Real-Time SNS Data

Siwoon Son; Dasol Kim; Sujeong Lee; Myeong-Seon Gil; Yang-Sae Moon


KIISE Transactions on Computing Practices | 2017

Anomaly Detection Technique of Log Data Using Hadoop Ecosystem

Siwoon Son; Myeong-Seon Gil; Yang-Sae Moon


KIISE Transactions on Computing Practices | 2017

Efficient Locality-Aware Traffic Distribution in Apache Storm

Siwoon Son; Sanghun Lee; Yang-Sae Moon

Collaboration


Dive into the Siwoon Son's collaboration.

Top Co-Authors

Avatar

Yang-Sae Moon

Kangwon National University

View shared research outputs
Top Co-Authors

Avatar

Myeong-Seon Gil

Kangwon National University

View shared research outputs
Top Co-Authors

Avatar

Hee-Sun Won

Electronics and Telecommunications Research Institute

View shared research outputs
Top Co-Authors

Avatar

Mi-Jung Choi

Kangwon National University

View shared research outputs
Top Co-Authors

Avatar

Sanghun Lee

Kangwon National University

View shared research outputs
Top Co-Authors

Avatar

Hajin Kim

Kangwon National University

View shared research outputs
Top Co-Authors

Avatar

Jiwon Bang

Kangwon National University

View shared research outputs
Top Co-Authors

Avatar

Minh Chau Nguyen

Electronics and Telecommunications Research Institute

View shared research outputs
Top Co-Authors

Avatar

Seokwoo Yang

Kangwon National University

View shared research outputs
Researchain Logo
Decentralizing Knowledge