Network


Latest external collaboration on country level. Dive into details by clicking on the dots.

Hotspot


Dive into the research topics where Yoonseo Choi is active.

Publication


Featured researches published by Yoonseo Choi.


ACM Transactions on Architecture and Code Optimization | 2013

Fast modulo scheduler utilizing patternized routes for coarse-grained reconfigurable architectures

Won-Sub Kim; Yoonseo Choi; Haewoo Park

Coarse-Grained Reconfigurable Architectures (CGRAs) present a potential of high compute throughput with energy efficiency. A CGRA consists of an array of Functional Units (FUs), which communicate with each other through an interconnect network containing transmission nodes and register files. To achieve high performance from the software solutions mapped onto CGRAs, modulo scheduling of loops is generally employed. One of the key challenges in modulo scheduling for CGRAs is to explicitly handle routings of operands from a source to a destination operations through various routing resources. Existing modulo schedulers for CGRAs are slow because finding a valid routing is generally a searching problem over a large space, even with the guidance of well-defined cost metrics. Applications in traditional embedded multimedia domains are regarded as relatively tolerant to a slow compile time in exchange for a high-quality solution. However, many rapidly growing domains of applications, such as 3D graphics, require a fast compilation. Entrances of CGRAs to these domains have been blocked mainly due to their long compile time. We attack this problem by utilizing patternized routes, for which resources and time slots for a success can be estimated in advance when a source operation is placed. By conservatively reserving predefined resources at predefined time slots, future routings originating from the source operation are guaranteed. Experiments on a real-world 3D graphics benchmark suite show that our scheduler improves the compile time up to 6,000 times while achieving an average 70% throughputs of the state-of-the-art CGRA modulo scheduler, the Edge-centric Modulo Scheduler (EMS).


international conference on consumer electronics | 2014

JTS-based static branch prediction

Tai-song Jin; Jin-Seok Lee; Min-wook Ahn; Yoonseo Choi; Do Hyung Kim; Shihwa Lee

VLIW architectures are popular design choices in embedded computing market because of its capability of delivering performance with low power. Branch prediction plays a key role for minimizing pipeline stalls due to control hazard. Though a hardware branch predictor can result in good predictions, its HW cost often hinders it from being used in low-power VLIW architectures. On the other hand, a software branch prediction by the compiler can achieve comparable prediction quality utilizing delay slots intelligently without HW cost. In this paper, we propose a novel static branch prediction technique using jump target setting (JTS) instructions. The JTS-enabled VLIW architecture is successfully shipped in several commercial consumer electronic devices from Samsung. In our experiment using multimedia applications, the proposed branch prediction scheme outperforms the conventional static branch prediction with delay slots by 9%.


international conference on consumer electronics | 2014

Nop compression scheme for high speed DSPs based on VLIW architecture

Tai-song Jin; Min-wook Ahn; Dong-hoon Yoo; Dong-kwan Suh; Yoonseo Choi; Do-Hyung Kim; Shihwa Lee

VLIW (Very Long Instruction Word) is one of the most popular architectures in embedded systems because it has features of low power consumption and low hardware cost. Due to the nature of VLIW architecture such as bundled instructions and large register files, VLIW processors are running with large size of instruction codes in relatively low clock frequency. However compact instruction size and high clock frequency are the most important requirements of modern embedded consumer electronics. In this paper we propose a novel instruction compression scheme to solve the addressed problem. The experiment shows that the proposed scheme can reduce instruction size by 23% and improve clock frequency by 25% in average comparing with conventional compression schemes.


Archive | 2015

METHOD AND APPARATUS FOR CONTROLLING RENDERING QUALITY

Choonki Jang; Haewoo Park; Yoonseo Choi


Archive | 2017

Selecting a memory mapping scheme by determining a number of functional units activated in each cycle of a loop based on analyzing parallelism of a loop

Yoonseo Choi; Tai-song Jin; Dong-hoon Yoo


Archive | 2015

Apparatus and method of controlling power consumption of graphic processing unit (gpu) resources

Yoonseo Choi; Choonki Jang; Haewoo Park; Hyeongseok Yu; Dong-hoon Yoo


Archive | 2015

Multiple-thread processing methods and apparatuses

Minkyu Jeong; Haewoo Park; Min-Young Son; Choonki Jang; Yoonseo Choi; Dong-hoon Yoo


한국방송미디어공학회 학술발표대회 논문집 | 2014

CGRA Compilation Boost up for Acceleration of Graphics

Won-Sub Kim; Yoonseo Choi; Jaehyun Kim


Archive | 2014

Optimizing configuration memory by sequentially mapping the generated configuration data into fields having different sizes by determining regular encoding is not possible

Yoonseo Choi; Tai-song Jin; Dong-hoon Yoo


Archive | 2014

Scheduler and scheduling method for reconfigurable architecture

Won-Sub Kim; Yoonseo Choi; Haewoo Park

Collaboration


Dive into the Yoonseo Choi's collaboration.

Top Co-Authors

Avatar
Top Co-Authors

Avatar

Choonki Jang

Seoul National University

View shared research outputs
Top Co-Authors

Avatar
Top Co-Authors

Avatar
Top Co-Authors

Avatar

Do Hyung Kim

Electronics and Telecommunications Research Institute

View shared research outputs
Top Co-Authors

Avatar

Do-Hyung Kim

Pukyong National University

View shared research outputs
Researchain Logo
Decentralizing Knowledge