Network


Latest external collaboration on country level. Dive into details by clicking on the dots.

Hotspot


Dive into the research topics where Kyusik Chung is active.

Publication


Featured researches published by Kyusik Chung.


Artificial Intelligence Review | 2003

Evolutionary Approach to Quantum andReversible Circuits Synthesis

Martin Lukac; Marek A. Perkowski; Hilton Goi; Mikhail Pivtoraiko; Chung Hyo Yu; Kyusik Chung; Hyunkoo Jeech; Byung-Guk Kim; Yong Duk Kim

The paper discusses theevolutionary computation approach to theproblem of optimal synthesis of Quantum andReversible Logic circuits. Our approach usesstandard Genetic Algorithm (GA) and itsrelative power as compared to previousapproaches comes from the encoding and theformulation of the cost and fitness functionsfor quantum circuits synthesis. We analyze newoperators and their role in synthesis andoptimization processes. Cost and fitnessfunctions for Reversible Circuit synthesis areintroduced as well as local optimizingtransformations. It is also shown that ourapproach can be used alternatively forsynthesis of either reversible or quantumcircuits without a major change in thealgorithm. Results are illustrated onsynthesized Margolus, Toffoli, Fredkin andother gates and Entanglement Circuits. This isfor the first time that several variants ofthese gates have been automatically synthesizedfrom quantum primitives.


IEEE Journal of Solid-state Circuits | 2006

An SoC with 1.3 gtexels/s 3-D graphics full pipeline for consumer applications

Dong-Hyun Kim; Kyusik Chung; Chang-Hyo Yu; Chun-Ho Kim; Inho Lee; Jun-Sang Bae; Young-Jun Kim; Jae-Hyeon Park; Sungbeen Kim; Yong-Ha Park; Nak-Hee Seong; Jin-Aeon Lee; Jaehong Park; Sung Yong Oh; Seh-Woong Jeong; Lee-Sup Kim

A high-speed three-dimensional (3-D) graphics SoC for consumer applications is presented. A 166-MHz 3-D graphics full pipeline engine with performance of 33 Mvertices/s and 1.3Gtexels/s, and 333-MHz ARM11 RISC processor, and video composition IPs are integrated together on a single chip. The geometry part of 3-D graphics IP provides full programmability in vertex and triangle level, and two-level multi-texturing with trilinear MIPMAP filtering are realized in the rasterization part. Per-pixel effects such as fog effects, alpha blending, and stencil test are also implemented in the proposed 3-D graphics IP. The rasterization architecture is designed for reducing external memory accesses to achieve the peak performance. The chip is fabricated using 0.13/spl mu/m CMOS technology and its area is 7.1/spl times/7.0mm/sup 2/.


compound semiconductor integrated circuit symposium | 2007

An Energy-Efficient Mobile Vertex Processor With Multithread Expanded VLIW Architecture and Vertex Caches

Chang-Hyo Yu; Kyusik Chung; Dong-Hyun Kim; Lee-Sup Kim

In this paper, a 3-D vertex processor with a floating-point four-threaded and four-issue expanded VLIW architecture and vertex caches for mobile multimedia applications is proposed. The multi-threaded datapath prevents data hazards, and the multi-issue expanded VLIW architecture enables the processor to have an opportunity to execute instructions in parallel and a well-balanced way. The efficient vertex caches are proposed and implemented for the embedded vertex processors to accelerate its geometry operations and to save bandwidth between hosts and vertex processors. The proposed architecture with the vertex caches reduces the average total energy dissipation of 44.7% compared to a conventional single-threaded SIMD architecture, and the proposed vertex processor achieves 120 M vertices/s of geometry performance which is 3.3 times faster than the previous result, and it supports OpenGL ES 2.0 and vertex shader model 3.0. The processor is implemented in a 0.18-mum 1P4M CMOS process, and the operating frequency is 100 MHz.


international solid-state circuits conference | 2006

A 120Mvertices/s multi-threaded VLIW vertex processor for mobile multimedia applications

Chang-Hyo Yu; Kyusik Chung; Dong-Hyun Kim; Lee-Sup Kim

A 3D vertex processor with a floating-point 4-threaded and 4-issue VLIW architecture and a TnL vertex cache is implemented for mobile multimedia applications in a 0.18mum 4M CMOS process. The proposed architecture efficiently reduces the total energy consumption and achieves 120Mvertices/s with a 2.5GFLOPS datapath using 157mW when operating at 100MHz


international solid-state circuits conference | 2005

An SoC with 1.3 Gtexels/s 3D graphics full pipeline engine for consumer applications

Dong-Hyun Kim; Kyusik Chung; Chang-Hyo Yu; Chun-Ho Kim; Inho Lee; Jaewan Bae; Young-Jun Kim; Young-Jin Chung; Sungbeen Kim; Yong-Ha Park; Nak-Hee Seong; Jin-Aeon Lee; Jaehong Park; Sung Yong Oh; Seh-Woong Jeong; Lee-Sup Kim

A 3D graphics SoC whose performance is 33 Mvertices/s and 1.3 Gtexels/s is designed for consumer applications. The SoC integrates an ARM11 RISC processor, a dedicated 3D graphics full pipeline engine, and video composition IPs. The SoC contains 17.9 M transistors in 50 mm/sup 2/ area and is fabricated in a 0.13 /spl mu/m 7M CMOS process.


IEEE Transactions on Very Large Scale Integration Systems | 2009

A 186-Mvertices/s 161-mW Floating-Point Vertex Processor With Optimized Datapath and Vertex Caches

Chang-Hyo Yu; Kyusik Chung; Dong-Hyun Kim; Seok-Hoon Kim; Lee-Sup Kim

In this paper, a power efficient vertex processor for mobile graphics applications is presented. A four-threaded and four-issue expanded VLIW datapath with a quad-float vertex texture fetcher is proposed by exploiting graphics specific characteristics after evaluation of several candidate architectures. Instruction-level power control methods such as operand sharing and writeback re-allocation along with operand isolations and gated clocks result in 40.4% and 82% reduction in energy dissipation and energy delay product compared to the most widely used single threaded SIMD. The proposed processor with the optimized datapath and vertex caches implemented in a 0.18- mum 1P4M CMOS process achieves 186-Mvertices/s geometry performance which is the best result among the processors that are IEEE-754 compliant.


IEEE Transactions on Very Large Scale Integration Systems | 2013

A Unified Graphics and Vision Processor With a 0.89 /spl mu/W/fps Pose Estimation Engine for Augmented Reality

Jae-Sung Yoon; Jeong-Hyun Kim; Hyo-Eun Kim; Won-Young Lee; Seok-Hoon Kim; Kyusik Chung; Jun-Seok Park; Lee-Sup Kim

A unified vision and graphics processor with three layers is shown to provide a fast pipeline for augmented reality. In the image-level layer, a 153.6 GOPS massively parallel processing unit with eight SIMD processors, each containing 128 processing elements, performs highly data-parallel operations. In the sub-image layer, a rasterizer and a pixel arranger respectively generate and reduce data-level parallelism. In the descriptor-level layer, a pose estimation engine executes sequential programs. Our processor can provide images for augmented reality at 100 fps, for a power consumption of 413 mW. This is 39% faster than a comparable smartphone implementation. Our chip is fabricated in a 0.18 μm CMOS process and contains 0.95 M gates.


IEEE Journal of Solid-state Circuits | 2010

A 116 fps/74 mW Heterogeneous 3D-Media Processor for 3-D Display Applications

Seok-Hoon Kim; Hong-Yun Kim; Young-Jun Kim; Kyusik Chung; Dong-Hyun Kim; Lee-Sup Kim

In this paper, a heterogeneous 3D-media processor is presented, which supports all 3-D display applications by combining a 3-D display IP with a 3-D graphics IP and a stereo video decoder. For mobile environments, adaptive power management scheme is proposed, which saves power consumption up to 186 mW by turning off idle functional blocks based on a target application, a target performance, and the run-time ratio between different IPs. As a result, the minimum power consumption of the processor is only 15 mW, while the overall power consumption is 201 mW. As well as the reduction of power consumption, this work shows impressive performance improvement. The proposed fast modulo operators and adopted division-free algorithm reduces the critical latencies of 3-D display image processing. The proposed fast datapath with parallel architecture increase synthesis rate up to 116 fps which is 17 times faster than a previous work. In addition, reordered operation sequence fixes memory bandwidth regardless of the number of images to be produced. In the 3-D graphics IP and the decoding IP, redundant datapath are merged using an IEEE 754 compliant floating-point vector unit to save both chip area and power consumption, which even reduces the critical latency by 30%.


international solid-state circuits conference | 2007

A 36fps SXGA 3D Display Processor with a Programmable 3D Graphics Rendering Engine

Seok-Hoon Kim; Jae-Sung Yoon; Chang-Hyo Yu; Dong-Hyun Kim; Kyusik Chung; Han Shin Lim; HyunWook Park; Lee-Sup Kim

A 3D display processor with a programmable 3D graphics rendering engine is implemented. The integrated rendering engine supports Pixel Shader 3.0 and OpenGL ES 2.0. A 3D image synthesis engine generates 3D images at 36fps. The die contains 1.74M gates and occupies 5times5mm2 in 0.18mum CMOS and dissipates 379mW at 1.8V.


international symposium on circuits and systems | 2003

A hardware-like high-level language based environment for 3D graphics architecture exploration

Inho Lee; Joung-Youn Kim; Yeon-Ho Im; Yun Seok Choi; Hyun-Chul Shin; Chang-Young Han; Dong-Hyun Kim; Hyoung-Joon Park; Young-Il Seo; Kyusik Chung; Chang-Hyo Yu; Kanghyup Chun; Lee-Sup Kim

The high complexity and the short lifetime of 3D graphics acceleration hardware increase the necessity of an environment for hardware development. For easy modification and fast testing of architecture, a high-level language based environment is desirable. Therefore, in this paper we propose a Graphics Architecture Testing Environment (GATE) that is based on Microsoft Visual C++. GATE models overall graphics hardware architecture through a modular approach, supports OpenGL, and offers easy modification and rapid testing of architecture. It also gathers computational statistics. A layered approach and Hardware Description Macro (HDM) support hardware modeling and architecture modification. Pre-defined types and operations provide statistical information. Several case studies of 3D graphics architecture on GATE show the capability of our environment.

Collaboration


Dive into the Kyusik Chung's collaboration.

Researchain Logo
Decentralizing Knowledge