Min-Hao Chiu
MediaTek
Network
Latest external collaboration on country level. Dive into details by clicking on the dots.
Publication
Featured researches published by Min-Hao Chiu.
international solid-state circuits conference | 2015
Chi-cheng Ju; Tsu-Ming Liu; Kun-bin Lee; Yung-Chang Chang; Han-Liang Chou; Chin-Ming Wang; Tung-Hsing Wu; Hue-Min Lin; Yi-Hsin Huang; Chia-Yun Cheng; Ting-An Lin; Chun-Chia Chen; Yu-Kun Lin; Min-Hao Chiu; Wei-Cing Li; Sheng-Jen Wang; Yen-Chieh Lai; Ping Chao; Chih-Da Chien; Meng-Jye Hu; Peng-Hao Wang; Fu-Chun Yeh; Yen-Chao Huang; Shun-Hsiang Chuang; Lien-Fei Chen; Hsiu-Yi Lin; Ming-Long Wu; Che-Hong Chen; Ryan Chen; Heng-Shou Hsu
A 4K×2K H.265/HEVC video codec chip is fabricated in a 28nm CMOS process with a core area of 2.16mm2. This LSI chip integrates a dual-standard (H.265 and H.264) video codec and a series of prevalent (VC-1, WMV-7/8/9, VP-6/8, AVS, RM-8/9/10, MPEG-2/4) decoders into a single chip. It contains 3,558K logic gates and 308KB of internal SRAM. Moreover, it simplifies intra/inter-rate-distortion optimization (RDO) processes and reduces external bandwidth via line-store SRAM pool (LSSP) and data-bus translation (DBT) techniques. For smartphone applications, it completes real-time HEVC encoding and decoding with 4096×2160 resolution and 30fps, and consumes 126.73mW (0.5nJ/pixel) of core power dissipation at 0.9V, at 494MHz (encoding) and 350MHz (decoding). 1080HD and 720HD resolutions are reported as well. The chip features are summarized in Fig. 18.6.1.
IEEE Journal of Solid-state Circuits | 2016
Chi-cheng Ju; Tsu-Ming Liu; Kun-bin Lee; Yung-Chang Chang; Han-Liang Chou; Chih-Ming Wang; Tung-Hsing Wu; Hue-Min Lin; Yi-Hsin Huang; Chia-Yun Cheng; Ting-An Lin; Chun-Chia Chen; Yu-Kun Lin; Min-Hao Chiu; Wei-Cing Li; Sheng-Jen Wang; Yen-Chieh Lai; Ping Chao; Chih-Da Chien; Meng-Jye Hu; Peng-Hao Wang; Yen-Chao Huang; Shun-Hsiang Chuang; Lien-Fei Chen; Hsiu-Yi Lin; Ming-Long Wu; Che-Hong Chen
A 4 K × 2 K H.265/HEVC video codec chip supporting 14 video standard formats is implemented on a 1.49 × 1.45 mm 2 die in a 28 nm CMOS process. Several HEVC fast algorithms reduce the coding modes by analyzing the content features leading to over 68.5% and 83% of complexity reduction in intra and inter coding, respectively. A fully parallel processing element (PE) array is adopted in SAO and IP/ME, which reduce number of accesses to SRAM by 48.7% and 78.4%, respectively. A shared memory management unit (MMU) including line-store SRAM pool (LSSP) and data bus translation (DBT) techniques efficiently reuses and packs the neighboring pixels which contribute 71.6% of external bandwidth reduction. This chip achieves 4096 × 2160@30 fps HEVC encoding/decoding and consumes 126.73 mW, 0.5 nJ/pixel of energy efficiency, under 494 MHz and 350 MHz of clock frequency, enabling 4 K video services for smart-phone applications.
european solid state circuits conference | 2014
Chi-cheng Ju; Tsu-Ming Liu; Yung-Chang Chang; Chih-Ming Wang; Hue-Min Lin; Chia-Yun Cheng; Chun-Chia Chen; Min-Hao Chiu; Sheng-Jen Wang; Ping Chao; Meng-Jye Hu; Fu-Chun Yeh; Shun-Hsiang Chuang; Hsiu-Yi Lin; Ming-Long Wu; Che-Hong Chen; Chung-Hung Tsai
A first-reported 4K×2K@60fps and Main-10 HEVC video decoder integrating 14 video formats is fabricated in a 28nm CMOS process. It adopts an Adaptive Coding Unit Balance (ACUB) and Data-Sharing Wave-front Dual-core (DSWD) architectures to lower the required working frequency by 65%. A 10-bit Smart Pixel Storage (SPS) scheme is proposed to reduce the frame buffer space by 37.5%. Moreover, a weighted memory management unit (W-MMU) and multi-standard architecture reduce DRAM bandwidth and cost by 43% and 28%, respectively. This 4K Main-10 HEVC video decoder chip integrates 3.4M gate counts with area of 2.86mm2. It achieves 530Mpixels/s throughput which is two times larger than the state-of-the-art HEVC design [6] and consumes 0.2nJ/pixel energy efficiency, enabling real-time 4K video playback for UHD-TV applications.
international conference on multimedia and expo | 2015
Tsu-Ming Liu; Yung-Chang Chang; Chih-Ming Wang; Hue-Min Lin; Chia-Yun Cheng; Chun-Chia Chen; Min-Hao Chiu; Sheng-Jen Wang; Ping Chao; Meng-Jye Hu; Fu-Chun Yeh; Shun-Hsiang Chuang; Hsiu-Yi Lin; Ming-Long Wu; Che-Hong Chen; Chia-Lin Ho; Chi-cheng Ju
A 4K and Main-10 HEVC video decoder LSI is fabricated in a 28nm CMOS process. It adopts a block-concealed processor (BcP) to improve the visual quality and a bandwidth-suppressed processor (BsP) is newly designed to reduce 30% and 45% of external data accesses in playback and gaming scenario, respectively. It features fully core scalable (FCS) architecture which lowers the required working frequency by 65%. A 10-bit compact scheme is proposed to reduce the frame buffer space by 37.5%. Moreover, a multi-standard architecture reduces are by 28%. It achieves 530Mpixels/s throughput which is two times larger than the state-of-the-art HEVC design [2] and consumes 0.2nJ/pixel energy efficiency, enabling real-time 4K video playback in Ultra-HD Blu-ray player and TV systems.
european solid state circuits conference | 2016
Chi-cheng Ju; Tsu-Ming Liu; Yung-Chang Chang; Chih-Ming Wang; Chia-Yun Cheng; Hue-Min Lin; Chun-Chia Chen; Min-Hao Chiu; Ping Chao; Ming-Long Wu; Meng-Jye Hu; Sheng-Jen Wang; Che-Hong Chen; Shun-Hsiang Chuang; Hsiu-Yi Lin; Fu-Chun Yeh; Chia-Hung Kao; Yi-Chang Chen; Chia-Lin Ho; Yenchieh Huang; Hsiao-En Chen; Chih-Wen Yang; Hsuan-Wen Peng
A lower power and area efficient VP9 and multi-standard video decoder chip is first-reported for Android 4K TV. It supports prevalent MPEG-x, VP-x, RMx, WMV-x and H.26x series video standards in a single chip. Three high-throughput techniques, look-ahead re-mapping, early stage pipeline and dynamic-scheduled bus translation, are proposed. They cuts the processing times by 51.2% compared to the state-of-the-art design [4]. Moreover, two area-efficient techniques, hybrid backward probability update and tile-to-raster scan ordering, are designed to reduce the internal memory size by 10%. A mass-production chip is fabricated in a 28nm CMOS technology with an energy efficiency of 0.19nJ/pixel and an area of 2.6mm2. Compared to the dual-core decoder design [4], this work achieves the identical performance (4K@60fps) with single core which cut one-half of chip area.
symposium on vlsi circuits | 2014
Chi-cheng Ju; Tsu-Ming Liu; Huaide Wang; Yung-Chang Chang; Chih-Ming Wang; Chang-Lin Hsieh; Brian Liu; Hue-Min Lin; Chia-Yun Cheng; Chun-Chia Chen; Min-Hao Chiu; Sheng-Jen Wang; Ping Chao; Meng-Jye Hu; Ryan Yeh; Ted Chuang; Hsiu-Yi Lin; Chung-Hung Tsai
A first-reported 4K×2K@60fps digital TV SoC processor supporting 9 video formats and integrating HDMI/MHL receiver is fabricated in a 40nm CMOS process. It adopts error compensation processor (ECP) to improve the visual quality, and designs a memory management unit and resource sharing technique to improve the throughput and area efficiency by 38.5% and 34.3%, respectively. Moreover, a lossless compression processor (LCP) is newly designed to reduce 30% and 45% of external data accesses in playback and gaming scenario, respectively. The proposed TV SoC processor includes multi-standard 4K×2K@60fps playback and 3.4Gbps HDMI receiver (Rx), and both scenario dissipate 198.15mW at 1.2V core and 3.3V I/O.
international symposium on circuits and systems | 2012
Chi-cheng Ju; Tsu-Ming Liu; Yung-Chang Chang; Chih-Ming Wang; Chun-Chia Chen; Hue-Min Lin; Chia-Yun Cheng; Min-Hao Chiu; Sheng-Jen Wang; Ping Chao; Meng-Jye Hu; Hao-Wei Li; Chung-Hung Tsai
A first-reported, sub-mW/fps/view multi-view video decoder chip fully compliant to 3D Blu-ray specifications is reported. It explores the resource sharing so as to integrate not only single-view MPEG-2/VC-1/AVC but multi-view MVC standards into a single die. Moreover, it features pipeline management and clock management units so as to improve the processing throughput and clock power efficiency. A test chip for not only single-view video but multi-view H.264/MVC decoding has been designed and fabricated using 40nm 1P7M CMOS process with core area 1.22mm2. Core power dissipation is about 46.5mW under 1920×1080 resolution of single view 60fps or stereo-view 30fps.
international conference on multimedia and expo | 2012
Chi-cheng Ju; Tsu-Ming Liu; Yeh-Lin Chu; Chuang-Chi Chiou; Bin-Jung Tsai; Te-Chi Hsiao; Ginny Chen; Pin-Huan Hsu; Chih-Ming Wang; Chun-Chia Chen; Hue-Min Lin; Chia-Yun Cheng; Min-Hao Chiu; Sheng-Jen Wang; Jiun-Yuan Wu; Yuan-Chun Lin; Yung-Chang Chang; Chung-Hung Tsai
A 3D Blu-ray-compliant multimedia processor integrating video decoder, display and graphic engines is presented. To cope with the bandwidth/cost-starved Blu-ray system, this design exploits the time-sharing techniques, leading to 31.3% and 29.1% of area reduction in display and decoder parts. Moreover, a graphic and on-screen-display hardwired handshake effectively reduces the DRAM space by 40%. A smart graphic command removal eliminates the redundant memory accesses by 14%. For 3D Blu-ray playback requirements, stereo full-HD video decoding, 24Hz display, and stereoscopic graphic UI are realized under the frequency of 333MHz, 148MHz, and 333MHz, respectively. This test chip is fabricated in 40nm CMOS process with core area of 3.92mm2 and power dissipation of 124.1mW.
asian solid state circuits conference | 2012
Chi-cheng Ju; Tsu-Ming Liu; Yi-Hau Chen; Kun-bin Lee; Chia-Yun Cheng; Hsueh-Te Chao; Chih-Ming Wang; Tung-Hsing Wu; Tin-An Lin; Han-Liang Chou; Yu-Kun Lin; Cheng-Hung Liu; Wei-Cing Li; Yi-Hsin Huang; Tsung-Chuan Ma; Chun-Chia Chen; Hue-Min Lin; Min-Hao Chiu; Sheng-Jen Wang; Yung-Chang Chang; Chung-Hung Tsai
A first dual-standard video encoder and decoder LSI providing VP8 (i.e. video format of WebM project for use of webs video) or H.264/AVC video recording and playback simultaneously is implemented with 28nm CMOS and occupies 1.94mm2 of core area. Several area-efficient techniques are realized, leading to 43.6% of area reduction. A new rate control is designed to facilitate the adaptation of video data and frame rates for network services. Two fast algorithms and new bool encoder/decoder are proposed to enhance power efficiency. This chip consumes 28.15mW and 10.02mW of VP8 encoder and decoder average power for 1080p@30fps at 0.9V, respectively.
Archive | 2016
Min-Hao Chiu; Chia-Yun Cheng; Chun-Chia Chen