Network


Latest external collaboration on country level. Dive into details by clicking on the dots.

Hotspot


Dive into the research topics where Shaw-Min Lei is active.

Publication


Featured researches published by Shaw-Min Lei.


IEEE Transactions on Circuits and Systems for Video Technology | 2012

Sample Adaptive Offset in the HEVC Standard

Chih-Ming Fu; Elena Alshina; Alexander Alshin; Yu-Wen Huang; Ching-Yeh Chen; Chia-Yang Tsai; Chih-Wei Hsu; Shaw-Min Lei; Jeong-hoon Park; Woo-Jin Han

This paper provides a technical overview of a newly added in-loop filtering technique, sample adaptive offset (SAO), in High Efficiency Video Coding (HEVC). The key idea of SAO is to reduce sample distortion by first classifying reconstructed samples into different categories, obtaining an offset for each category, and then adding the offset to each sample of the category. The offset of each category is properly calculated at the encoder and explicitly signaled to the decoder for reducing sample distortion effectively, while the classification of each sample is performed at both the encoder and the decoder for saving side information significantly. To achieve low latency of only one coding tree unit (CTU), a CTU-based syntax design is specified to adapt SAO parameters for each CTU. A CTU-based optimization algorithm can be used to derive SAO parameters of each CTU, and the SAO parameters of the CTU are inter leaved into the slice data. It is reported that SAO achieves on average 3.5% BD-rate reduction and up to 23.5% BD-rate reduction with less than 1% encoding time increase and about 2.5% decoding time increase under common test conditions of HEVC reference software version 8.0.


multimedia signal processing | 2011

Sample adaptive offset for HEVC

Chih-Ming Fu; Ching-Yeh Chen; Yu-Wen Huang; Shaw-Min Lei

A new video coding tool, sample adaptive offset (SAO), is introduced in this paper. SAO has been adopted into the Working Draft of the new video coding standard, High-Efficiency Video Coding (HEVC). The SAO is located after deblocking in the video coding loop. The concept of SAO is to classify reconstructed pixels into different categories and then reduce the distortion by simply adding an offset for each category of pixels. The pixel intensity and edge properties are used for pixel classification. To further improve the coding efficiency, a picture can be divided into regions for localization of offset parameters. Simulation results show that SAO can achieve on average 2% bit rate reduction and up to 6% bit rate reduction. The run time increases for encoders and decoders are only 2%.


multimedia signal processing | 2011

Motion vector coding techniques for HEVC

Jian-Liang Lin; Yi-Wen Chen; Yu-Pao Tsai; Yu-Wen Huang; Shaw-Min Lei

High Efficiency Video Coding (HEVC) is a new international video coding standard that has been developed by the Joint Collaborative Team on Video Coding (JCT-VC). In this paper, an overview of the motion vector coding techniques for HEVC is presented. Our three proposed coding tools for the motion vector predictor (MVP) in the Inter, Skip and Merge modes of HEVC are also presented, which includes a new location of the temporal MVP, a priority-based derivation method of spatial MVPs, and a derivation method of temporal MVPs. A combination of these three tools can achieve on average 1.3%, 1.8%, 1.2% and 2.2% bit rate reductions for high efficiency random access, low complexity random access, high efficiency low delay, and low complexity low delay, respectively.


visual communications and image processing | 2011

Real-time hand tracking on depth images

Chia-Ping Chen; Yu-Ting Chen; Ping-Han Lee; Yu-Pao Tsai; Shaw-Min Lei

Hand tracking is a fundamental task in a gesture recognition system. Most previous works tracked the hand position on color images and relied heavily on skin color information. However, color information is very vulnerable to lighting variations and skin color varies across difference human races. Furthermore, one can not effectively discriminate faces or other skin-color-like objects from hands when using skin color detection. In this paper, we propose a hand tracking algorithm that uses depth images only, and also a hand click detection method to initialize the hand tracking automatically. We show that depth images suffice and are advantageous to real-time hand tracking. A region growing technique is applied to segment the hand region on depth images. Then a mean-shift based algorithm accurately locates the hand center in the segmented hand region. The experimental results show that the proposed tracking algorithm runs at 300+ FPS, and the average error of the tracked 3D hand positions is less than 1 centimeter. The proposed method enables a plethora of potential applications to natural Human-Computer Interaction (HCI), and is adequate for embedded systems of consumer electronics because of its low complexity and low bandwidth requirement.


IEEE Journal of Selected Topics in Signal Processing | 2013

Adaptive Loop Filtering for Video Coding

Chia-Yang Tsai; Ching-Yeh Chen; Tomoo Yamakage; In Suk Chong; Yu-Wen Huang; Chih-Ming Fu; Takayuki Itoh; Takashi Watanabe; Takeshi Chujoh; Marta Karczewicz; Shaw-Min Lei

Adaptive loop filtering for video coding is to minimize the mean square error between original samples and decoded samples by using Wiener-based adaptive filter. The proposed ALF is located at the last processing stage for each picture and can be regarded as a tool to catch and fix artifacts from previous stages. The suitable filter coefficients are determined by the encoder and explicitly signaled to the decoder. In order to achieve better coding efficiency, especially for high resolution videos, local adaptation is used for luma signals by applying different filters to different regions or blocks in a picture. In addition to filter adaptation, filter on/off control at coding tree unit (CTU) level is also helpful for improving coding efficiency. Syntax-wise, filter coefficients are sent in a picture level header called adaptation parameter set, and filter on/off flags of CTUs are interleaved at CTU level in the slice data. This syntax design not only supports picture level optimization but also achieves a low encoding latency. Simulation results show that the ALF can achieve on average 7% bit rate reduction for 25 HD sequences. The run time increases are 1% and 10% for encoders and decoders, respectively, without special attention to optimization in C++ code.


IEEE Journal of Selected Topics in Signal Processing | 2013

Motion Vector Coding in the HEVC Standard

Jian-Liang Lin; Yi-Wen Chen; Yu-Wen Huang; Shaw-Min Lei

High Efficiency Video Coding (HEVC) is an emerging international video coding standard developed by the Joint Collaborative Team on Video Coding (JCT-VC). Compared to H.264/AVC, HEVC has achieved substantial compression performance improvement. During the HEVC standardization, we proposed several motion vector coding techniques, which were crosschecked by other experts and then adopted into the standard. In this paper, an overview of the motion vector coding techniques in HEVC is firstly provided. Next, the proposed motion vector coding techniques including a priority-based derivation algorithm for spatial motion candidates, a priority-based derivation algorithm for temporal motion candidates, a surrounding-based candidate list, and a parallel derivation of the candidate list, are also presented. Based on HEVC test model 9 (HM9), experimental results show that the combination of the proposed techniques achieves on average 3.1% bit-rate saving under the common test conditions used for HEVC development.


visual communications and image processing | 2011

One-pass encoding algorithm for adaptive loop filter in high-efficiency video coding

Chia-Yang Tsai; Ching-Yeh Chen; Chih-Ming Fu; Yu-Wen Huang; Shaw-Min Lei

In this paper, a one-pass encoding algorithm is proposed for adaptive loop filter (ALF) in high-efficiency video coding (HEVC). ALF can improve both subjective and objective video quality, but it also requires a lot of encoding passes (i.e. picture buffer accesses) that will significantly increase external memory access, encoding latency, and power consumption. Therefore, we propose a method to estimate filtering distortion without performing real filter operation. The number of encoding passes can be effectively reduced from 16 to 1. Combined with an initial guess of filter-on/off blocks by using time-delayed filters, the proposed one-pass algorithm only induces average 0.17% BD-rate increase.


international symposium on circuits and systems | 2014

Inter-view motion prediction in 3D-HEVC

Li Zhang; Ying Chen; Vijayaraghavan Thirumalai; Jian-Liang Lin; Yi-Wen Chen; Jicheng An; Shaw-Min Lei; Laurent Guillo; Thomas Guionnet; Christine Guillemot

This paper presents a novel inter-view motion prediction technique used in 3D-HEVC which provides efficient compression for motion vectors. The 3D extension of HEVC (High Efficiency Video Coding) namely 3D-HEVC is under development in JCT-3V for coding multi-view video and depth data. Inter-view motion prediction takes benefit of the inter-view correlation between views by inferring the motion information of a view from the already coded motion information in another view as well as the local disparity between two views. Experimental results show that inter-view motion prediction in 3D-HEVC provides an average bit-rate savings of 15% for the dependent views.


international conference on image processing | 2010

Enhanced intra prediction and transform for video coding

Kai Zhang; Shaw-Min Lei; Wen Gao

Intra-coding plays an important role in video coding schemes and it has become a hot research topic of the next generation video coding standard. In this paper, we propose two techniques to improve the efficiency of the prediction and the transform, respectively, for intra-coding. First, we introduce an overlapped block intra-prediction (OBIP) method, which makes use of the intra directional prediction modes of neighboring blocks as well as the mode of the current block. A position dependent weighted sum of several possible predictors will be treated as the final prediction. Second, we propose a multiple-model Karhunen-Loève transform (MMKLT) technique to further improve the mode dependent directional transform (MDDT) method. More than one KLT will be trained offline for each intra-prediction mode based on residual variances. Experimental results indicate that the proposed two methods together can achieve an average bit-rate reduction about 11% compared with H.264 or 5% compared with MDDT in all-intra coding.


Proceedings of SPIE | 2012

The adaptive loop filtering techniques in the HEVC standard

Ching-Yeh Chen; Chia-Yang Tsai; Yu-Wen Huang; Tomoo Yamakage; In Suk Chong; Chih-Ming Fu; Takayuki Itoh; Takashi Watanabe; Takeshi Chujoh; Marta Karczewicz; Shaw-Min Lei

This article introduces adaptive loop filtering (ALF) techniques being considered for the HEVC standard. The key idea of ALF is to minimize the mean square error between original pixels and decoded pixels using Wiener-based adaptive filter coefficients. ALF is located at the last processing stage of each picture and can be regarded as a tool trying to catch and fix artifacts from previous stages. The suitable filter coefficients are determined by the encoder and explicitly signaled to the decoder. In order to achieve better coding efficiency, especially for high resolution videos, local adaptation is used for luma signals by applying different filter to different region in a picture. In addition to filter adaptation, filter on/off control at largest coding unit (LCU) level is also helpful for improving coding efficiency. Syntax-wise, filter coefficients are sent in a picture level header called adaptation parameter set (APS), and filter on/off flags of LCUs are interleaved at LCU level in the slice data. Besides supporting picture-based optimization of ALF, the syntax design can support low delay applications as well. When the filter coefficients in APS are trained by using a previous picture, filter on/off decisions can be made on the fly during encoding of LCUs, so the encoding latency is only one LCU. Simulation results show that the ALF can achieve on average 5% bit rate reduction and up to 27% bit rate reduction for 25 HD sequences. The run time increases are 1% and 10% for encoders and decoders, respectively, with un-optimized C++ codes in software.

Collaboration


Dive into the Shaw-Min Lei's collaboration.

Researchain Logo
Decentralizing Knowledge