Is this you? Create Your Porfile

Kai-Kuang Ma

Nanyang Technological University

Archive Network Publication Hotspot Collaboration

Network

Latest external collaboration on country level. Dive into details by clicking on the dots.

Explore More

Hotspot

Dive into the research topics where Kai-Kuang Ma is active.

Explore More

Publication

Featured researches published by Kai-Kuang Ma.

IEEE Transactions on Image Processing | 2000

A new diamond search algorithm for fast block-matching motion estimation

Shan Zhu; Kai-Kuang Ma

Based on the study of motion vector distribution from several commonly used test image sequences, a new diamond search (DS) algorithm for fast block-matching motion estimation (BMME) is proposed in this paper. Simulation results demonstrate that the proposed DS algorithm greatly outperforms the well-known three-step search (TSS) algorithm. Compared with the new three-step search (NTSS) algorithm, the DS algorithm achieves close performance but requires less computation by up to 22% on average. Experimental results also show that the DS algorithm is better than the four-step search (4SS) and block-based gradient descent search (BBGDS), in terms of mean-square error performance and required number of search points.

IEEE Transactions on Image Processing | 1999

Tri-state median filter for image denoising

Tao Chen; Kai-Kuang Ma; Lihui Chen

In this work, a novel nonlinear filter, called tri-state median (TSM) filter, is proposed for preserving image details while effectively suppressing impulse noise. We incorporate the standard median (SM) filter and the center weighted median (CWM) filter into a noise detection framework to determine whether a pixel is corrupted, before applying filtering unconditionally. Extensive simulation results demonstrate that the proposed filter consistently outperforms other median filters by balancing the tradeoff between noise reduction and detail preservation.

international conference on information and communication security | 1997

A new diamond search algorithm for fast block matching motion estimation

Shan Zhu; Kai-Kuang Ma

Based on the analysis of certain existing fast block matching algorithms (BMAs) and study of motion vector distributions of real-world image sequences, a new diamond search (DS) algorithm for fast block matching motion estimation is proposed in this paper. Simulation results demonstrate that the proposed DS algorithm greatly outperforms the well-known three-step search (TSS) algorithm. Compared with the new three-step search (NTSS) algorithm, the DS algorithm achieves similar performance but requires approximate 20%-25% less computation. Compared with some recently proposed fast BMAs, such as the four-step search (4SS) and the block-based gradient descent search (BBGDS), our DS algorithm also shows its superiority.

IEEE Transactions on Image Processing | 2002

Adaptive rood pattern search for fast block-matching motion estimation

Yao Nie; Kai-Kuang Ma

In this paper, we propose a novel and simple fast block-matching algorithm (BMA), called adaptive rood pattern search (ARPS), which consists of two sequential search stages: 1) initial search and 2) refined local search. For each macroblock (MB), the initial search is performed only once at the beginning in order to find a good starting point for the follow-up refined local search. By doing so, unnecessary intermediate search and the risk of being trapped into local minimum matching error points could be greatly reduced in long search case. For the initial search stage, an adaptive rood pattern (ARP) is proposed, and the ARPs size is dynamically determined for each MB, based on the available motion vectors (MVs) of the neighboring MBs. In the refined local search stage, a unit-size rood pattern (URP) is exploited repeatedly, and unrestrictedly, until the final MV is found. To further speed up the search, zero-motion prejudgment (ZMP) is incorporated in our method, which is particularly beneficial to those video sequences containing small motion contents. Extensive experiments conducted based on the MPEG-4 Verification Model (VM) encoding platform show that the search speed of our proposed ARPS-ZMP is about two to three times faster than that of the diamond search (DS), and our method even achieves higher peak signal-to-noise ratio (PSNR) particularly for those video sequences containing large and/or complex motion contents.

IEEE Transactions on Image Processing | 2002

Fuzzy color histogram and its use in color image retrieval

Ju Han; Kai-Kuang Ma

A conventional color histogram (CCH) considers neither the color similarity across different bins nor the color dissimilarity in the same bin. Therefore, it is sensitive to noisy interference such as illumination changes and quantization errors. Furthermore, CCHs large dimension or histogram bins requires large computation on histogram comparison. To address these concerns, this paper presents a new color histogram representation, called fuzzy color histogram (FCH), by considering the color similarity of each pixels color associated to all the histogram bins through fuzzy-set membership function. A novel and fast approach for computing the membership values based on fuzzy c-means algorithm is introduced. The proposed FCH is further exploited in the application of image indexing and retrieval. Experimental results clearly show that FCH yields better retrieval results than CCH. Such computing methodology is fairly desirable for image retrieval over large image databases.

Image and Vision Computing | 2007

Rotation-invariant and scale-invariant Gabor features for texture image retrieval

Ju Han; Kai-Kuang Ma

Conventional Gabor representation and its extracted features often yield a fairly poor performance in retrieving the rotated and scaled versions of the texture image under query. To address this issue, existing methods exploit multiple stages of transformations for making rotation and/or scaling being invariant at the expense of high computational complexity and degraded retrieval performance. The latter is mainly due to the lost of image details after multiple transformations. In this paper, a rotation-invariant and a scale-invariant Gabor representations are proposed, where each representation only requires few summations on the conventional Gabor filter impulse responses. The optimum setting of the orientation parameter and scale parameter is experimentally determined over the Brodatz and MPEG-7 texture databases. Features are then extracted from these new representations for conducting rotation-invariant or scale-invariant texture image retrieval. Since the dimension of the new feature space is much reduced, this leads to a much smaller metadata storage space and faster on-line computation on the similarity measurement. Simulation results clearly show that our proposed invariant Gabor representations and their extracted invariant features significantly outperform the conventional Gabor representation approach for rotation-invariant and scale-invariant texture image retrieval.

Signal, Image and Video Processing | 2011

A survey on super-resolution imaging

Jing Tian; Kai-Kuang Ma

The key objective of super-resolution (SR) imaging is to reconstruct a higher-resolution image based on a set of images, acquired from the same scene and denoted as ‘low-resolution’ images, to overcome the limitation and/or ill-posed conditions of the image acquisition process for facilitating better content visualization and scene recognition. In this paper, we provide a comprehensive review of SR image and video reconstruction methods developed in the literature and highlight the future research challenges. The SR image approaches reconstruct a single higher-resolution image from a set of given lower-resolution images, and the SR video approaches reconstruct an image sequence with a higher-resolution from a group of adjacent lower-resolution image frames. Furthermore, several SR applications are discussed to contribute some insightful comments on future SR research directions. Specifically, the SR computations for multi-view images and the SR video computation in the temporal domain are discussed.

IEEE Transactions on Circuits and Systems for Video Technology | 2009

Fast Mode Decision for H.264/AVC Based on Macroblock Motion Activity

Huanqiang Zeng; Canhui Cai; Kai-Kuang Ma

The intra-mode and inter-mode predictions have been made available in H.264/AVC for effectively improving coding efficiency. However, exhaustively checking for all the prediction modes for identifying the best one (commonly referred to as exhaustive mode decision) greatly increases computational complexity. In this paper, a fast mode decision algorithm, called the motion activity-based mode decision (MAMD), is proposed to speed up the encoding process by reducing the number of modes required to be checked in a hierarchical manner, and is as follows. For each macroblock, the proposed MAMD algorithm always starts with checking the rate-distortion (RD) cost computed at the SKIP mode for a possible early termination, once the RD cost value is below a predetermined ldquolowrdquo threshold. On the other hand, if the RD cost exceeds another ldquohighrdquo threshold, then this indicates that only the intra modes are worthwhile to be checked. If the computed RD cost falls between the above-mentioned two thresholds, the remaining seven modes, which are classified into three motion activity classes in our work, will be examined, and only one of the three classes will be chosen for further mode checking. The above-mentioned motion activity can be quantitatively measured, which is equal to the maximum city-block length of the motion vector taken from a set of adjacent macroblocks (i.e., region of support, ROS). This measurement is then used to determine the most possible motion-activity class for the current macroblock. Experimental results have shown that, on average, the proposed MAMD algorithm reduces the computational complexity by 62.96%, while incurring only 0.059 dB loss in PSNR (peak signal-to-noise ratio) and 0.19% increment on the total bit rate compared to that of exhaustive mode decision, which is a default approach set in the JM reference software.

IEEE Transactions on Geoscience and Remote Sensing | 2010

Unsupervised Change Detection for Satellite Images Using Dual-Tree Complex Wavelet Transform

Turgay Celik; Kai-Kuang Ma

In this paper, an unsupervised change-detection method for multitemporal satellite images is proposed. The algorithm exploits the inherent multiscale structure of the dual-tree complex wavelet transform (DT-CWT) to individually decompose each input image into one low-pass subband and six directional high-pass subbands at each scale. To avoid illumination variation issue possibly incurred in the low-pass subband, only the DT-CWT coefficient difference resulted from the six high-pass subbands of the two satellite images under comparison is analyzed in order to decide whether each subband pixel intensity has incurred a change. Such a binary decision is based on an unsupervised thresholding derived from a mixture statistical model, with a goal of minimizing the total error probability of change detection. The binary change-detection mask is thus formed for each subband, and all the produced subband masks are merged by using both the intrascale fusion and the interscale fusion to yield the final change-detection mask. For conducting the performance evaluation of change detection, the proposed DT-CWT-based unsupervised change-detection method is exploited for both the noise-free and the noisy images. Extensive simulation results clearly show that the proposed algorithm not only consistently provides more accurate detection of small changes but also demonstrates attractive robustness against noise interference under various noise types and noise levels.

IEEE Transactions on Geoscience and Remote Sensing | 2011

Multitemporal Image Change Detection Using Undecimated Discrete Wavelet Transform and Active Contours

Turgay Celik; Kai-Kuang Ma

In this paper, an unsupervised change detection method for satellite images is proposed. Owing to its robustness against noise, the undecimated discrete wavelet transform is exploited to obtain a multiresolution representation of the difference image, which is obtained from two satellite images acquired from the same geographical area but at different time instances. A region-based active contour model is then applied to the multiresolution representation of the difference image for segmenting the difference image into the “changed” and “unchanged” regions. The proposed change detection method has been conducted on two types of image data sets, i.e., the synthetic aperture radar images and the optical images. The change detection results are compared with several state-of-the-art techniques. The extensive simulation results clearly show that the proposed change detection method consistently yields superior performance.

Explore More