Duc Thanh Nguyen
University of Wollongong
Network
Latest external collaboration on country level. Dive into details by clicking on the dots.
Publication
Featured researches published by Duc Thanh Nguyen.
Pattern Recognition | 2013
Duc Thanh Nguyen; Philip Ogunbona; Wanqing Li
Motivated by the discriminative ability of shape information and local patterns in object recognition, this paper proposes a window-based object descriptor that integrates both cues. In particular, contour templates representing object shape are used to derive a set of so-called key points at which local appearance features are extracted. These key points are located using an improved template matching method that utilises both spatial and orientation information in a simple and effective way. At each of the extracted key points, a new local appearance feature, namely non-redundant local binary pattern (NR-LBP), is computed. An object descriptor is formed by concatenating the NR-LBP features from all key points to encode the shape as well as the appearance of the object. The proposed descriptor was extensively tested in the task of detecting humans from static images on the commonly used MIT and INRIA datasets. The experimental results have shown that the proposed descriptor can effectively describe non-rigid objects with high articulation and improve the detection rate compared to other state-of-the-art object descriptors.
international conference on image processing | 2010
Duc Thanh Nguyen; Zhimin Zong; Philip Ogunbona; Wanqing Li
Local Binary Pattern (LBP) as a descriptor, has been successfully used in various object recognition tasks because of its discriminative property and computational simplicity. In this paper a variant of the LBP referred to as Non-Redundant Local Binary Pattern (NRLBP) is introduced and its application for object detection is demonstrated. Compared with the original LBP descriptor, the NRLBP has advantage of providing a more compact description of objects appearance. Furthermore, the NRLBP is more discriminative since it reflects the relative contrast between the background and foreground. The proposed descriptor is employed to encode humans appearance in a human detection task. Experimental results show that the NRLBP is robust and adaptive with changes of the background and foreground and also outperforms the original LBP in detection task.
digital image computing techniques and applications | 2014
Lijuan Zhou; Wanqing Li; Yuyao Zhang; Philip Ogunbona; Duc Thanh Nguyen; Hanling Zhang
This paper presents a method for extracting discriminative key poses for skeleton-based action recognition. Poses are represented by normalized joint locations, velocities and accelerations of skeleton joints. An extended label consistent K-SVD (ELC-KSVD) algorithm is proposed for learning the common and action-specific dictionaries. Discriminative key poses are represented by the atoms of the action-specific dictionaries. With the specific dictionaries, sparse codes are obtained for representing action instances through max pooling and temporal pyramid. A SVM classifier is trained for action recognition. The proposed method was evaluated on the MSRC-12 gesture and MSR-Action 3D datasets. Experimental results have shown that the proposed method is effective in extracting discriminative key poses.
multimedia signal processing | 2011
Hongda Tian; Wanqing Li; Philip Ogunbona; Duc Thanh Nguyen; Ce Zhan
This paper presents a novel and low complexity method for real-time video-based smoke detection. As a local texture operator, Non-Redundant Local Binary Pattern (NRLBP) is more discriminative and robust to illumination changes in comparison with original Local Binary Pattern (LBP), thus is employed to encode the appearance information of smoke. Non-Redundant Local Motion Binary Pattern (NRLMBP), which is computed on the difference image of consecutive frames, is introduced to capture the motion information of smoke. Experimental results show that NRLBP outperforms the original LBP in the smoke detection task. Furthermore, the combination of NRLBP and NRLMBP, which can be considered as a spatial-temporal descriptor of smoke, can lead to remarkable improvement on detection performance.
Neurocomputing | 2014
Duc Thanh Nguyen; Zhimin Zong; Philip Ogunbona; Yasmine Probst; Wanqing Li
Abstract This paper proposes food image classification methods exploiting both local appearance and global structural information of food objects. The contribution of the paper is threefold. First, non-redundant local binary pattern (NRLBP) is used to describe the local appearance information of food objects. Second, the structural information of food objects is represented by the spatial relationship between interest points and encoded using a shape context descriptor formed from those interest points. Third, we propose two methods of integrating appearance and structural information for the description and classification of food images. We evaluated the proposed methods on two datasets. Experimental results verified that the combination of local appearance and structural features can improve classification performance.
international symposium on multimedia | 2010
Zhimin Zong; Duc Thanh Nguyen; Philip Ogunbona; Wanqing Li
This paper proposes a food image classification method using local textural patterns and their global structure to describe the food image. In this paper, a visual codebook of local textural patterns is created by employing Scale Invariant Feature Transformation (SIFT) interest point detector with the Local Binary Pattern (LBP) feature. In addition to describing the food image using local texture, the global structure of the food object is represented as the spatial distribution of the local textural structures and encoded using shape context. We evaluated the proposed method on the Pittsburgh Fast-Food Image (PFI) dataset. Experimental results showed that the proposed method could obtain better performance than the baseline experiment on the PFI dataset.
international conference on image processing | 2011
Duc Thanh Nguyen; Philip Ogunbona; Wanqing Li
This paper presents a human detection method using contour-based local motion features. The local motion is encoded using a variant of the popular Local Binary Pattern (LBP) called Non-Redundant Local Binary Pattern (NRLBP) descriptor computed on the difference image of two consecutive frames. In addition, the local motion features are extracted along the humans boundary contour. Localising features on the contours has the advantage of utilizing a precise human shape description. A motivation of the proposed method is that most of informative movements are performed on boundary contours of the body parts, e.g. legs of pedestrians. Evaluation of the proposed method was conducted on the INRIA and ETH datasets. Apart from showing the importance of motion information, experimental results also showed that localising features along the object boundary contours improves the detection performance.
Neurocomputing | 2013
Duc Thanh Nguyen; Wanqing Li; Philip Ogunbona
Detecting multiple humans in crowded scenes is challenging because the humans are often partially or even totally occluded by each other. In this paper, we propose a novel algorithm for partial inter-occlusion reasoning in human detection based on variational mean field theory. The proposed algorithm can be integrated with various part-based human detectors using different types of features, object representations, and classifiers. The algorithm takes as the input an initial set of possible human objects (hypotheses) detected using a part-based human detector. Each hypothesis is decomposed into a number of parts and the occlusion status of each part is inferred by the proposed algorithm. Specifically, initial detections (hypotheses) with spatial layout information are represented in a graphical model and the inference is formulated as an estimation of the marginal probability of the observed data in a Bayesian network. The variational mean field theory is employed as an effective estimation technique. The proposed method was evaluated on popular datasets including CAVIAR, iLIDS, and INRIA. Experimental results have shown that the proposed algorithm is not only able to detect humans under severe occlusion but also enhance the detection performance when there is no occlusion.
image and vision computing new zealand | 2009
Duc Thanh Nguyen; Wanqing Li; Philip Ogunbona
This paper proposes a part-based template matching method for multi-view human detection. The proposed method includes two stages: matching and verification. In particular, the best individual matching parts given a detection window are determined using an improved template matching algorithm. The hypothesis of the matched parts forming a human is then verified by employing a Bayesian-based model. The verification is not only based on the matching costs of individual parts but also how well the combining the matched parts satisfying the configuration constraints of the human body. Experimental results have shown that the proposed method is robust for detecting humans at multiple views and outperforms other template matching-based methods.
international conference on control, automation, robotics and vision | 2010
Duc Thanh Nguyen; Wanqing Li; Philip Ogunbona
Motivated by the advantages of using shape matching technique in detecting objects in various postures and viewpoints and the discriminative power of local patterns in object recognition, this paper proposes a human detection method combining both shape and appearance cues. In particular, local shapes of the body parts are detected using template matching. Based on body parts shapes, local appearance features are extracted. We introduce a novel local binary pattern (LBP) descriptor, called Non-Redundant LBP (NRLBP), to encode local appearance of human. The proposed method was evaluated and compared with other state-of-the-art human detection methods on two commonly used datasets: MIT and INRIA pedestrian test sets. We also performed extensive experiments on selecting appropriate parameters as well as verifying the improvement of the proposed method through all stages of the framework.