Network


Latest external collaboration on country level. Dive into details by clicking on the dots.

Hotspot


Dive into the research topics where Zhaoxiang Zhang is active.

Publication


Featured researches published by Zhaoxiang Zhang.


international conference on pattern recognition | 2008

Estimating the number of people in crowded scenes by MID based foreground segmentation and head-shoulder detection

Min Li; Zhaoxiang Zhang; Kaiqi Huang; Tieniu Tan

This paper proposes a novel method to address the problem of estimating the number of people in surveillance scenes with people gathering and waiting. The proposed method combines a MID (mosaic image difference) based foreground segmentation algorithm and a HOG (histograms of oriented gradients) based head-shoulder detection algorithm to provide an accurate estimation of people counts in the observed area. In our framework, the MID-based foreground segmentation module provides active areas for the head-shoulder detection module to detect heads and count the number of people. Numerous experiments are conducted and convincing results demonstrate the effectiveness of our method.


IEEE Transactions on Systems, Man, and Cybernetics | 2013

Incremental Learning for Video-Based Gait Recognition With LBP Flow

Maodi Hu; Yunhong Wang; Zhaoxiang Zhang; De Zhang; James J. Little

Gait analysis provides a feasible approach for identification in intelligent video surveillance. However, the effectiveness of the dominant silhouette-based approaches is overly dependent upon background subtraction. In this paper, we propose a novel incremental framework based on optical flow, including dynamics learning, pattern retrieval, and recognition. It can greatly improve the usability of gait traits in video surveillance applications. Local binary pattern (LBP) is employed to describe the texture information of optical flow. This representation is called LBP flow, which performs well as a static representation of gait movement. Dynamics within and among gait stances becomes the key consideration for multiframe detection and tracking, which is quite different from existing approaches. To simulate the natural way of knowledge acquisition, an individual hidden Markov model (HMM) representing the gait dynamics of a single subject incrementally evolves from a population model that reflects the average motion process of human gait. It is beneficial for both tracking and recognition and makes the training process of the HMM more robust to noise. Extensive experiments on widely adopted databases have been carried out to show that our proposed approach achieves excellent performance.


IEEE Transactions on Image Processing | 2012

Three-Dimensional Deformable-Model-Based Localization and Recognition of Road Vehicles

Zhaoxiang Zhang; Tieniu Tan; Kaiqi Huang; Yunhong Wang

We address the problem of model-based object recognition. Our aim is to localize and recognize road vehicles from monocular images or videos in calibrated traffic scenes. A 3-D deformable vehicle model with 12 shape parameters is set up as prior information, and its pose is determined by three parameters, which are its position on the ground plane and its orientation about the vertical axis under ground-plane constraints. An efficient local gradient-based method is proposed to evaluate the fitness between the projection of the vehicle model and image data, which is combined into a novel evolutionary computing framework to estimate the 12 shape parameters and three pose parameters by iterative evolution. The recovery of pose parameters achieves vehicle localization, whereas the shape parameters are used for vehicle recognition. Numerous experiments are conducted in this paper to demonstrate the performance of our approach. It is shown that the local gradient-based method can evaluate accurately and efficiently the fitness between the projection of the vehicle model and the image data. The evolutionary computing framework is effective for vehicles of different types and poses is robust to all kinds of occlusion.


international conference on image processing | 2009

Rapid and robust human detection and tracking based on omega-shape features

Min Li; Zhaoxiang Zhang; Kaiqi Huang; Tieniu Tan

This paper proposes a novel method for rapid and robust human detection and tracking based on the omega-shape features of peoples head-shoulder parts. There are two modules in this method. In the first module, a Viola-Jones type classifier and a local HOG (Histograms of Oriented Gradients) feature based AdaBoost classifier are combined to detect head-shoulders rapidly and effectively. Then, in the second module, each detected head-shoulder is tracked by a particle filter tracker using local HOG features to model targets appearance, which shows great robustness in scenarios of crowding, background distractors and partial occlusions. Experimental results demonstrate the effectiveness and efficiency of the proposed approach.


international conference on pattern recognition | 2010

Combining Spatial and Temporal Information for Gait Based Gender Classification

Maodi Hu; Yunhong Wang; Zhaoxiang Zhang; Yiding Wang

In this paper, we address the problem of gait based gender classification. The Gabor feature which is a new attempt for gait analysis, not only improves the robustness to the segmental noise, but also provides a feasible way to purge the additional influence factors like clothing and carrying condition changes before supervised learning. Furthermore, through the agency of Maximization of Mutual Information (MMI), the low dimensional discriminative representation is obtained as the Gabor-MMI feature. After that, gender related Gaussian Mixture Model-Hidden Markov Models (GMM-HMMs) are constructed for classification work. In this case, supervised learning reduces the dimension of parameter space, and significantly increases the gap between likelihoods of the gender models. In order to assess the performance of our proposed approach, we compare it with other methods on the standard CASIA Gait Databases (Dataset B). Experimental results demonstrate that our approach achieves better Correct Classification Rate (CCR) than the state of the art methods.


computer vision and pattern recognition | 2016

GIFT: A Real-Time and Scalable 3D Shape Search Engine

Song Bai; Xiang Bai; Zhichao Zhou; Zhaoxiang Zhang; Longin Jan Latecki

Projective analysis is an important solution for 3D shape retrieval, since human visual perceptions of 3D shapes rely on various 2D observations from different view points. Although multiple informative and discriminative views are utilized, most projection-based retrieval systems suffer from heavy computational cost, thus cannot satisfy the basic requirement of scalability for search engines. In this paper, we present a real-time 3D shape search engine based on the projective images of 3D shapes. The real-time property of our search engine results from the following aspects: (1) efficient projection and view feature extraction using GPU acceleration, (2) the first inverted file, referred as F-IF, is utilized to speed up the procedure of multi-view matching, (3) the second inverted file (S-IF), which captures a local distribution of 3D shapes in the feature manifold, is adopted for efficient context-based reranking. As a result, for each query the retrieval task can be finished within one second despite the necessary cost of IO overhead. We name the proposed 3D shape search engine, which combines GPU acceleration and Inverted File (Twice), as GIFT. Besides its high efficiency, GIFT also outperforms the state-of-the-art methods significantly in retrieval accuracy on various shape benchmarks and competitions.


IEEE Transactions on Image Processing | 2015

Relevance Metric Learning for Person Re-Identification by Exploiting Listwise Similarities

Jiaxin Chen; Zhaoxiang Zhang; Yunhong Wang

Person re-identification aims to match people across non-overlapping camera views, which is an important but challenging task in video surveillance. In order to obtain a robust metric for matching, metric learning has been introduced recently. Most existing works focus on seeking a Mahalanobis distance by employing sparse pairwise constraints, which utilize image pairs with the same person identity as positive samples, and select a small portion of those with different identities as negative samples. However, this training strategy has abandoned a large amount of discriminative information, and ignored the relative similarities. In this paper, we propose a novel relevance metric learning method with listwise constraints (RMLLCs) by adopting listwise similarities, which consist of the similarity list of each image with respect to all remaining images. By virtue of listwise similarities, RMLLC could capture all pairwise similarities, and consequently learn a more discriminative metric by enforcing the metric to conserve predefined similarity lists in a low-dimensional projection subspace. Despite the performance enhancement, RMLLC using predefined similarity lists fails to capture the relative relevance information, which is often unavailable in practice. To address this problem, we further introduce a rectification term to automatically exploit the relative similarities, and develop an efficient alternating iterative algorithm to jointly learn the optimal metric and the rectification term. Extensive experiments on four publicly available benchmarking data sets are carried out and demonstrate that the proposed method is significantly superior to the state-of-the-art approaches. The results also show that the introduction of the rectification term could further boost the performance of RMLLC.


IEEE Transactions on Image Processing | 2013

Learning the Spherical Harmonic Features for 3-D Face Recognition

Peijiang Liu; Yunhong Wang; Di Huang; Zhaoxiang Zhang; Liming Chen

In this paper, a competitive method for 3-D face recognition (FR) using spherical harmonic features (SHF) is proposed. With this solution, 3-D face models are characterized by the energies contained in spherical harmonics with different frequencies, thereby enabling the capture of both gross shape and fine surface details of a 3-D facial surface. This is in clear contrast to most 3-D FR techniques which are either holistic or feature based, using local features extracted from distinctive points. First, 3-D face models are represented in a canonical representation, namely, spherical depth map, by which SHF can be calculated. Then, considering the predictive contribution of each SHF feature, especially in the presence of facial expression and occlusion, feature selection methods are used to improve the predictive performance and provide faster and more cost-effective predictors. Experiments have been carried out on three public 3-D face datasets, SHREC2007, FRGC v2.0, and Bosphorus, with increasing difficulties in terms of facial expression, pose, and occlusion, and which demonstrate the effectiveness of the proposed method.


systems man and cybernetics | 2011

Gait-Based Gender Classification Using Mixed Conditional Random Field

Maodi Hu; Yunhong Wang; Zhaoxiang Zhang; De Zhang

This paper proposes a supervised modeling approach for gait-based gender classification. Different from traditional temporal modeling methods, male and female gait traits are competitively learned by the addition of gender labels. Shape appearance and temporal dynamics of both genders are integrated into a sequential model called mixed conditional random field (CRF) (MCRF), which provides an open framework applicable to various spatiotemporal features. In this paper, for the spatial part, pyramids of fitting coefficients are used to generate the gait shape descriptors; for the temporal part, neighborhood-preserving embeddings are clustered to allocate the stance indexes over gait cycles. During these processes, we employ evaluation functions like the partition index and Xie and Benis index to improve the feature sparseness. By fusion of shape descriptors and stance indexes, the MCRF is constructed in coordination with intra- and intergender temporary Markov properties. Analogous to the maximum likelihood decision used in hidden Markov models (HMMs), several classification strategies on the MCRF are discussed. We use CASIA (Data set B) and IRIP Gait Databases for the experiments. The results show the superior performance of the MCRF over HMMs and separately trained CRFs.


international conference on image processing | 2007

Real-Time Moving Object Classification with Automatic Scene Division

Zhaoxiang Zhang; Yinghao Cai; Kaiqi Huang; Tieniu Tan

We address the problem of moving object classification. Our aim is to classify moving objects of traffic scene videos into pedestrians, bicycles and vehicles. Instead of supervised learning and manual labeling of large training samples, our classifiers are initialized and refined online automatically. With efficient features extracted and organized, the approach can be real-time and achieve high classification accuracy. Once the view or scene changes detected, the algorithm can automatically refine the classifiers and adapt them to new environments. Experimental results demonstrate the effectiveness and robustness of the proposed approach.

Collaboration


Dive into the Zhaoxiang Zhang's collaboration.

Top Co-Authors

Avatar
Top Co-Authors

Avatar

Kaiqi Huang

Chinese Academy of Sciences

View shared research outputs
Top Co-Authors

Avatar

Tieniu Tan

Chinese Academy of Sciences

View shared research outputs
Top Co-Authors

Avatar
Top Co-Authors

Avatar

Chunlei Li

Zhongyuan University of Technology

View shared research outputs
Top Co-Authors

Avatar
Top Co-Authors

Avatar

Min Li

Chinese Academy of Sciences

View shared research outputs
Top Co-Authors

Avatar
Top Co-Authors

Avatar
Top Co-Authors

Avatar

Yiding Wang

North China University of Technology

View shared research outputs
Researchain Logo
Decentralizing Knowledge