Francesc Xavier Roca
Autonomous University of Barcelona
Network
Latest external collaboration on country level. Dive into details by clicking on the dots.
Publication
Featured researches published by Francesc Xavier Roca.
articulated motion and deformable objects | 2004
Jordi Gonzàlez; Javier Varona; Francesc Xavier Roca; Juan José Villanueva
In this paper, we address the analysis of human actions by comparing different performances of the same action, i.e. walking. To achieve this goal, we define a proper human body model which maximizes the differences between human postures and, moreover, reflects the anatomical structure of the human beings. Subsequently, a human action space, called aSpace, is built in order to represent a performance, i.e., a predefined sequence of postures, as a parametric manifold. The final human action representation is called p–action, which is based on the most characteristic human body postures found during several walking performances. These postures are found automatically by means of a predefined distance function, and they are called key-frames. By using key-frames, we synchronize any performance with respect to the action model. Furthermore, by considering an arc length parameterization, independence from the speed at which performances are played is attained. Consequently, the style of human walking can be successfully analysed by establishing differences between a male and a female walkers.
EURASIP Journal on Advances in Signal Processing | 2012
Wenjuan Gong; Jordi Gonzàlez; Francesc Xavier Roca
We present a novel method for human action recognition (HAR) based on estimated poses from image sequences. We use 3D human pose data as additional information and propose a compact human pose representation, called a weak pose, in a low-dimensional space while still keeping the most discriminative information for a given pose. With predicted poses from image features, we map the problem from image feature space to pose space, where a Bag of Poses (BOP) model is learned for the final goal of HAR. The BOP model is a modified version of the classical bag of words pipeline by building the vocabulary based on the most representative weak poses for a given action. Compared with the standard k-means clustering, our vocabulary selection criteria is proven to be more efficient and robust against the inherent challenges of action recognition. Moreover, since for action recognition the ordering of the poses is discriminative, the BOP model incorporates temporal information: in essence, groups of consecutive poses are considered together when computing the vocabulary and assignment. We tested our method on two well-known datasets: HumanEva and IXMAS, to demonstrate that weak poses aid to improve action recognition accuracies. The proposed method is scene-independent and is comparable with the state-of-art method.
iberian conference on pattern recognition and image analysis | 2011
Nataliya Shapovalova; Wenjuan Gong; Marco Pedersoli; Francesc Xavier Roca; Jordi Gonzàlez
This paper is focused on the automatic recognition of human events in static images. Popular techniques use knowledge of the human pose for inferring the action, and the most recent approaches tend to combine pose information with either knowledge of the scene or of the objects with which the human interacts. Our approach makes a step forward in this direction by combining the human pose with the scene in which the human is placed, together with the spatial relationships between humans and objects. Based on standard, simple descriptors like HOG and SIFT, recognition performance is enhanced when these three types of knowledge are taken into account. Results obtained in the PASCAL 2010 Action Recognition Dataset demonstrate that our technique reaches state-of-the-art results using simple descriptors and classifiers.
ieee international conference on automatic face & gesture recognition | 2008
Javier Orozco; Ognjen Rudovic; Francesc Xavier Roca; Jordi Gonzàlez
In this paper, we address the recognition of subtle facial expressions by reasoning on the classification confidence. Psychological evidences have determined that eyelids and eyebrows are significant for the recognition of subtle facial expressions and the early perception of human emotions. This early perception results in a more complex problem, which requires a confidence assessment for any provided solution. Thus, traditional score-based classifiers (e.g. k-NN and NN) are not able to produce confident estimates. Instead, we first present five confidence estimators and a confidence classification assessment for Case-Based Reasoning (CBR). Second, we improve the expression retrieval from the database by learning the neighbourhoods dimensions for the expected classification confidences. Third, we reuse the previous classified expressions and the confidence assessment to improve the classification achieved by k-NN. Fourth, we improve the database for generalization with new subjects by learning thresholds to minimize misclassification with low confidence, maximize correct classifications with high confidence and re-arrange misclassification with high confidence. The proposed system represents an effective contribution for both subtle expression recognition and CBR methodology. It achieves an average recognition of 97% plusmn 1% with a confidence of 96% plusmn 2% for expressiveness between 20% and 100%.
iberian conference on pattern recognition and image analysis | 2007
Javier Orozco; Jordi Gonzàlez; Ignasi Rius; Francesc Xavier Roca
Most applications on Human Computer Interaction (HCI) require to extract the movements of user faces, while avoiding high memory and time expenses. Moreover, HCI systems usually use low-cost cameras, while current face tracking techniques strongly depend on the image resolution. In this paper, we tackle the problem of eyelid tracking by using Appearance-Based Models, thus achieving accurate estimations of the movements of the eyelids, while avoiding cues, which require high-resolution faces, such as edge detectors or colour information. Consequently, we can track the fast and spontaneous movements of the eyelids, a very hard task due to the small resolution of the eye regions. Subsequently, we combine the results of eyelid tracking with the estimations of other facial features, such as the eyebrows and the lips. As a result, a hierarchical tracking framework is obtained: we demonstrate that combining two appearance-based trackers allows to get accurate estimates for the eyelid, eyebrows, lips and also the 3D head pose by using low-cost video cameras and in real-time. Therefore, our approach is shown suitable to be used for further facial-expression analysis.
Archive | 2007
Jordi Gonzàlez; Francesc Xavier Roca; Juan José Villanueva
Archive | 2008
Ignasi Rius; Jordi Gonzàlez; Mikhail Mozerov; Francesc Xavier Roca
Archive | 2007
Pau Baiget; Joan Soto; Francesc Xavier Roca; Jordi Gonzàlez
iberian conference on pattern recognition and image analysis | 2011
Wenjuan Gong; Marco Pedersoli; Francesc Xavier Roca; Jordi Gonzàlez
Archive | 2007
Murad Al; Francisco J. Orozco; Ariel Amato; Francesc Xavier Roca; Jordi Gonzàlez