Pau Rodríguez
Autonomous University of Barcelona
Network
Latest external collaboration on country level. Dive into details by clicking on the dots.
Publication
Featured researches published by Pau Rodríguez.
IEEE Transactions on Systems, Man, and Cybernetics | 2017
Pau Rodríguez; Guillem Cucurull; Jordi Gonzàlez; Josep M. Gonfaus; Kamal Nasrollahi; Thomas B. Moeslund; F. Xavier Roca
Pain is an unpleasant feeling that has been shown to be an important factor for the recovery of patients. Since this is costly in human resources and difficult to do objectively, there is the need for automatic systems to measure it. In this paper, contrary to current state-of-the-art techniques in pain assessment, which are based on facial features only, we suggest that the performance can be enhanced by feeding the raw frames to deep learning models, outperforming the latest state-of-the-art results while also directly facing the problem of imbalanced data. As a baseline, our approach first uses convolutional neural networks (CNNs) to learn facial features from VGG_Faces, which are then linked to a long short-term memory to exploit the temporal relation between video frames. We further compare the performances of using the so popular schema based on the canonically normalized appearance versus taking into account the whole image. As a result, we outperform current state-of-the-art area under the curve performance in the UNBC-McMaster Shoulder Pain Expression Archive Database. In addition, to evaluate the generalization properties of our proposed methodology on facial motion recognition, we also report competitive results in the Cohn Kanade+ facial expression database.
international conference on pattern recognition | 2016
Marco Bellantonio; Mohammad Ahsanul Haque; Pau Rodríguez; Kamal Nasrollahi; Taisi Telve; Sergio Escalera; Jordi Pérez González; Thomas B. Moeslund; Pejman Rasti; Gholamreza Anbarjafari
Automatic pain detection is a long expected solution to a prevalent medical problem of pain management. This is more relevant when the subject of pain is young children or patients with limited ability to communicate about their pain experience. Computer vision-based analysis of facial pain expression provides a way of efficient pain detection. When deep machine learning methods came into the scene, automatic pain detection exhibited even better performance. In this paper, we figured out three important factors to exploit in automatic pain detection: spatial information available regarding to pain in each of the facial video frames, temporal axis information regarding to pain expression pattern in a subject video sequence, and variation of face resolution. We employed a combination of convolutional neural network and recurrent neural network to setup a deep hybrid pain detection framework that is able to exploit both spatial and temporal pain information from facial video. In order to analyze the effect of different facial resolutions, we introduce a super-resolution algorithm to generate facial video frames with different resolution setups. We investigated the performance on the publicly available UNBC-McMaster Shoulder Pain database. As a contribution, the paper provides novel and important information regarding to the performance of a hybrid deep learning framework for pain detection in facial images of different resolution.
Expert Systems With Applications | 2018
Farhood Negin; Pau Rodríguez; Michal Koperski; Adlen Kerboua; Jordi Gonzàlez; Jérémy Bourgeois; Emmanuelle Chapoulie; Philippe Robert; Francois Bremond
Abstract Praxis test is a gesture-based diagnostic test which has been accepted as diagnostically indicative of cortical pathologies such as Alzheimer’s disease. Despite being simple, this test is oftentimes skipped by the clinicians. In this paper, we propose a novel framework to investigate the potential of static and dynamic upper-body gestures based on the Praxis test and their potential in a medical framework to automatize the test procedures for computer-assisted cognitive assessment of older adults. In order to carry out gesture recognition as well as correctness assessment of the performances we have recollected a novel challenging RGB-D gesture video dataset recorded by Kinect v2, which contains 29 specific gestures suggested by clinicians and recorded from both experts and patients performing the gesture set. Moreover, we propose a framework to learn the dynamics of upper-body gestures, considering the videos as sequences of short-term clips of gestures. Our approach first uses body part detection to extract image patches surrounding the hands and then, by means of a fine-tuned convolutional neural network (CNN) model, it learns deep hand features which are then linked to a long short-term memory to capture the temporal dependencies between video frames. We report the results of four developed methods using different modalities. The experiments show effectiveness of our deep learning based approach in gesture recognition and performance assessment tasks. Satisfaction of clinicians from the assessment reports indicates the impact of framework corresponding to the diagnosis.
Image and Vision Computing | 2018
Pau Rodríguez; Miguel Ángel Bautista; Jordi Gonzàlez; Sergio Escalera
Abstract Target encoding plays a central role when learning Convolutional Neural Networks. In this realm, one-hot encoding is the most prevalent strategy due to its simplicity. However, this so widespread encoding schema assumes a flat label space, thus ignoring rich relationships existing among labels that can be exploited during training. In large-scale datasets, data does not span the full label space, but instead lies in a low-dimensional output manifold. Following this observation, we embed the targets into a low-dimensional space, drastically improving convergence speed while preserving accuracy. Our contribution is two fold: (i) We show that random projections of the label space are a valid tool to find such lower dimensional embeddings, boosting dramatically convergence rates at zero computational cost; and (ii) we propose a normalized eigenrepresentation of the class manifold that encodes the targets with minimal information loss, improving the accuracy of random projections encoding while enjoying the same convergence rates. Experiments on CIFAR-100, CUB200-2011, Imagenet, and MIT Places demonstrate that the proposed approach drastically improves convergence speed while reaching very competitive accuracy rates.
Frontiers in Microbiology | 2015
Alexandra Merlos; Pau Rodríguez; Iván Bárcena-Uribarri; Mathias Winterhalter; Roland Benz; Teresa Vinuesa; J. Moya; Miguel Viñas
The aim was to explore the eventual role of bacteria in the induction of lung cancer by smoking habits. Viable bacteria closely related to the genus Bacillus were detected at high frequencies in lung-cancer biopsies. Similar, if not identical, microbes were isolated from cigarettes and in smog. Bacteria present in cigarettes could be transferred to a physiological solution via a “smoker” device that mimicked their potential transfer during smoking those bacteria produce exotoxins able to open transmembrane pores. These channels can be used as a way to penetrate cells of benzopyrenes and other toxic substances present in tobacco products. We hypothesize that Bacillaceae present in tobacco play a key role in the development of lung cancer.
international conference on learning representations | 2017
Pau Rodríguez; Jordi Gonzàlez; Guillem Cucurull; Josep M. Gonfaus; F. Xavier Roca
Lung Cancer | 2005
J. Matilla González; M. García-Yuste; N. Moreno-Mata; Pau Rodríguez; J. Mafé; R. Arrabal; A. Varela; R. Moreno-Balsalobre; E. Members
arXiv: Computer Vision and Pattern Recognition | 2018
Pau Rodríguez; Josep M. Gonfaus; Guillem Cucurull; F. Xavier Roca; Jordi Gonzàlez
arXiv: Computers and Society | 2018
Guillem Cucurull; Pau Rodríguez; V. Oguz Yazici; Josep M. Gonfaus; F. Xavier Roca; Jordi Gonzàlez
Archive | 2018
Pau Rodríguez; Guillem Cucurull; Jordi Pérez González; Josep M. Gonfaus