Network


Latest external collaboration on country level. Dive into details by clicking on the dots.

Hotspot


Dive into the research topics where Carles Ventura is active.

Publication


Featured researches published by Carles Ventura.


content based multimedia indexing | 2013

Automatic keyframe selection based on mutual reinforcement algorithm

Carles Ventura; Xavier Giro-i-Nieto; Verónica Vilaplana; Daniel Giribet; Eusebio Carasusan

This paper addresses the problem of video summarization through an automatic selection of a single representative keyframe. The proposed solution is based on the mutual reinforcement paradigm, where a keyframe is selected thanks to its highest and most frequent similarity to the rest of considered frames. Two variations of the algorithm are explored: a first one where only frames within the same video are used (intra-clip mode) and a second one where the decision also depends on the previously selected keyframes of related videos (inter-clip mode). These two algorithms were evaluated by a set of professional documentalists from a broadcasters archive, and results concluded that the proposed techniques outperform the semi-manual solution adopted so far in the company.


computer vision and pattern recognition | 2017

Interpreting CNN Models for Apparent Personality Trait Regression

Carles Ventura; David Masip; Àgata Lapedriza

This paper addresses the problem of automatically inferring personality traits of people talking to a camera. As in many other computer vision problems, Convolutional Neural Networks (CNN) models have shown impressive results. However, despite of the success in terms of performance, it is unknown what internal representation emerges in the CNN. This paper presents a deep study on understanding why CNN models are performing surprisingly well in this complex problem. We use current techniques on CNN model interpretability, combined with face detection and Action Unit (AUs) recognition systems, to perform our quantitative studies. Our results show that: (1) face provides most of the discriminative information for personality trait inference, and (2) the internal CNN representations mainly analyze key face regions such as eyes, nose, and mouth. Finally, we study the contribution of AUs for personality trait inference, showing the influence of certain AUs in the facial trait judgments.


conference on multimedia modeling | 2012

Hierarchical navigation and visual search for video keyframe retrieval

Carles Ventura; Manel Martos; Xavier Giro-i-Nieto; Verónica Vilaplana; Ferran Marqués

This work presents a browser that supports two strategies for video browsing: the navigation through visual hierarchies and the retrieval of similar images. The input videos are firstly processed by a keyframe extractor to reduce the temporal redundancy and decrease the number of elements to consider. These generated keyframes are hierarchically clustered with the Hierachical Cellular Tree (HCT) algorithm, an indexing technique that also allows the creation of data structures suitable for browsing. Different clustering criteria are available, in the current implementation, based on four MPEG-7 visual descriptors computed at the global scale. The navigation can directly drive the user to find the video timestamps that best match the query or to a keyframe which is globally similar in visual terms to the query. In the latter case, a visual search engine is also available to find other similar keyframes, based as well on MPEG-7 visual descriptors.


acm multimedia | 2013

Visual object analysis using regions and interest points

Carles Ventura

This dissertation research will explore region-based and interest points based image representations, two of the most-used image models for object detection, image classification, and visual search among other applications. We will analyze the relationship between both representations with the goal of proposing a new hybrid representation that takes advantage of the strengths and overcomes the weaknesses of both approaches. More specifically, we will focus on the gPb-owt-ucm segmentation algorithm and the SIFT local features since they are the most contrasted techniques in their respective fields. Furthermore, using an object retrieval benchmark, this dissertation research will analyze three basic questions: (i) the usefulness of an interest points hierarchy based on a contour strength signal, (ii) the influence of the context on both interest points location and description, and (iii) the analysis of regions as spatial support for bundling interest points.


international conference on image processing | 2015

Improving spatial codification in semantic segmentation

Carles Ventura; Xavier Giro-i-Nieto; Verónica Vilaplana; Kevin McGuinness; Ferran Marqués; Noel E. O'Connor

This paper explores novel approaches for improving the spatial codification for the pooling of local descriptors to solve the semantic segmentation problem. We propose to partition the image into three regions for each object to be described: Figure, Border and Ground. This partition aims at minimizing the influence of the image context on the object description and vice versa by introducing an intermediate zone around the object contour. Furthermore, we also propose a richer visual descriptor of the object by applying a Spatial Pyramid over the Figure region. Two novel Spatial Pyramid configurations are explored: Cartesian-based and crown-based Spatial Pyramids. We test these approaches with state-of-the-art techniques and show that they improve the Figure-Ground based pooling in the Pascal VOC 2011 and 2012 semantic segmentation challenges.


Multimedia Tools and Applications | 2014

Improving retrieval accuracy of Hierarchical Cellular Trees for generic metric spaces

Carles Ventura; Verónica Vilaplana; Xavier Giro-i-Nieto; Ferran Marqués

Metric Access Methods (MAMs) are indexing techniques which allow working in generic metric spaces. Therefore, MAMs are specially useful for Content-Based Image Retrieval systems based on features which use non Lp norms as similarity measures. MAMs naturally allow the design of image browsers due to their inherent hierarchical structure. The Hierarchical Cellular Tree (HCT), a MAM-based indexing technique, provides the starting point of our work. In this paper, we describe some limitations detected in the original formulation of the HCT and propose some modifications to both the index building and the search algorithm. First, the covering radius, which is defined as the distance from the representative to the furthest element in a node, may not cover all the elements belonging to the node’s subtree. Therefore, we propose to redefine the covering radius as the distance from the representative to the furthest element in the node’s subtree. This new definition is essential to guarantee a correct construction of the HCT. Second, the proposed Progressive Query retrieval scheme can be redesigned to perform the nearest neighbor operation in a more efficient way. We propose a new retrieval scheme which takes advantage of the benefits of the search algorithm used in the index building. Furthermore, while the evaluation of the HCT in the original work was only subjective, we propose an objective evaluation based on two aspects which are crucial in any approximate search algorithm: the retrieval time and the retrieval accuracy. Finally, we illustrate the usefulness of the proposal by presenting some actual applications.


McGuinness, Kevin and Mohedano, Eva and Zhang, Zhenxing and Hu, Feiyan and Albatal, Rami and Gurrin, Cathal and O'Connor, Noel E. and Smeaton, Alan F. and Salvador, Amaia and Giró-i-Nieto, Xavier and Ventura, Carles (2014) Insight Centre for Data Analytics (DCU) at TRECVid 2014: instance search and semantic indexing tasks. In: TRECVid 2014, 8-12 Nov 2014, Orlando FL.. | 2014

Insight Centre for Data Analytics (DCU) at TRECVid 2014: Instance Search and Semantic Indexing Tasks

Kevin McGuinness; Eva Mohedano; Zhenxing Zhang; Feiyan Hu; Rami Abatal; Cathal Gurrin; Noel E. O'Connor; Alan F. Smeaton; Amaia Salvador Aguilera; Xavier Giró i Nieto; Carles Ventura


2015 Digital Heritage | 2015

A new approach to digitalization and data management of cultural heritage sites

Vittorio Amos Ziparo; F. Cottefoglie; Daniele Calisi; F. Giannone; Giorgio Grisetti; Bastian Leibe; Marc Proesmans; P. Salonia; L. Van Gool; Carles Ventura; Cyrill Stachniss


british machine vision conference | 2018

Iterative Deep Learning for Road Topology Extraction.

Carles Ventura; Jordi Pont-Tuset; Sergi Caelles; Kevis-Kokitsi Maninis; Luc Van Gool


arXiv: Computer Vision and Pattern Recognition | 2017

Iterative Deep Learning for Network Topology Extraction.

Carles Ventura; Jordi Pont-Tuset; Sergi Caelles; Kevis-Kokitsi Maninis; Luc Van Gool

Collaboration


Dive into the Carles Ventura's collaboration.

Top Co-Authors

Avatar

Verónica Vilaplana

Polytechnic University of Catalonia

View shared research outputs
Top Co-Authors

Avatar

Xavier Giro-i-Nieto

Polytechnic University of Catalonia

View shared research outputs
Top Co-Authors

Avatar

Ferran Marqués

Polytechnic University of Catalonia

View shared research outputs
Top Co-Authors

Avatar
Top Co-Authors

Avatar
Top Co-Authors

Avatar

Jordi Pont-Tuset

Polytechnic University of Catalonia

View shared research outputs
Top Co-Authors

Avatar
Top Co-Authors

Avatar
Top Co-Authors

Avatar

Daniele Calisi

Sapienza University of Rome

View shared research outputs
Researchain Logo
Decentralizing Knowledge