Frank Perbet | Researchain

Archive Network Publication Hotspot Collaboration

Network

Latest external collaboration on country level. Dive into details by clicking on the dots.

Explore More

Hotspot

Dive into the research topics where Frank Perbet is active.

Explore More

Publication

Featured researches published by Frank Perbet.

International Journal of Computer Vision | 2014

Demisting the Hough Transform for 3D Shape Recognition and Registration

Oliver Woodford; Minh-Tri Pham; Atsuto Maki; Frank Perbet; Björn Stenger

In applying the Hough transform to the problem of 3D shape recognition and registration, we develop two new and powerful improvements to this popular inference method. The first, intrinsic Hough, solves the problem of exponential memory requirements of the standard Hough transform by exploiting the sparsity of the Hough space. The second, minimum-entropy Hough, explains away incorrect votes, substantially reducing the number of modes in the posterior distribution of class and pose, and improving precision. Our experiments demonstrate that these contributions make the Hough transform not only tractable but also highly accurate for our example application. Both contributions can be applied to other tasks that already use the standard Hough transform.

international conference on computer vision | 2011

A new distance for scale-invariant 3D shape recognition and registration

Minh-Tri Pham; Oliver Woodford; Frank Perbet; Atsuto Maki; Björn Stenger; Roberto Cipolla

This paper presents a method for vote-based 3D shape recognition and registration, in particular using mean shift on 3D pose votes in the space of direct similarity transforms for the first time. We introduce a new distance between poses in this space—the SRT distance. It is left-invariant, unlike Euclidean distance, and has a unique, closed-form mean, in contrast to Riemannian distance, so is fast to compute. We demonstrate improved performance over the state of the art in both recognition and registration on a real and challenging dataset, by comparing our distance with others in a mean shift framework, as well as with the commonly used Hough voting approach.

international conference on computer vision | 2009

Correlated probabilistic trajectories for pedestrian motion detection

Frank Perbet; Atsuto Maki; Björn Stenger

This paper introduces an algorithm for detecting walking motion using point trajectories in video sequences. Given a number of point trajectories, we identify those which are spatio-temporally correlated as arising from feet in walking motion. Unlike existing techniques we do not assume clean point tracks but instead propose “probabilistic trajectories” as new features to classify. These are extracted from directed acyclic graphs whose edges represent temporal point correspondences and are weighted with their matching probability in terms of appearance and location. This representation tolerates the inherent trajectory ambiguity, for example due to occlusions. We then learn the correlation between the movement of two feet using a random forest classifier. The effectiveness of the algorithm is demonstrated in experiments on image sequences captured with a static camera.

british machine vision conference | 2009

Random Forest Clustering and Application to Video Segmentation.

Frank Perbet; Björn Stenger; Atsuto Maki

This paper considers the problem of clustering large data sets in a high-dimensional space. Using a random forest, we first generate multiple partitions of the same input space, one per tree. The partitions from all trees are merged by intersecting them, resulting in a partition of higher resolution. A graph is then constructed by assigning a node to each region and linking adjacent nodes. This Graph of Superimposed Partitions (GSP) represents a remapped space of the input data where regions of high density are mapped to a larger number of nodes. Generating such a graph turns the clustering problem in the feature space into a graph clustering task which we solve with the Markov cluster algorithm (MCL). The proposed algorithm is able to capture non-convex structure while being computationally efficient, capable of dealing with large data sets. We show the clustering performance on synthetic data and apply the method to the task of video segmentation.

computer vision and pattern recognition | 2014

Human Body Shape Estimation Using a Multi-resolution Manifold Forest

Frank Perbet; Sam Johnson; Minh-Tri Pham; Björn Stenger

This paper proposes a method for estimating the 3D body shape of a person with robustness to clothing. We formulate the problem as optimization over the manifold of valid depth maps of body shapes learned from synthetic training data. The manifold itself is represented using a novel data structure, a Multi-Resolution Manifold Forest (MRMF), which contains vertical edges between tree nodes as well as horizontal edges between nodes across trees that correspond to overlapping partitions. We show that this data structure allows both efficient localization and navigation on the manifold for on-the-fly building of local linear models (manifold charting). We demonstrate shape estimation of clothed users, showing significant improvement in accuracy over global shape models and models using pre-computed clusters. We further compare the MRMF with alternative manifold charting methods on a public dataset for estimating 3D motion from noisy 2D marker observations, obtaining state-of-the-art results.

5th International Conference on 3D Body Scanning Technologies, Lugano, Switzerland, 21-22 October 2014 | 2014

Virtual Fitting by Single-Shot Body Shape Estimation

Masahiro Sekine; Kaoru Sugita; Frank Perbet; Björn Stenger; Masashi Nishiyama

We propose a novel virtual fitting system for seamlessly adjusting 2D clothing images to users by inferring their 3D body shape models from single-shot depth images. To increase fitting accuracy between virtual clothes and the body of the user, the system transforms and overlays the clothing image onto the body image in real time. Observations indicate that the system attains high fitting accuracy when overlaying clothing images captured from a person with a body shape similar to that of the user. We therefore develop a method for rapidly acquiring body shape models and selecting suitable clothing images based on shape similarity. We show a number of fitting results, and evaluate the fitting accuracy of both our method and an existing method without the consideration of the body shape.

Pattern Recognition Letters | 2013

Detecting bipedal motion from correlated probabilistic trajectories

Atsuto Maki; Frank Perbet; Björn Stenger; Roberto Cipolla

This paper is about detecting bipedal motion in video sequences by using point trajectories in a framework of classification. Given a number of point trajectories, we find a subset of points which are arising from feet in bipedal motion by analysing their spatio-temporal correlation in a pairwise fashion. To this end, we introduce probabilistic trajectories as our new features which associate each point over a sufficiently long time period in the presence of noise. They are extracted from directed acyclic graphs whose edges represent temporal point correspondences and are weighted with their matching probability in terms of appearance and location. The benefit of the new representation is that it practically tolerates inherent ambiguity for example due to occlusions. We then learn the correlation between the motion of two feet using the probabilistic trajectories in a decision forest classifier. The effectiveness of the algorithm is demonstrated in experiments on image sequences captured with a static camera, and extensions to deal with a moving camera are discussed.

international conference on computer vision | 2011

Live 3D shape reconstruction, recognition and registration

Carlos Hernández; Frank Perbet; Minh-Tri Pham; George Vogiatzis; Oliver Woodford; Atsuto Maki; Björn Stenger; Roberto Cipolla

We present a video-based system which interactively captures the geometry of a 3D object in the form of a point cloud, then recognizes and registers known objects in this point cloud in a matter of seconds (fig. 1). In order to achieve interactive speed, we exploit both efficient inference algorithms and parallel computation, often on a GPU. The system can be broken down into two distinct phases: geometry capture, and object inference. We now discuss these in further detail.

International Journal of Computer Vision | 2015

Distances and Means of Direct Similarities

Minh-Tri Pham; Oliver Woodford; Frank Perbet; Atsuto Maki; Riccardo Gherardi; Björn Stenger; Roberto Cipolla

The non-Euclidean nature of direct isometries in a Euclidean space, i.e. transformations consisting of a rotation and a translation, creates difficulties when computing distances, means and distributions over them, which have been well studied in the literature. Direct similarities, transformations consisting of a direct isometry and a positive uniform scaling, present even more of a challenge—one which we demonstrate and address here. In this article, we investigate divergences (a superset of distances without constraints on symmetry and sub-additivity) for comparing direct similarities, and means induced by them via minimizing a sum of squared divergences. We analyze several standard divergences: the Euclidean distance using the matrix representation of direct similarities, a divergence from Lie group theory, and the family of all left-invariant distances derived from Riemannian geometry. We derive their properties and those of their induced means, highlighting several shortcomings. In addition, we introduce a novel family of left-invariant divergences, called SRT divergences, which resolve several issues associated with the standard divergences. In our evaluation we empirically demonstrate the derived properties of the divergences and means, both qualitatively and quantitatively, on synthetic data. Finally, we compare the divergences in a real-world application: vote-based, scale-invariant object recognition. Our results show that the new divergences presented here, and their means, are both more effective and faster to compute for this task.

Machine Learning for Computer Vision | 2013

Scale-Invariant Vote-Based 3D Recognition and Registration from Point Clouds

Minh-Tri Pham; Oliver Woodford; Frank Perbet; Atsuto Maki; Riccardo Gherardi; Björn Stenger; Roberto Cipolla

This chapter presents a method for vote-based 3D shape recognition and registration, in particular using mean shift on 3D pose votes in the space of direct similarity transformations for the first time. We introduce a new distance between poses in this space—the SRT distance. It is left-invariant, unlike Euclidean distance, and has a unique, closed-form mean, in contrast to Riemannian distance, so is fast to compute. We demonstrate improved performance over the state of the art in both recognition and registration on a (real and) challenging dataset, by comparing our distance with others in a mean shift framework, as well as with the commonly used Hough voting approach.

Explore More