Hanbyul Joo
Electronics and Telecommunications Research Institute
Network
Latest external collaboration on country level. Dive into details by clicking on the dots.
Publication
Featured researches published by Hanbyul Joo.
computer vision and pattern recognition | 2017
Tomas Simon; Hanbyul Joo; Iain A. Matthews; Yaser Sheikh
We present an approach that uses a multi-camera system to train fine-grained detectors for keypoints that are prone to occlusion, such as the joints of a hand. We call this procedure multiview bootstrapping: first, an initial keypoint detector is used to produce noisy labels in multiple views of the hand. The noisy detections are then triangulated in 3D using multiview geometry or marked as outliers. Finally, the reprojected triangulations are used as new labeled training data to improve the detector. We repeat this process, generating more labeled data in each iteration. We derive a result analytically relating the minimum number of views to achieve target true and false positive rates for a given detector. The method is used to train a hand keypoint detector for single images. The resulting keypoint detector runs in realtime on RGB images and has accuracy comparable to methods that use depth sensors. The single view detector, triangulated over multiple views, enables 3D markerless hand motion capture with complex object interactions.
international conference on computer vision | 2015
Hanbyul Joo; Hao Liu; Lei Tan; Lin Gui; Bart C. Nabbe; Iain A. Matthews; Takeo Kanade; Shohei Nobuhara; Yaser Sheikh
We present an approach to capture the 3D structure and motion of a group of people engaged in a social interaction. The core challenges in capturing social interactions are: (1) occlusion is functional and frequent, (2) subtle motion needs to be measured over a space large enough to host a social group, and (3) human appearance and configuration variation is immense. The Panoptic Studio is a system organized around the thesis that social interactions should be measured through the perceptual integration of a large variety of view points. We present a modularized system designed around this principle, consisting of integrated structural, hardware, and software innovations. The system takes, as input, 480 synchronized video streams of multiple people engaged in social activities, and produces, as output, the labeled time-varying 3D structure of anatomical landmarks on individuals in the space. The algorithmic contributions include a hierarchical approach for generating skeletal trajectory proposals, and an optimization framework for skeletal reconstruction with trajectory re-association.
computer vision and pattern recognition | 2014
Hanbyul Joo; Hyun Soo Park; Yaser Sheikh
Many traditional challenges in reconstructing 3D motion, such as matching across wide baselines and handling occlusion, reduce in significance as the number of unique viewpoints increases. However, to obtain this benefit, a new challenge arises: estimating precisely which cameras observe which points at each instant in time. We present a maximum a posteriori (MAP) estimate of the time-varying visibility of the target points to reconstruct the 3D motion of an event from a large number of cameras. Our algorithm takes, as input, camera poses and image sequences, and outputs the time-varying set of the cameras in which a target patch is visible and its reconstructed trajectory. We model visibility estimation as a MAP estimate by incorporating various cues including photometric consistency, motion consistency, and geometric consistency, in conjunction with a prior that rewards consistent visibilities in proximal cameras. An optimal estimate of visibility is obtained by finding the minimum cut of a capacitated graph over cameras. We demonstrate that our method estimates visibility with greater accuracy, and increases tracking performance producing longer trajectories, at more locations, and at higher accuracies than methods that ignore visibility or use photometric consistency alone.
international conference on robotics and automation | 2009
Hanbyul Joo; Yekeun Jeong; Olivier Duchenne; Seong-Young Ko; In So Kweon
Shape is one of the useful information for object detection. The human visual system can often recognize objects based on the 2-D outline shape alone. In this paper, we address the challenging problem of shape matching in the presence of complex background clutter and occlusion. To this end, we propose a graph-based approach for shape matching. Unlike prior methods which measure the shape similarity without considering the relation among edge pixels, our approach uses the connectivity of edge pixels by generating a graph. A group of connected edge pixels, which is represented by an “edge” of the graph, is considered together and their similarity cost is defined for the “edge” weight by explicit comparison with the corresponding template part. This approach provides the key advantage of reducing ambiguity even in the presence of background clutter and occlusion. The optimization is performed by means of a graph-based dynamic algorithm. The robustness of our method is demonstrated for several examples including long video sequences. Finally, we applied our algorithm to our grasping robot system by providing the object information in the form of prompt hand-drawn templates.
international conference on consumer electronics | 2012
Seong-Jae Lim; Hanbyul Joo; Ji Hyung Lee; Bon-Ki Koo
This paper presents a fully-automatic 3D character model generating system for Smart TV contents. The system applies 3D template model transferring and sweep animation technique for automatic rigging of the 3D character model.
IEEE Transactions on Pattern Analysis and Machine Intelligence | 2017
Hanbyul Joo; Tomas Simon; Xulong Li; Hao Liu; Lei Tan; Lin Gui; Sean Banerjee; Timothy Scott Godisart; Bart C. Nabbe; Iain A. Matthews; Takeo Kanade; Shohei Nobuhara; Yaser Sheikh
We present an approach to capture the 3D motion of a group of people engaged in a social interaction. The core challenges in capturing social interactions are: (1) occlusion is functional and frequent; (2) subtle motion needs to be measured over a space large enough to host a social group; (3) human appearance and configuration variation is immense; and (4) attaching markers to the body may prime the nature of interactions. The Panoptic Studio is a system organized around the thesis that social interactions should be measured through the integration of perceptual analyses over a large variety of view points. We present a modularized system designed around this principle, consisting of integrated structural, hardware, and software innovations. The system takes, as input, 480 synchronized video streams of multiple people engaged in social activities, and produces, as output, the labeled time-varying 3D structure of anatomical landmarks on individuals in the space. Our algorithm is designed to fuse the “weak” perceptual processes in the large number of views by progressively generating skeletal proposals from low-level appearance cues, and a framework for temporal refinement is also presented by associating body parts to reconstructed dense 3D trajectory stream. Our system and method are the first in reconstructing full body motion of more than five people engaged in social interactions without using markers. We also empirically demonstrate the impact of the number of views in achieving this goal.
international conference on image processing | 2011
Hanbyul Joo; Yekeun Jeong; Olivier Duchenne; In-So Kweon
In this paper, we propose a graph-based shape matching method for deformable objects. In our approach, a graph is generated from an over-segmented input image, and the shape matching problem is treated as finding an optimal cycle in the graph. Given a shape template and a graph generated from the input, a product graph is generated to consider every possible correspondence between graph edges and template sub-parts. Because the proposed approach can estimate reasonable correspondences between a target object and a template, it is possible to extract the target object robustly in the presence of shape deformation and background clutter. The experiments on various examples are also presented to verify the performance of proposed method.
Archive | 2010
Howon Kim; Seong-Jae Lim; Hanbyul Joo; Hyun Seo Kang; Bon-Ki Koo; Chang-Woo Chu
Archive | 2010
Seong Jae Lim; Ho Won Kim; Hanbyul Joo; Bon Ki Koo
Archive | 2012
Seong Jae Lim; Hanbyul Joo; Ji Hyung Lee; Bon-Ki Koo