Publication


Featured research published by Kentaro Ishizuka.


International Conference on Multimodal Interfaces | 2008

A realtime multimodal system for analyzing group meetings by combining face pose tracking and speaker diarization

Kazuhiro Otsuka; Shoko Araki; Kentaro Ishizuka; Masakiyo Fujimoto; Martin Heinrich; Junji Yamato

This paper presents a realtime system for analyzing group meetings that uses a novel omnidirectional camera-microphone system. The goal is to automatically discover the visual focus of attention (VFOA), i.e., who is looking at whom, in addition to speaker diarization, i.e., who is speaking and when. First, a novel tabletop sensing device for round-table meetings is presented; it consists of two cameras with two fisheye lenses and a triangular microphone array. Second, from high-resolution omnidirectional images captured with the cameras, the position and pose of people's faces are estimated by STCTracker (Sparse Template Condensation Tracker), which achieves robust realtime tracking of multiple faces by utilizing GPUs (Graphics Processing Units). The face position/pose data output by the face tracker is used to estimate the focus of attention in the group. Using the microphone array, robust speaker diarization is carried out by voice activity detection (VAD) and direction of arrival (DOA) estimation followed by sound source clustering. This paper also presents new 3-D visualization schemes for meeting scenes and the results of an analysis. Using two PCs, one for vision and one for audio processing, the system runs at about 20 frames per second for 5-person meetings.


International Conference on Multimodal Interfaces | 2009

Realtime meeting analysis and 3D meeting viewer based on omnidirectional multimodal sensors

Kazuhiro Otsuka; Shoko Araki; Dan Mikami; Kentaro Ishizuka; Masakiyo Fujimoto; Junji Yamato

This demo presents a realtime system for analyzing group meetings. Targeting round-table meetings, this system employs an omnidirectional camera-microphone system. The goal of this system is to automatically discover who is talking to whom and when. To that end, the face positions and poses of meeting participants are tracked on panorama images acquired from fisheye-based omnidirectional cameras. From audio signals obtained with a microphone array, speaker diarization, i.e., the estimation of who is speaking and when, is carried out. The visual focus of attention, i.e., who is looking at whom, is estimated from the result of face tracking. The results are displayed based on a 3D visualization scheme. The advantage of our system is that it runs in real time. We will demonstrate the portable version of the system consisting of two laptop PCs. In addition, we will showcase our meeting playback viewer with man-machine interfaces that allow users to freely control the space and time of meeting scenes. With this viewer, users can also experience 3D positional sound effects linked with the 3D viewpoint, using enhanced audio tracks for each participant.


International Conference on Multimodal Interfaces | 2007

The world of mushrooms: human-computer interaction prototype systems for ambient intelligence

Yasuhiro Minami; Minako Sawaki; Kohji Dohsaka; Ryuichiro Higashinaka; Kentaro Ishizuka; Hideki Isozaki; Tatsushi Matsubayashi; Masato Miyoshi; Atsushi Nakamura; Takanobu Oba; Hiroshi Sawada; Takeshi Yamada; Eisaku Maeda

Our new research project called ambient intelligence concentrates on the creation of new lifestyles through research on communication science and intelligence integration. It is premised on the creation of such virtual communication partners as fairies and goblins that can be constantly at our side. We call these virtual communication partners mushrooms. To show the essence of ambient intelligence, we developed two multimodal prototype systems: mushrooms that watch, listen, and answer questions and a Quizmaster Mushroom. These two systems work in real time using speech, sound, dialogue, and vision technologies. We performed preliminary experiments with the Quizmaster Mushroom. The results showed that the system can transmit knowledge to users while they are playing the quizzes. Furthermore, through the two mushrooms, we found policies for design effects in multimodal interface and integration.


Archive | 2006

DEVICE FOR DETERMINING VOICED SOUND INTERVAL OF MULTIPLE SOUND SOURCES, METHOD AND PROGRAM THEREFOR, AND ITS RECORDING MEDIUM

Shoko Araki; Masakiyo Fujimoto; Kentaro Ishizuka; Kazuhiro Otsuka; Hiroshi Sawada; 和弘 大塚; 宏 澤田; 健太郎 石塚; 章子 荒木; 雅清 藤本


Archive | 2008

Multiple signal sections estimation device and its method, and program and its recording medium

Shoko Araki; Masakiyo Fujimoto; Kentaro Ishizuka; Shoji Makino; 昭二 牧野; 健太郎 石塚; 章子 荒木; 雅清 藤本


Archive | 2007

Device, method and program for estimating voice signal section, and storage medium recording the program

Masakiyo Fujimoto; Kentaro Ishizuka; Tomohiro Nakatani; 智広 中谷; 健太郎 石塚; 雅清 藤本


Archive | 2013

DEVICE AND METHOD FOR TESTING TRANSMISSION LINE FAILURE

Takaaki Matsuo; 隆明 松尾; Kentaro Ishizuka; 健太郎 石塚


Archive | 2009

A PLURALITY OF SIGNALS EMPHASIZING DEVICE AND METHOD AND PROGRAM THEREFOR

Shoko Araki; Masakiyo Fujimoto; Kentaro Ishizuka; Tomohiro Nakatani; 智広 中谷; 健太郎 石塚; 章子 荒木; 雅清 藤本


Archive | 2008

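Target signal section estimation device, target signal section estimation method, target signal section estimation program, and recording medium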

Shoko Araki; Kentaro Ishizuka; Tatsuya Kawahara; 達也 河原; 健太郎 石塚; 章子 荒木


Archive | 2007

TWO-MICROPHONE VOICE ACTIVITY DETECTION BASED ON THE HOMOGENEITY OF THE DIRECTION OF ARRIVAL ESTIMATES

Kentaro Ishizuka; Hiroshi Sawada; Shoko Araki; Tomohiro Nakatani

Collaboration


Dive into Kentaro Ishizuka's collaborations.

Top Co-Authors

Hiroshi Sawada

Nippon Telegraph and Telephone

Kazuhiro Otsuka

Nippon Telegraph and Telephone

Shoko Araki

Nippon Telegraph and Telephone

Atsushi Nakamura

Nippon Telegraph and Telephone

Eisaku Maeda

Nippon Telegraph and Telephone

Hideki Isozaki

Nippon Telegraph and Telephone
