Kentaro Ishizuka
Nippon Telegraph and Telephone
Network
Latest external collaboration on country level. Dive into details by clicking on the dots.
Publication
Featured researches published by Kentaro Ishizuka.
international conference on multimodal interfaces | 2008
Kazuhiro Otsuka; Shoko Araki; Kentaro Ishizuka; Masakiyo Fujimoto; Martin Heinrich; Junji Yamato
This paper presents a realtime system for analyzing group meetings that uses a novel omnidirectional camera-microphone system. The goal is to automatically discover the visual focus of attention (VFOA), i.e. who is looking at whom, in addition to speaker diarization, i.e. who is speaking and when. First, a novel tabletop sensing device for round-table meetings is presented; it consists of two cameras with two fisheye lenses and a triangular microphone array. Second, from high-resolution omnidirectional images captured with the cameras, the position and pose of peoples faces are estimated by STCTracker (Sparse Template Condensation Tracker); it realizes realtime robust tracking of multiple faces by utilizing GPUs (Graphics Processing Units). The face position/pose data output by the face tracker is used to estimate the focus of attention in the group. Using the microphone array, robust speaker diarization is carried out by a VAD (Voice Activity Detection) and a DOA (Direction of Arrival) estimation followed by sound source clustering. This paper also presents new 3-D visualization schemes for meeting scenes and the results of an analysis. Using two PCs, one for vision and one for audio processing, the system runs at about 20 frames per second for 5-person meetings.
international conference on multimodal interfaces | 2009
Kazuhiro Otsuka; Shoko Araki; Dan Mikami; Kentaro Ishizuka; Masakiyo Fujimoto; Junji Yamato
This demo presents a realtime system for analyzing group meetings. Targeting round-table meetings, this system employs an omnidirectional camera-microphone system. The goal of this system is to automatically discover who is talking to whom and when. To that purpose, the face pose/position of meeting participants are tracked on panorama images acquired from fisheye-based omnidirectional cameras. From audio signals obtained with microphone array, speaker diarization, i.e. the estimation of who is speaking and when, is carried out. The visual focus of attention, i.e. who is looking at whom, is esimated from the result of face tracking. The results are displayed based on a 3D visualization scheme. The advantage of our system is its realtimeness. We will demonstrate the portable version of the system consisting of two laptop PCs. In addition, we will showcase our meeting playback viewer with man-machine interfaces that allow users to freely control space and time of meeting scenes. With this viewer, users can also experince 3D positional sound effect linked with 3D viewpoint, using enhanced audio tracks for each participant.
international conference on multimodal interfaces | 2007
Yasuhiro Minami; Minako Sawaki; Kohji Dohsaka; Ryuichiro Higashinaka; Kentaro Ishizuka; Hideki Isozaki; Tatsushi Matsubayashi; Masato Miyoshi; Atsushi Nakamura; Takanobu Oba; Hiroshi Sawada; Takeshi Yamada; Eisaku Maeda
Our new research project called ambient intelligence concentrates on the creation of new lifestyles through research on communication science and intelligence integration. It is premised on the creation of such virtual communication partners as fairies and goblins that can be constantly at our side. We call these virtual communication partners mushrooms.n To show the essence of ambient intelligence, we developed two multimodal prototype systems: mushrooms that watch, listen, and answer questions and a Quizmaster Mushroom. These two systems work in real time using speech, sound, dialogue, and vision technologies.n We performed preliminary experiments with the Quizmaster Mushroom. The results showed that the system can transmit knowledge to users while they are playing the quizzes.n Furthermore, through the two mushrooms, we found policies for design effects in multimodal interface and integration.
Archive | 2006
Akiko Araki; Masakiyo Fujimoto; Kentaro Ishizuka; Kazuhiro Otsuka; Hiroshi Sawada; 和弘 大塚; 宏 澤田; 健太郎 石塚; 章子 荒木; 雅清 藤本
Archive | 2008
Akiko Araki; Masakiyo Fujimoto; Kentaro Ishizuka; Shoji Makino; 昭二 牧野; 健太郎 石塚; 章子 荒木; 雅清 藤本
Archive | 2007
Masakiyo Fujimoto; Kentaro Ishizuka; Tomohiro Nakatani; 智広 中谷; 健太郎 石塚; 雅清 藤本
Archive | 2013
Takaaki Matsuo; 隆明 松尾; Kentaro Ishizuka; 健太郎 石塚
Archive | 2009
Akiko Araki; Masakiyo Fujimoto; Kentaro Ishizuka; Tomohiro Nakatani; 智広 中谷; 健太郎 石塚; 章子 荒木; 雅清 藤本
Archive | 2008
Akiko Araki; Kentaro Ishizuka; Tatsuya Kawahara; 達也 河原; 健太郎 石塚; 章子 荒木
Archive | 2007
Kentaro Ishizuka; Hiroshi Sawada; Shoko Araki; Tomohiro Nakatani