Masaru Sugano | Researchain

Archive Network Publication Hotspot Collaboration

Network

Latest external collaboration on country level. Dive into details by clicking on the dots.

Explore More

Hotspot

Dive into the research topics where Masaru Sugano is active.

Explore More

Publication

Featured researches published by Masaru Sugano.

international conference on acoustics speech and signal processing | 1998

MPEG audio bit rate scaling on coded data domain

Yasuyuki Nakajima; Hiromasa Yanagihara; Akio Yoneyama; Masaru Sugano

Formerly, once the audio data is compressed, transcoding is used to scale the bit rate, where decoding and re-encoding are taking place. Therefore, data manipulation of coded data has been very complex and time consuming work. We describe three algorithms for bit rate scaling in the coded MPEG data domain. One is a bandwidth limitation method cutting higher frequency components until the target data rate is satisfied. The other two use a re-quantization process where a quantization step in each subband is modified. One of them reflects the psychoacoustic model from bit allocation information obtained in the bitstream in order to improve the bit rate scaling efficiency. The simulation results show that the re-quantization process provides a very high conversion efficiency and a nearly equal sound quality can be obtained as directly coding from PCM by reflecting the psychoacoustic model. It is also shown that a very fast scaling (factor of six) have been achieved when compared with the transcoding method.

international conference on image processing | 1998

A fast scene change detection on MPEG coding parameter domain

Masaru Sugano; Yasuyuki Nakajima; Hiromasa Yanagihara; Akio Yoneyama

This paper proposes a very fast and accurate scene change detection algorithm on MPEG coding parameter domain. The fast operation can be obtained by spatio-temporally subsampling coding information and by exploiting only coding parameters extracted in variable length decoding (VLD) stage, while the accurate detection is accomplished by examining the statistical characteristics of various scene changes on the coding parameter domain. The computer simulation shows that the proposed algorithm can accomplish detection more than 5 times faster than that of real-time playback for MPEG-2 video sequences using a standard workstation. It is also shown that most of abrupt scene changes, dissolve transitions, and wipe transitions have been successfully detected.

multimedia signal processing | 1999

MPEG encoding algorithm with scene adaptive dynamic GOP structure

Akio Yoneyama; Yasuyuki Nakajima; Hiromasa Yanagihara; Masaru Sugano

Although a GOP (group of picture) length is often fixed in MPEG, this may not guarantee the best picture quality from the point of view of coding efficiency. Furthermore, it is also well known that P-picture interval M=3 does not provide the best quality for all sequences. We propose an MPEG encoding algorithm with scene adaptive dynamic GOP structure to enhance the picture quality in the cases mentioned above. Using macroblock characteristics such as luminance activity and simple motion compensation on the activity domain, N and M values are dynamically determined.

visual communications and image processing | 1998

Moving-object detection from MPEG coded data

Yasuyuki Nakajima; Akio Yoneyama; Hiromasa Yanagihara; Masaru Sugano

We describe a method of moving object detection directly from MPEG coded data. Since motion information in MPEG coded data is determined in terms of coding efficiency point of view, it does not always provide real motion information of objects. We use a wide variety of coding information including motion vectors and DCT coefficients to estimate real object motion. Since such information can be directly obtained from coded bitstream, very fast operation can be expected. Moving objects are detected basically analyzing motion vectors and spatio-temporal correlation of motion in P-, and B-pictures. Moving objects are also detected in intra macroblocks by analyzing coding characteristics of intra macroblocks in P- and B-pictures and by investigating temporal motion continuity in I-pictures. The simulation results show that successful moving object detection has been performed on macroblock level using several test sequences. Since proposed method is very simple and requires much less computational power than the conventional object detection methods, it has a significant advantage as motion analysis tool.

international conference on image processing | 1999

Moving object detection and identification from MPEG coded data

Akio Yoneyama; Yasuyuki Nakajima; Hiromasa Yanagihara; Masaru Sugano

Moving object detection is a useful tool for such purposes as intelligent video browsing/analysis and video surveillance systems. Previously several algorithms have been reported on moving object detection from compressed video data. However, in these algorithms, flat regions are often falsely detected. In addition if multiple moving objects are involved, each moving object may not be identified and therefore object tracking becomes very difficult. We describe a fast moving detection and identification method from MPEG coded data. The experimental results show that each moving object is successfully identified for several surveillance sequences.

international conference on image processing | 2002

Automated MPEG audio-video summarization and description

Masaru Sugano; Yasuyuki Nakajima; Hiromasa Yanagihara

The paper proposes a novel method for automatically summarizing MPEG coded audio-video content in the compressed domain, and describing audio-video summaries in the MPEG-7 standard format. Semantically important features for summary generation are carefully analyzed, and two types of summaries, digest and highlight, are extracted. A digest extraction is based on an audio level estimation and visual features analysis, where digest shots are adaptively determined for user-specified duration by utilizing the MPEG-7 based motion features. A highlight-based summarization from TV broadcast sports programmes is achieved by analyzing audio class and audio level. The extracted summaries and related features are described by the MPEG-7 description tools. Experimental results show that the digests and highlights can be successfully extracted in various kinds of contents without a priori knowledge of content characteristics.

international conference on image processing | 2003

Shot genre classification using compressed audio-visual features

Masaru Sugano; R. Isaksson; Yasuyuki Nakajima; Hiromasa Yanagihara

This paper proposes shot genre classification from MPEG compressed movies, as one of the high-level indexing methods for audio-visual contents. Through statistical analysis of low-level and mid-level audio-visual features on compressed domain, the proposed method can achieve subjectively accurate shot classification within the movies into predefined genre set, which can be applied to various content handling applications, such as summarization, navigation, editing, filtering, and so on. By feeding subjectively evaluated feature set for each shot genre into the linear machine decision tree classifier, each shot is classified at very low cost. The experimental results show that most of the shots in the movies can be classified into subjectively accurate genres, and also the dominant shot genre can correctly resolve each movie genre.

international conference on consumer electronics | 2000

Scalable video decoder and its application to multi-channel multicast system

Hiromasa Yanagihara; Masaru Sugano; Akio Yoneyama; Yasuyuki Nakajima

In this paper, we propose scalable video decoder algorithms using several DCT manipulations such as low pass filtering and resolution conversion in the DCT domain, and their application to a multi-channel multicast system. In the experiment, it has been shown that simultaneous decoding of tens of channels in an ordinary PC can be realized by selecting an appropriate decoding scheme.

Systems and Computers in Japan | 1999

Moving object detection from MPEG video stream

Akio Yoneyama; Yasuyuki Nakajima; Hiromasa Yanagihara; Masaru Sugano

Moving object detection is a useful tool for the manipulation of video contents for such purposes as intelligent video browsing/analysis and video surveillance systems. There are a variety of methods to detect moving objects not only on pixel domain data but also on coded data domain. In this article, we propose an effective moving object detection method from MPEG-coded data. This method utilizes motion vectors and DCT coefficient data in the MPEG streams. Since motion vectors and DCT coefficients can be obtained without full decoding of original MPEG stream, fast detection can be expected. However, motion vector information does not always represent the real object motion. In addition, there are cases where no motion information is provided from coded data, such as the case of intracoded picture. It is shown in the experiments that the proposed method can detect moving objects successfully in such cases.

international conference on image processing | 2001

An efficient transcoding from MPEG-2 to MPEG-1

Masaru Sugano; Yasuyuki Nakajima; Hiromasa Yanagihara; Akio Yoneyama

This paper proposes a fast and efficient transcoding algorithm from MPEG-2 video to MPEG-1 video where spatial resolution reduction is involved. We use quantization parameters along with motion information of the incoming MPEG-2 video bitstream to estimate the motion vectors for MPEG-1 transcoding without any additional motion estimation process. Since such information can be directly extracted from the original MPEG-2 video bitstream, very fast transcoding can be achieved. In addition, since the proposed method is not restricted to a specific coding structure, it can be applied to various types of GOP structures. The experimental results show that the proposed method can provide approximate motion vectors to that of full range motion estimation. therefore transcoding efficiency can be increased and picture quality is enhanced when compared with the conventional fast transcoding methods.

Explore More