Yutaka Ota
Toshiba
Network
Latest external collaboration on country level. Dive into details by clicking on the dots.
Publication
Featured researches published by Yutaka Ota.
asia and south pacific design automation conference | 2010
Hiroaki Tanaka; Yutaka Ota; Nobu Matsumoto; Takuji Hieda; Yoshinori Takeuchi; Masaharu Imai
Although SIMD instructions are effective for many digital signal processing applications, current compilers cannot take full advantage of SIMD instructions. One factor inhibiting SIMD code generation is control flow structure; the target scope of SIMD code generation is currently limited to single basic block or loop that consists of single basic block. SIMD instructions cannot be mapped typically across basic block boundaries even if basic blocks inside the control structure have enough parallelism. In this paper, a new compilation technique to generate SIMD code without modifying control flow structure is proposed. The data dependency between basic blocks is exploited to generate SIMD instructions. The packing cost is introduced for effective vectorization to maintain data dependency across basic block boundaries. Experimental results show that the new SIMD code generation technique reduced 67% of dynamic execution cycles of inter prediction in H.264 decoder.
IEICE Transactions on Fundamentals of Electronics, Communications and Computer Sciences | 2007
Hiroaki Tanaka; Yoshinori Takeuchi; Keishi Sakanushi; Masaharu Imai; Hiroki Tagawa; Yutaka Ota; Nobu Matsumoto
SIMD instructions are often implemented in modern multimedia oriented processors. Although SIMD instructions are useful for many digital signal processing applications, most compilers do not exploit SIMD instructions. The difficulty in the utilization of SIMD instructions stems from data parallelism in registers. In assembly code generation, the positions of data in registers must be noted. A technique of generating pack instructions which pack or reorder data in registers is essential for exploitation of SIMD instructions. This paper presents a code generation technique for SIMD instructions with pack instructions. SIMD instructions are generated by finding and grouping the same operations in programs. After the SIMD instruction generation, pack instructions are generated. In the pack instruction generation, Multi-valued Decision Diagram (MDD) is introduced to represent and to manipulate sets of packed data. Experimental results show that the proposed code generation technique can generate assembly code with SIMD and pack instructions performing repacking of 8 packed data in registers for a RISC processor with a dual-issue coprocessor which supports SIMD and pack instructions. The proposed method achieved speedup ratio up to about 8.5 by SIMD instructions and multiple-issue mechanism of the target processor.
international conference on hardware/software codesign and system synthesis | 2006
Nobu Matsumoto; Yutaka Ota; Masaharu Imai; Keishi Sakanushi; Yoshinori Takeuchi; Masaki Nakagawa; Tanaka Hiroaki
SIMD instructions are often implemented in modern multimedia oriented processors. Although SIMD instructions are useful for many digital signal processing applications, most compilers do not exploit SIMD instructions. The difficulty in the utilization of SIMD instructions stems from data parallelism in registers. In assembly code generation, the positions of data in registers must be noted. A technique of generating pack instructions which pack or reorder data in registers is essential for exploitation of SIMD instructions. This paper presents a code generation technique for SIMD instructions with pack instructions. SIMD instructions are generated by finding and grouping the same operations in programs. After the SIMD instruction generation, pack instructions are generated. In the pack instruction generation, multi-valued decision diagram (MDD) is introduced to represent and to manipulate sets of packed data. Experimental results show that our code generation technique can generate assembly code with SIMD and pack instructions performing complex repacking of 8 packed data in registers for a commercial VLIW processor with 6 pack instructions and achieved speedup ratio of up to 7.7.
Archive | 2004
Yutaka Ota
Archive | 2008
Ken Tanabe; Yutaka Ota; Nobu Matsumoto
Archive | 2005
Kazuhyoshi Kohno; Atsushi Mizuno; Atsushi Masuda; Ryuichiro Ohyama; Yutaka Ota
Archive | 2006
Hiroaki Nishi; Nobu Matsumoto; Yutaka Ota
Archive | 2006
Yutaka Ota; Atsushi Mizuno
Archive | 2007
Nobu Matsumoto; Yutaka Ota; Takeshi Tanabe; 裕 太田; 展 松本; 健 田辺
Archive | 2010
Yuji Ishikawa; Yutaka Ota; Yu Nakanishi