Kiyokuni Kawachiya | Researchain

Archive Network Publication Hotspot Collaboration

Network

Latest external collaboration on country level. Dive into details by clicking on the dots.

Explore More

Hotspot

Dive into the research topics where Kiyokuni Kawachiya is active.

Explore More

Publication

Featured researches published by Kiyokuni Kawachiya.

international conference on performance engineering | 2018

Involving CPUs into Multi-GPU Deep Learning

Tung D. Le; Taro Sekiyama; Yasushi Negishi; Haruki Imai; Kiyokuni Kawachiya

The most important part of deep learning, training the neural network, often requires the processing of a large amount of data and can takes days to complete. Data parallelism is widely used for training deep neural networks on multiple GPUs in a single machine thanks to its simplicity. However, its scalability is bound by the number of data transfers, mainly for exchanging and accumulating gradients among the GPUs. In this paper, we present a novel approach to data parallel training called CPU-GPU data parallel (CGDP) training that utilizes free CPU time on the host to speed up the training in the GPUs. We also present a cost model for analyzing and comparing the performances of both the typical data parallel training and the CPU-GPU data parallel training. Using the cost model, we formally show why our approach is better than the typical one and clarify the remaining issues. Finally, we explain how we optimized CPU-GPU data parallel training by introducing chunks of layers and present a runtime algorithm that automatically finds a good configuration for the training. The algorithm is effective for very deep neural networks, which are the current trend in deep learning. Experimental results showed that we achieved speedups of

Archive | 2008

Method and system for speeding up mutual exclusion

Takeshi Ogasawara; Akira Koseki; Hideaki Komatsu; Kiyokuni Kawachiya; Tamiya Onodera

1.21

Archive | 1998

Manipulative pointing device, and portable information processing apparatus

Kiyokuni Kawachiya; Hiroshi Ishikawa

Archive | 2005

Efficient class memory management

Kiyokuni Kawachiya; Kazunori Ogata; Tamiya Onodera; Trent A. Gray-Donald

1.04

Archive | 1993

Message passing apparatus for determining if counted acknowledgements from a set of processors are within a defined range

Nobuyuki Ooba; Kiyokuni Kawachiya

Archive | 2001

Computer system, memory management method, storage medium and program transmission apparatus

Kiyokuni Kawachiya; Tamiya Onodera

1.21

Archive | 2009

IMAGE DRAWING SYSTEM, IMAGE DRAWING SERVER, IMAGE DRAWING METHOD, AND COMPUTER PROGRAM

Hiroshi Horii; Kiyokuni Kawachiya; Akira Koseki; Toshihiro Takahashi

and

Archive | 2011

METHOD AND SYSTEM FOR CODE GENERATION AND INLINING

Derek B. Inglis; Kiyokuni Kawachiya; Tamiya Onodera; Michiaki Tatsubori

1.07

Archive | 2008

Caching mechanism based on update frequency

Hiroshi Horii; Kiyokuni Kawachiya; Yosuke Ozawa; 洋堀井; 陽介小澤; 清久仁河内谷

for four state-of-the-art neural networks: AlexNet, GoogLeNet-v1, VGGNet-16, and ResNet-152, respectively. Weak scaling efficiency greater than

Archive | 1999

Method and apparatus for managing locks of objects and method and apparatus for unlocking objects

Tamiya Onodera; Kiyokuni Kawachiya

Explore More

Collaboration

Dive into the Kiyokuni Kawachiya's collaboration.

Top Co-Authors

Tamiya Onodera

IBM

View shared research outputs

Top Co-Authors

Kazunori Ogata

IBM

View shared research outputs

Top Co-Authors

Michiaki Tatsubori

IBM

View shared research outputs

Top Co-Authors

Hiroshi Horii

IBM

View shared research outputs

Top Co-Authors

Hiroshi Ishikawa

IBM

View shared research outputs

Top Co-Authors

Akira Koseki

IBM

View shared research outputs

Top Co-Authors

Derek B. Inglis

IBM

View shared research outputs

Top Co-Authors

Mikio Takeuchi

IBM

View shared research outputs

Top Co-Authors

Hideaki Komatsu

IBM

View shared research outputs

Top Co-Authors

Michihiro Horie

IBM

View shared research outputs

Explore More