2019 56th ACM/IEEE Design Automation Conference (DAC) | 2019

Exact and Heuristic Allocation of MuIti-kernel Applications to Multi-FPGA Platforms

 
 
 
 
 

Abstract


FPGA-based accelerators demonstrated high energy efficiency compared to GPUs and CPUs. However, single FPGA designs may not achieve sufficient task parallelism. In this work, we optimize the mapping of high-performance multi-kernel applications, like Convolutional Neural Networks, to multi-FPGA platforms. First, we formulate the system level optimization problem, choosing within a huge design space the parallelism and number of compute units for each kernel in the pipeline. Then we solve it using a combination of Geometric Programming, producing the optimum performance solution given resource and DRAM bandwidth constraints, and a heuristic allocator of the compute units on the FPGA cluster.

Volume None
Pages 1-6
DOI 10.1145/3316781.3317821
Language English
Journal 2019 56th ACM/IEEE Design Automation Conference (DAC)

Full Text