2021 IEEE 32nd International Conference on Application-specific Systems, Architectures and Processors (ASAP) | 2021

An Efficient Real-Time Object Detection Framework on Resource-Constricted Hardware Devices via Software and Hardware Co-design

 
 
 
 
 
 

Abstract


The fast development of object detection techniques has attracted attention to developing efficient Deep Neural Networks (DNNs). However, the current state-of-the-art DNN models can not provide a balanced solution among accuracy, speed, and model size. This paper proposes an efficient real-time object detection framework on resource-constricted hardware devices through hardware and software co-design. The Tensor Train (TT) decomposition is proposed for compressing the YOLOv5 model. By unitizing the unique characteristics given by the TT decomposition, we develop an efficient hardware accelerator based on FPGA devices. Experimental results show that the proposed method can significantly reduce the model size and improve the execution time.

Volume None
Pages 77-84
DOI 10.1109/ASAP52443.2021.00020
Language English
Journal 2021 IEEE 32nd International Conference on Application-specific Systems, Architectures and Processors (ASAP)

Full Text