2021 International Conference on Intelligent Technologies (CONIT) | 2021

A study on transformer-based Object Detection

Abstract

This paper focuses on transformers based end-to-end object detection methods. End to end object detection is a new paradigm that has got attention in recent times. It does not require complex hand-engineered components such as non-max suppression to detect objects inside an image. Various methods are proposed to date to enhance fully end-to-end object detectors, most of them are based on the attention mechanism. In this work, we analyze some algorithms which involve transformers for the purpose of object detection. We discuss end-to-end models in which we have focused on Adaptive clustering-based transformers which solve attention encoder redundancy, Deformable Detection Transformers in which the attention module attends a limited collection of key sampling points, Unsupervisedly pre-trained Detection Transformers which are pre-trained on random query patches from the given image to improve accuracy and finally the Transformer-based Set Prediction using FCOS. These enhanced models not only improve the mean average precision of the model but also improves the total convergence time.

Volume None

2021 International Conference on Intelligent Technologies (CONIT) | 2021

A study on transformer-based Object Detection

Abstract

Volume None

Pages 1-6

DOI 10.1109/CONIT51480.2021.9498550

Language English

Journal 2021 International Conference on Intelligent Technologies (CONIT)

Full Text