2020 25th International Conference on Pattern Recognition (ICPR) | 2021

Weakly Supervised Attention Rectification for Scene Text Recognition


Abstract


Scene text recognition has become a hot topic in recent years due to its rapidly growing real-life applications. The attention-based encoder-decoder framework has become one of the most popular approaches, especially in the irregular text scenario. However, the “attention drift” problem hinders the recognition performance of most existing attention-based scene text recognition methods. To solve this problem, we propose an auxiliary supervision branch alongside the attention-based encoder-decoder framework. A new loss function is designed to refine the feature map and to help the attention region align with the target character area. Compared with existing attention rectification mechanisms, our method does not require character-level annotations or introduce any additional trainable parameters. Furthermore, our method can improve the performance of both RNN-Attention and Scaled Dot-Product Attention. Experimental results on various benchmarks demonstrate that the proposed approach outperforms state-of-the-art methods in both regular and irregular text recognition scenarios.

Pages 779-786
DOI 10.1109/ICPR48806.2021.9412037
Language English
Journal 2020 25th International Conference on Pattern Recognition (ICPR)
