2020 25th International Conference on Pattern Recognition (ICPR) | 2021

IBN-STR: A Robust Text Recognizer for Irregular Text in Natural Scenes

 
 
 
 

Abstract


Although text recognition methods based on deep neural networks show promising performance, challenges remain because of varied text styles, perspective distortion, and strongly curved text. To obtain a robust text recognizer, we improve performance in two respects: data and feature representation. On the data side, we transform the input images into S-shape distorted images to increase the diversity of the training data, and we explore the effects of different training data. On the feature representation side, combining instance normalization with batch normalization improves the model's capacity and generalization ability. This paper proposes IBN-STR, a robust attention-based scene text recognizer. Through extensive experiments, we analyze and compare the model with respect to both data and feature representation, and we verify the effectiveness of IBN-STR on both regular and irregular text instances. Furthermore, IBN-STR is an end-to-end recognition system that can achieve state-of-the-art performance.
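The combination of instance normalization and batch normalization mentioned above can be illustrated with a minimal NumPy sketch. The abstract does not say how the two are combined, so the channel split used here (instance normalization on the first part of the channels, batch normalization on the rest, in the style of IBN-Net) is an assumption, as are the function name `ibn_normalize` and the `in_ratio` parameter. Learned scale and shift parameters are omitted.

```python
import numpy as np

def ibn_normalize(x, in_ratio=0.5, eps=1e-5):
    """Hypothetical IBN-style normalization for a feature map x of shape (N, C, H, W).

    Assumption: the first `in_ratio` fraction of channels is instance-normalized
    and the remaining channels are batch-normalized (training-mode statistics).
    """
    n, c, h, w = x.shape
    c_in = int(c * in_ratio)  # number of channels given instance normalization
    x_in, x_bn = x[:, :c_in], x[:, c_in:]

    # Instance normalization: statistics per sample and per channel, over H and W.
    mu_in = x_in.mean(axis=(2, 3), keepdims=True)
    var_in = x_in.var(axis=(2, 3), keepdims=True)
    out_in = (x_in - mu_in) / np.sqrt(var_in + eps)

    # Batch normalization: statistics per channel, over the batch and H and W.
    mu_bn = x_bn.mean(axis=(0, 2, 3), keepdims=True)
    var_bn = x_bn.var(axis=(0, 2, 3), keepdims=True)
    out_bn = (x_bn - mu_bn) / np.sqrt(var_bn + eps)

    # Reassemble the two halves along the channel axis.
    return np.concatenate([out_in, out_bn], axis=1)

# Usage: a random batch of 4 feature maps with 8 channels.
rng = np.random.default_rng(0)
features = rng.normal(loc=3.0, scale=2.0, size=(4, 8, 16, 32))
normalized = ibn_normalize(features)
```

In this sketch the instance-normalized channels remove per-image statistics such as contrast and style, while the batch-normalized channels keep statistics shared across the batch, which is one plausible reading of why the combination would help generalization.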

Pages 9522-9528
DOI 10.1109/ICPR48806.2021.9412775
Language English
Journal 2020 25th International Conference on Pattern Recognition (ICPR)
