IEEE Transactions on Image Processing | 2021

Robust Text Image Recognition via Adversarial Sequence-to-Sequence Domain Adaptation

 
 
 
 

Abstract


Robust text reading is a very challenging problem, due to the distribution of text images changing significantly in real-world scenarios. One effective solution is to align the distribution between different domains by domain adaptation methods. However, we found that these methods might struggle when dealing sequence-like text images. An important reason is that conventional domain adaptation methods strive to align images as a whole, while text images consist of variable-length fine-grained character information. To address this issue, we propose a novel Adversarial Sequence-to-Sequence Domain Adaptation (ASSDA) method to learn “where to adapt” and “how to align” the sequential image. Our key idea is to mine the local regions that contain characters, and focus on aligning them across domains in an adversarial manner. Extensive text recognition experiments show the ASSDA could efficiently transfer sequence knowledge and validate the promising power towards the various domain shift in the real world applications.

Volume 30
Pages 3922-3933
DOI 10.1109/TIP.2021.3066903
Language English
Journal IEEE Transactions on Image Processing

Full Text