Neurocomputing | 2021

GCCNet: Grouped channel composition network for scene text detection

 
 
 
 
 
 
 

Abstract


Abstract Anchor mechanism is widely applied in scene text detection methods and demonstrates promising performance. However, existing anchor mechanisms have two major limitations, namely handcrafted anchor design and hard-wired anchor assignment. We propose a novel Grouped Channels Composition(GCC) block to achieve the data-driven anchor design and adaptive anchor assignment. To be more specific, our GCC block uses optimizable anchor functions rather than handcrafted ones to achieve data-drive anchor design. In our GCC block, an adaptive anchor assignment is achieved with the attention mechanism instead of empirically assigning anchor according to the Intersection Over Union (IoU) between ground truth and targets. We then build a corresponding network named GCCNet with our GCC blocks. We also propose a Unified Loss Weighting module to alleviate the inconsistency between classification score and localization accuracy. Experiments conducted on publicly available datasets demonstrate the state-of-the-art performance of our methods. keyword: Anchor, Scene Text Detection, Grouped Channel Composition Network

Volume 454
Pages 135-151
DOI 10.1016/J.NEUCOM.2021.04.095
Language English
Journal Neurocomputing

Full Text