2019 IEEE International Conference on Image Processing (ICIP) | 2019

Adaptive Hard Example Mining for Image Captioning

Abstract

Reinforcement Learning (RL) based methods optimize evaluation metric directly in image captioning task. In these methods, metric scores of captions are regard as rewards for examples. However, existing methods suffer from inferior performances on hard examples. In this paper, we propose an adaptive hard example mining method with additional supervised training for image captioning. Beam search algorithm is leveraged to estimate score expectation for each example. Examples whose caption scores are lower than expectation are selected automatically. For the selected hard examples, we propose an additional reward policy for high-scoring captions to force model learning from them. The proposed method is hyper-parameter free without tuning. Experimental results on MSCOCO dataset validate effectiveness of the proposed method.

Volume None

2019 IEEE International Conference on Image Processing (ICIP) | 2019

Adaptive Hard Example Mining for Image Captioning

Abstract

Volume None

Pages 3342-3346

DOI 10.1109/ICIP.2019.8803418

Language English

Journal 2019 IEEE International Conference on Image Processing (ICIP)

Full Text