2020 25th International Conference on Pattern Recognition (ICPR) | 2021

Learning to Sort Handwritten Text Lines in Reading Order through Estimated Binary Order Relations

 
 

Abstract


Recent advances in Handwritten Text Recognition and Document Layout Analysis make it possible to extract information from digitized documents and make them accessible beyond the archive shelves. But the reading order of the elements in those documents still is an open problem that has to be solved in order to provide that information with the correct structure. Most of the studies on the reading order task are rule-base approaches that focus on printed documents, while less attention has been paid to handwritten text documents. In this work we propose a new approach to automatically determine the reading order of text lines in handwritten text documents. The task is approached as a sorting problem where the order-relation operator is learned directly from examples. We demonstrate the effectiveness of our method on three different datasets.

Volume None
Pages 7661-7668
DOI 10.1109/ICPR48806.2021.9413256
Language English
Journal 2020 25th International Conference on Pattern Recognition (ICPR)

Full Text