Archive | 2019

Improving Ancient Cham Glyph Recognition from Cham Inscription Images Using Data Augmentation and Transfer Learning

 
 
 
 
 

Abstract


Ancient Cham glyphs have mostly appeared in inscriptions on stones at some museums in Vietnam. Unfortunately, these inscriptions are being abrasive by the time. To conserve Cham heritage as well as to make them widely accessible and readable by users, digitization and recognition of ancient Cham glyphs become necessary. In our previous work, we have built the first dataset of champ inscription images, manually segmented them in glyphs and annotated by an ancient Cham expert. We adapted some automatic recognition methods and conducted experiments on the manually denoised dataset. The aim of this paper is to extend that earlier research to work on noising data. To this end, we face two main issues. Firstly, the current pre-built dataset is still small which is usually a main drawback for deep learning based methods. Therefore, some data augmentation techniques will be evaluated and investigated to increase the number and variation of samples in the dataset. Second, even with the augmented dataset, the fact of training a deep model from scratch could be very long and sometimes cannot meet a good local minimum. Therefore, we use a simple transfer learning procedure which inherits knowledge from similar or of the same family language. Experiments on both the raw test set and its denoised version show very promising results (\\(64.4\\%\\) and \\(88.5\\%\\) of F1-score on two test sets respectively).

Volume None
Pages 115-125
DOI 10.1007/978-3-030-30754-7_12
Language English
Journal None

Full Text