Archive | 2021

UNBNLP at SemEval-2021 Task 1: Predicting lexical complexity with masked language models and character-level encoders

 
 
 
 

Abstract


In this paper, we present three supervised systems for English lexical complexity prediction of single and multiword expressions for SemEval-2021 Task 1. We explore the use of statistical baseline features, masked language models, and character-level encoders to predict the complexity of a target token in context. Our best system combines information from these three sources. The results indicate that information from masked language models and character-level encoders can be combined to improve lexical complexity prediction.

Volume None
Pages 650-654
DOI 10.18653/v1/2021.semeval-1.83
Language English
Journal None

Full Text