Digital | 2021

Leveraging Vector Space Similarity for Learning Cross-Lingual Word Embeddings: A Systematic Review

 
 

Abstract


This article presents a systematic literature review on quantifying the proximity between independently trained monolingual word embedding spaces. The search was carried out in the broader context of inducing bilingual lexicons from cross-lingual word embeddings, especially for low-resource languages, and the returned articles were then classified. Cross-lingual word embeddings have drawn the attention of researchers in natural language processing (NLP). Although existing methods yield satisfactory results for resource-rich languages and languages related to them, some researchers have pointed out that the same is not true for low-resource and distant languages. In this paper, we report on the methods that have been proposed to provide better representations for low-resource and distant languages in the cross-lingual word embedding space.

DOI 10.3390/DIGITAL1030011
Language English
Journal Digital

Full Text