Morphology-rich Alphasyllabary Embeddings.

LREC (2020)

Abstract
Word embeddings have been successfully trained in many languages. However, both intrinsic and extrinsic metrics vary across languages, especially for languages that depart significantly from English in morphology and orthography. This study focuses on building a word embedding model suitable for Amharic, a Semitic language of Ethiopia that is both morphologically rich and written as an alphasyllabary (abugida) rather than an alphabet. We compare embeddings from tailored neural models, simple pre-processing steps, off-the-shelf baselines, and parallel tasks on a better-resourced Semitic language, Arabic. Experiments show our model's performance on word analogy tasks, illustrating the divergent objectives of morphological vs. semantic analogies.
Keywords
morphology-rich