Exploring a Choctaw Language Corpus with Word Vectors and Minimum Distance Length.

Jacqueline Brixey, David J. Sides, Timothy Vizthum,David R. Traum,Khalil Iskarous

LREC(2020)

引用 0|浏览0
暂无评分
摘要
This work introduces additions to the corpus ChoCo (Choctaw language Corpus), a multimodal corpus for the American indigenous language Choctaw. Using texts from the corpus, we develop new computational resources by using two off-the-shelf tools: word2vec and Linguistica. Our results indicate these tools need expert input to reliably interpret the results.
更多
查看译文
关键词
endangered languages, indigenous language, low resource languages, Choctaw, Native American languages
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要