Developing learner corpus annotation for Chinese grammatical errors

2016 International Conference on Asian Language Processing (IALP)(2016)

引用 7|浏览30
暂无评分
摘要
This study describes the construction of the TOCFL (Test Of Chinese as a Foreign Language) learner corpus, including the collection and grammatical error annotation of 2,837 essays written by Chinese language learners originating from a total of 46 different mother-tongue languages. We propose hierarchical tagging sets to manually annotate grammatical errors, resulting in 33,835 inappropriate usages. Our built corpus has been provided for the shared tasks on Chinese grammatical error diagnosis. These demonstrate the usability of our learner corpus annotation.
更多
查看译文
关键词
error schema,error tagging,second language acquisition,interlanguage,grammatical error diagnosis,computer-assisted language learning
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要