Build Fast and Accurate Lemmatization for Arabic

Language Resources and Evaluation (2017)

Abstract
In this paper we describe the complexity of building a lemmatizer for Arabic, which has a rich and complex derivational morphology, and we discuss the need for fast and accurate lemmatization to enhance Arabic Information Retrieval (IR) results. We also introduce a new data set that can be used to test lemmatization accuracy, and an efficient lemmatization algorithm that outperforms state-of-the-art Arabic lemmatization in terms of accuracy and speed. We make the data set and the code publicly available.
Keywords
Arabic NLP, Lemmatization, Stemming, Information Retrieval, Diacritization