Towards Building An Arabic Plagiarism Detection System: Plagiarism Detection In Arabic

International Journal of Information Retrieval Research (2019)

Abstract
This article describes a plagiarism detection system for the Arabic language that combines several similarity-measure techniques to uncover plagiarism in Arabic documents. The proposed system consists of two main components: document retrieval and detailed similarity analysis. The document-retrieval component generates queries from a given suspicious document and uses the Google Search API to retrieve candidate source documents from the Web. The similarity-analysis component takes each source document in turn and attempts to identify the plagiarized parts of the suspicious document. The proposed system is thoroughly evaluated using an indigenous corpus. At the document-retrieval level, the system achieves an F-score above 75%, whereas at the detailed similarity-computation level, the F-score is above 70%.
Keywords
Arabic Plagiarism Corpus, Document Signature, Google API, Information Retrieval, Plagiarism Detection, Plagiarism in Arabic, Suspicious Document, Text Similarity
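
The two-stage pipeline described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the abstract does not say how queries are built or which similarity measures are combined, so the fixed-size word windows, the word 3-gram containment score, the 0.5 threshold, and all function names below are assumptions chosen for illustration. The Web retrieval step (sending each query to a search API) is deliberately left out.

```python
import re


def make_queries(text, words_per_query=8, max_queries=5):
    """Cut a suspicious document into short word-window queries.

    Assumption: fixed-size windows stand in for whatever query
    generation strategy the real system uses.
    """
    words = text.split()
    queries = [
        " ".join(words[i:i + words_per_query])
        for i in range(0, len(words), words_per_query)
    ]
    return queries[:max_queries]


def word_ngrams(text, n=3):
    """Return the set of word n-grams in a text.

    Note: \\w does not match combining marks, so diacritized Arabic
    should be stripped of diacritics before calling this function.
    """
    words = re.findall(r"\w+", text.lower())
    return {tuple(words[i:i + n]) for i in range(len(words) - n + 1)}


def containment(suspicious, source, n=3):
    """Fraction of the suspicious passage's n-grams found in the source."""
    susp = word_ngrams(suspicious, n)
    if not susp:
        return 0.0
    return len(susp & word_ngrams(source, n)) / len(susp)


def flag_passages(suspicious_doc, source_doc, threshold=0.5, n=3):
    """Return sentences of the suspicious document that overlap the source.

    Assumption: sentences are split on Latin and Arabic end punctuation,
    and a single containment threshold decides what gets flagged.
    """
    sentences = [
        s.strip()
        for s in re.split(r"[.!?؟]+", suspicious_doc)
        if s.strip()
    ]
    return [s for s in sentences if containment(s, source_doc, n) >= threshold]
```

In this sketch, each candidate source document retrieved for the queries would be passed to `flag_passages` in turn, mirroring the abstract's description of analyzing one source document at a time.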