Online plagiarism detection through exploiting lexical, syntactic, and semantic information

ACL '12 Proceedings of the ACL 2012 System Demonstrations(2012)

引用 2|浏览0
暂无评分
摘要
In this paper, we introduce a framework that identifies online plagiarism by exploiting lexical, syntactic and semantic features that includes duplication-gram, reordering and alignment of words, POS and phrase tags, and semantic similarity of sentences. We establish an ensemble framework to combine the predictions of each model. Results demonstrate that our system can not only find considerable amount of real-world online plagiarism cases but also outperforms several state-of-the-art algorithms and commercial software.
更多
查看译文
关键词
ensemble framework,online plagiarism,real-world online plagiarism case,semantic feature,semantic similarity,commercial software,considerable amount,phrase tag,state-of-the-art algorithm,Online plagiarism detection,semantic information
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要