Eliminating Incorrect Cross-Language Links in Wikipedia.

WISE(2017)

引用 23|浏览22
暂无评分
摘要
Many Wikipedia articles that cover the same topic in different language editions are interconnected via cross-language links that enable the understanding of topics in multiple languages, as well as cross-language information retrieval applications. However, cross-language links are added manually by the users of Wikipedia and, as such, are often incorrect. In this paper, we propose an approach to automatically eliminate incorrect cross-language links based on the observation that groups of articles that are pairwise connected through cross-language links form independent connected components. For each incoherent component (i.e., one that contains two or more articles from the same language edition), our approach assigns a correctness score to its crosslinks and removes those with the lowest score to make the component coherent. The results of our evaluation on a snapshot of Wikipedia in 8 languages indicates that our approach shows quantitative promise.
更多
查看译文
关键词
Wikipedia, Cross-language links, Multi-language information retrieval
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要