Constructing and Cleaning Identity Graphs in the LOD Cloud

Data Intelligence(2020)

引用 4|浏览36
暂无评分
摘要
In the absence of a central naming authority on the Semantic Web, it is common for different data sets to refer to the same thing by different names. Whenever multiple names are used to denote the same thing, owl:same As statements are needed in order to link the data and foster reuse. Studies that date back as far as 2009, observed that the owl:same As property is sometimes used incorrectly. In our previous work, we presented an identity graph containing over 500 million explicit and 35 billion implied owl:same As statements, and presented a scalable approach for automatically calculating an error degree for each identity statement. In this paper, we generate subgraphs of the overall identity graph that correspond to certain error degrees. We show that even though the Semantic Web contains many erroneous owl:same As statements, it is still possible to use Semantic Web data while at the same time minimizing the adverse effects of misusing owl:same As.
更多
查看译文
关键词
Linked Open Data,Identity,Quality,Reasoning
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要