Sustainable Linked Data Generation: The Case of DBpedia.

Lecture Notes in Computer Science(2017)

引用 14|浏览86
暂无评分
摘要
DBpedia EF, the generation framework behind one of the Linked Open Data cloud's central interlinking hubs, has limitations with regard to quality, coverage and sustainability of the generated dataset. DBpedia can be further improved both on schema and data level. Errors and inconsistencies can be addressed by amending (i) the DBpedia EF; (ii) the DBpedia mapping rules; or (iii) Wikipedia itself from which it extracts information. However, even though the DBpedia EF and mapping rules are continuously evolving and several changes were applied to both of them, there are no significant improvements on the DBpedia dataset since its limitations were identified. To address these shortcomings, we propose adapting a different semantic-driven approach that decouples, in a declarative manner, the extraction, transformation and mapping rules execution. In this paper, we provide details regarding the new DBpedia EF, its architecture, technical implementation and extraction results. This way, we achieve an enhanced data generation process, which can be broadly adopted, and that improves its quality, coverage and sustainability.
更多
查看译文
关键词
Linked Data Generation, DBpedia Mappings, DBpedia Dataset, Mapping Rules, DBpedia Community
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要