Directing The Development Of Constraint Languages By Checking Constraints On Rdf Data

INTERNATIONAL JOURNAL OF SEMANTIC COMPUTING(2016)

引用 2|浏览7
暂无评分
摘要
For research institutes, data libraries, and data archives, validating RDF data according to pre-defined constraints is a much sought-after feature, particularly as this is taken for granted in the XML world. Based on our work in two international working groups on RDF validation and jointly identified requirements to formulate constraints and validate RDF data, we have published 81 types of constraints that are required by various stakeholders for data applications.In this paper, we evaluate the usability of identified constraint types for assessing RDF data quality by (1) collecting and classifying 115 constraints on vocabularies commonly used in the social, behavioral, and economic sciences, either from the vocabularies themselves or from domain experts, and (2) validating 15,694 data sets (4.26 billion triples) of research data against these constraints. We classify each constraint according to (1) the severity of occurring violations and (2) based on which types of constraint languages are able to express its constraint type. Based on the large-scale evaluation, we formulate several findings to direct the further development of constraint languages.
更多
查看译文
关键词
RDF data validation, RDF data quality, constraint languages, semantic web, linked data, RDF
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要