Measuring Information Gain using Provenance

PROCEEDINGS OF 14TH INTERNATIONAL WORKSHOP ON THE THEORY AND PRACTICE OF PROVENANCE, TAPP 2022(2022)

引用 0|浏览4
暂无评分
摘要
In recent years, a large amount of data is collected from multiple sources and the demands for analyzing these data have increased enormously. Data sharing is a valuable part of this data-intensive and collaborative environment due to the synergies and added values created by multi-modal datasets generated from different sources. In this work, we introduce a technique that can be used for quantifying the degree of information gain (IG) that may be obtained over data sharing. Our method captures both where- (to compute the IG over values) and how-provenance (to find matching records) and accurately computes the IG based on them. We conduct a preliminary evaluation to show the runtime of our approach over a real-world dataset.
更多
查看译文
关键词
how and where provenance,information gain
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要