On the appropriate pattern frequentness measure and pattern generation mode: a critical review

Proceedings of the 23rd International Database Applications & Engineering Symposium (2019)

Abstract
Classic pattern mining is a fundamental subject in data mining and big data science. Its goal is to correctly find, from a given dataset, the patterns and their respective intrinsic frequentness. This paper examines two important yet misused instruments, the pattern frequentness measure "support" and the full enumeration pattern generation mode, which cause serious overfitting and thus deviate from the mining goal. A combined theoretical solution to these two critical issues is then proposed. This solution, together with the equilibrium condition introduced in this paper, forms a set of three fundamental rationality check criteria that every mining approach should observe. As such, the rationality of the mining theory and the reliability of the mining results would be substantially improved relative to previous work. Together, these promise a significant change towards more effective pattern mining.
Keywords
data mining, frequentness measure, overfitting, pattern frequency, pattern mining, probability anomaly, selective pattern generation, underfitting