Local support-based partition algorithm for frequent pattern mining

Vijayakumar Kadappa, Shivaraju Nagesh

Pattern Analysis and Applications (2018)

Abstract
Frequent pattern (itemset) mining is an established approach to knowledge discovery. Minimizing the number of database scans (I/O overhead) is a central challenge in frequent itemset mining. The Partition algorithm was one of the early approaches to reducing database I/O overhead relative to the Apriori algorithm and related methods. However, the Partition algorithm still incurs significant I/O overhead, because it reads the database twice from secondary storage, and it has high time complexity when computing frequent itemsets in large databases. In this work, an improved partition algorithm is proposed that reads the database only once and uses local support information to avoid further scans. In computational time, the proposed algorithm outperforms the Apriori and Partition algorithms and comes close to the FP-Growth algorithm. It uses less memory than FP-Growth and is competitive with the other algorithms. In database access time, it performs better than the FP-Growth, Partition, and Apriori methods.
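For readers unfamiliar with the baseline, the following is a minimal sketch of the classic two-scan Partition scheme that the abstract takes as its starting point. It is not the proposed single-scan algorithm, whose details are not given here. The function names `local_frequent_itemsets` and `partition_mine` are hypothetical, and in-memory lists stand in for a database on secondary storage, so each "scan" is simply a pass over a list.

```python
from itertools import combinations
from math import ceil


def local_frequent_itemsets(partition, min_support):
    """Return {itemset: local count} for itemsets frequent within one partition."""
    min_count = ceil(min_support * len(partition))

    # Level 1: count single items in this partition.
    counts = {}
    for transaction in partition:
        for item in transaction:
            key = frozenset([item])
            counts[key] = counts.get(key, 0) + 1
    current = {s: n for s, n in counts.items() if n >= min_count}
    frequent = dict(current)

    # Level k: join frequent (k-1)-itemsets, then count the candidates.
    k = 2
    while current:
        candidates = {a | b for a, b in combinations(list(current), 2)
                      if len(a | b) == k}
        counts = {c: sum(1 for t in partition if c <= t) for c in candidates}
        current = {s: n for s, n in counts.items() if n >= min_count}
        frequent.update(current)
        k += 1
    return frequent


def partition_mine(transactions, min_support, n_partitions):
    """Classic Partition scheme: local mining (scan 1), global counting (scan 2)."""
    transactions = [frozenset(t) for t in transactions]
    size = ceil(len(transactions) / n_partitions)
    partitions = [transactions[i:i + size]
                  for i in range(0, len(transactions), size)]

    # Scan 1: any globally frequent itemset is locally frequent in at least
    # one partition, so the union of local results is a complete candidate set.
    candidates = set()
    for part in partitions:
        candidates.update(local_frequent_itemsets(part, min_support))

    # Scan 2: re-read all transactions to obtain exact global support counts.
    min_count = ceil(min_support * len(transactions))
    result = {}
    for candidate in candidates:
        count = sum(1 for t in transactions if candidate <= t)
        if count >= min_count:
            result[candidate] = count
    return result


if __name__ == "__main__":
    demo = [{"a", "b", "c"}, {"a", "b"}, {"a", "c"},
            {"b", "c"}, {"a", "b", "c"}, {"a"}]
    for itemset, count in partition_mine(demo, 0.5, 2).items():
        print(sorted(itemset), count)
```

The second pass over `transactions` in `partition_mine` corresponds to the second database read that the abstract identifies as the overhead to eliminate. According to the abstract, the proposed method avoids that pass by using local support information gathered during the first read.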
Keywords
Data mining, Frequent itemset mining, Partition algorithm