Efficiently Mining Large Gradual Patterns Using Chunked Storage Layout

ADVANCES IN DATABASES AND INFORMATION SYSTEMS, ADBIS 2021(2021)

引用 1|浏览5
暂无评分
摘要
Existing approaches for extracting gradual patterns become inefficient in terms of memory usage when applied on data sets with huge numbers of objects. This inefficiency is caused by the contiguous nature of loading binary matrices into main memory as single blocks when validating candidate gradual patterns. This paper proposes an efficient storage layout that allows these matrices to be split and loaded into/from memory in multiple smaller chunks. We show how HDF5 (Hierarchical Data Format version 5) may be used to implement this chunked layout and our experiments reveal a great improvement in memory usage efficiency especially on huge data sets.
更多
查看译文
关键词
Binary matrices, Gradual patterns, HDF5, Memory chunk, Zarr
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要