Scalable parallelization of skyline computation for multi-core processors

Data Engineering(2015)

引用 54|浏览87
暂无评分
摘要
The skyline is an important query operator for multi-criteria decision making. It reduces a dataset to only those points that offer optimal trade-offs of dimensions. In general, it is very expensive to compute. Recently, multicore CPU algorithms have been proposed to accelerate the computation of the skyline. However, they do not sufficiently minimize dominance tests and so are not competitive with state-of-the-art sequential algorithms. In this paper, we introduce a novel multicore skyline algorithm, Hybrid, which processes points in blocks. It maintains a shared, global skyline among all threads, which is used to minimize dominance tests while maintaining high throughput. The algorithm uses an efficiently-updatable data structure over the shared, global skyline, based on point-based partitioning. Also, we release a large benchmark of optimized skyline algorithms, with which we demonstrate on challenging workloads a 100-fold speedup over state-of-the-art multicore algorithms and a 10-fold speedup with 16 cores over state-of-the-art sequential algorithms.
更多
查看译文
关键词
multi-threading,multiprocessing systems,query processing,data structure,dominance test minimization,hybrid algorithm,multicore cpu algorithms,multicore processors,multicriteria decision making,optimized skyline algorithms,point-based partitioning,query operator,scalable parallelization,shared-global skyline,skyline computation,throughput,data structures,parallel processing
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要