An Enhanced Distributed Algorithm for Area Skyline Computation Based on Apache Spark.

Chen Li, Yang Cao,Ye Zhu, Jinli Zhang,Annisa,Debo Cheng, Huidong Tang, Shuai Jiang, Kenta Maruyama,Yasuhiko Morimoto

KSEM (4)(2023)

引用 0|浏览4
暂无评分
摘要
Skyline computations are a way of finding the best data points based on multiple criteria for location-based decision-making. However, as the input data grows larger, these computations become slower and more challenging. To address this issue, we propose an efficient algorithm that uses Apache Spark, a platform for distributed processing, to perform area skyline computations faster and more salable. Our algorithm consists of three main phases: calculating distances between data points, generating distance tuples, and computing the skyline. In the second phase, we apply a technique called local partial skyline extraction, which reduces the amount of data that needs to be sent from each executor (a parallel processing unit) to the driver (a central processing unit). The driver then computes the final skyline from the received data and creates filters to eliminate irrelevant points. Our experiments show that our algorithm can significantly reduce the data size and the computation time of the area skyline.
更多
查看译文
关键词
area skyline computation,algorithm
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要