A Taxonomy and Survey of Data Partitioning Algorithms for Big Data Distributed Systems

Quadri Waseem, Mohd Aizaini Maarof, Mohd Yazid Idris, Amril Nazir

Lecture Notes in Networks and Systems (2020)

Abstract
Data partitioning is a backbone of distributed systems: it boosts the performance of the big data applications that run on them. In recent years, many data partitioning algorithms have been developed that improve the management and processing of big data for real-time applications of big data stores. Furthermore, adding “elasticity” to data partitioning has removed the need for human intervention when big data applications on a distributed system face high workloads and skew. This survey proposes a taxonomy that characterizes and classifies the various types of data partitioning algorithms. The taxonomy helps identify current limitations in the state of the art and points to enhancements for the effective and efficient performance of big data stores on distributed systems. It not only highlights the design, similarities, and differences among state-of-the-art data partitioning algorithms of different types but also identifies areas that need further research.
Keywords
Big data, Distributed systems, Data partitioning, Elasticity