Optimizing Data Partition For Nosql Cluster

2015 IEEE 12th Intl Conf on Ubiquitous Intelligence and Computing and 2015 IEEE 12th Intl Conf on Autonomic and Trusted Computing and 2015 IEEE 15th Intl Conf on Scalable Computing and Communications and Its Associated Workshops (UIC-ATC-ScalCom)(2015)

引用 1|浏览29
暂无评分
摘要
The data partition balance impacts the performance of NoSQL systems significantly. Most of the P2P NoSQL systems use consistent hashing to partition data automatically. Currently, these systems use random virtual nodes or manual configuration to divide the consistent hashing ring, which may cause load imbalance and degrade the performance. The problem is pronounced especially for heterogeneous clusters. In this paper, we focus on the partition strategy of consistent hashing ring and propose a data partition quantified criterion. When initializing a cluster, we convert the problem to an optimization problem to find the most even partitioning result. Experiments on Cassandra and Voldemort show these methods are better than current implementations. Besides, the algorithms are very efficient even for heterogeneous clusters.
更多
查看译文
关键词
DHT,Data Partition,Cassandra
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要