A Recommendation-Based Parameter Tuning Approach for Hadoop

2017 IEEE 7th International Symposium on Cloud and Service Computing (SC2)(2017)

引用 6|浏览14
暂无评分
摘要
Nowadays we have entered the big data era. Hadoop, one of the popular big data processing platforms, has many parameters that relate closely to the utilization of resources (e.g. CPU or memory). Tuning these parameters thus becomes one of the important approaches to improve the resource utilization of Hadoop. However, tuning parameters manually is impractical because the time cost fortuning is too high. Hence it is necessary to configure parameters automatically and quickly to optimize resource utilization. The former auto-tuning methods often take a long time before getting the optimal configuration, which would reduce the overall resource efficiency of cluster. In this paper, we propose mrEtalon, an adaptive tuning framework to recommend a near-optimal configuration for the new job in a short time. mrEtalon sets a configuration repository to provide candidate configurations, as well as a collaborative filtering based recommendation engine that can accelerate the optimization for parameters. We have deployed mrEtalon in our experimental cluster, and the results demonstrate that, for a new MapReduce application, compared to the former methods, mrEtalon can reduce the recommend time to 20% to 30% while keeping nearly the same recommendation quality.
更多
查看译文
关键词
Big data processing,resource utilization optimization,parameter tuning,online configuration recommendation,collaborative filtering
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要