Querying Capability Comparison Of Hadoop Technologies To Find The More Sustainable Platform For Big Data

2017 IEEE INTERNATIONAL CONFERENCE ON POWER, CONTROL, SIGNALS AND INSTRUMENTATION ENGINEERING (ICPCSI)(2017)

Abstract
Big Data analytics is an emerging field in computing. It has far more complex processing and maintenance requirements and is therefore handled using high-end servers, which in turn require considerably more energy. High energy requirements can lead to more energy wastage, which is not a sustainable practice. This paper compares three platforms that provide a framework for big data management: Hive, Impala, and Spark. Many other platforms are available, but these three were chosen because they can be easily implemented even on a single-node Hadoop cluster. The results show that Cloudera Impala outperforms both of the other platforms, taking approximately 90% less time to perform the same task.
Keywords
Big Data, Hadoop Map-Reduce, Impala, Hive, Spark, Performance Comparison