Evaluate H2Hadoop and Amazon EMR performances by processing MR jobs in text data sets

2016 IEEE Long Island Systems, Applications and Technology Conference (LISAT)(2016)

引用 1|浏览6
暂无评分
摘要
Text data is defined as sequences of characters that may become big data that has no specific format and only can be processed using the original Hadoop. Amazon Web Services AWS provides virtual Cloud Computing services such as storing data using S3 service and processing big data using EMR service. Amazon Elastic MapReduce EMR uses the original Hadoop as a processing environment to its Cloud Computing services. Also, H2Hadoop is a developed version of Hadoop that provides big data processing service that uses the metadata of related jobs to improve Hadoop performance. In this paper, we process a find sequence job in text data using Amazon EMR and H2Hadoop, and we came up with a comparison between them that shows H2Hadoop performance is more efficient than Amazon EMR in some cases under different considerations.
更多
查看译文
关键词
BigData,Cloud Computing,Hadoop,H2Hadoop,Amazon EMR,Text Data
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要