TomusBlobs: Towards Communication-Efficient Storage for MapReduce Applications in Azure

Cluster, Cloud and Grid Computing(2012)

引用 16|浏览0
暂无评分
摘要
The emergence of cloud computing brought the opportunity to use large-scale compute infrastructures for a broad spectrum of applications and users. As the cloud paradigm gets attractive for the " elasticity'' in resource usage and associated costs (the users only pay for resources actually used), cloud applications still suffer from the high latencies and low performance of cloud storage services. Enabling high-throughput massive data processing on cloud data becomes a critical issue, as it impacts the overall application performance. In this paper we address the above challenge at the level of the cloud storage. We introduce a concurrency-optimized data storage system which federates the virtual disks associated to VMs. We demonstrate the performance of our solution for efficient data-intensive processing on commercial clouds by building an optimized prototype MapReduce framework for Azure that leverages the benefits of our storage solution. We perform extensive synthetic benchmarks as well as experiments with real-world applications: they demonstrate that our solution brings substantial benefits to data intensive applications compared to approaches relying on state-of-the-art cloud object storage.
更多
查看译文
关键词
state-of-the-art cloud object storage,cloud paradigm,commercial cloud,cloud storage,cloud storage service,cloud computing,cloud application,towards communication-efficient storage,concurrency-optimized data storage system,mapreduce applications,cloud data,storage solution,concurrency control,prototypes,spectrum,computer architecture,data processing,data storage,scalability,virtual machines,cloud applications,scheduling,throughput,vm,resource allocation,distributed databases,high throughput
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要