PeerDedupe: Insights into the Peer-Assisted Sampling Deduplication

Peer-to-Peer Computing (2010)

Citations: 19 | Views: 25
Abstract
As the rapid growth of digital data drives a worldwide storage crisis, data deduplication is playing an increasingly prominent role in data storage. Driven by the problems of mainstream server-side deduplication schemes, there has recently been a tendency to introduce peer-assisted methods into deduplication systems. However, this topic remains vague and lacks thorough research. In this paper, we conduct an in-depth, quantitative investigation of peer-assisted deduplication. Through measurements, we observe that inter-peer duplication accounts for a large proportion of the total duplication and exhibits strong peer locality. Based on these observations, we propose PeerDedupe, a novel peer-assisted sampling deduplication approach. Experiments show that PeerDedupe can remove over 98% of duplication with each peer coordinating with no more than 5 other peers, and that it requires much less server RAM than existing work.
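The abstract does not spell out the algorithm, so the following is only an illustrative sketch of the general idea, not the authors' method. It assumes each peer summarizes its data as a set of chunk fingerprints, samples those fingerprints deterministically (so the same chunk is sampled on every peer), and greedily picks a bounded number of partner peers that cover most of its own sample. The names `fingerprints`, `sample`, `pick_partners`, and the `modulus` and `max_partners` parameters are hypothetical.

```python
import hashlib


def fingerprints(chunks):
    """Return the set of SHA-1 hex fingerprints for an iterable of byte chunks."""
    return {hashlib.sha1(chunk).hexdigest() for chunk in chunks}


def sample(fps, modulus=4):
    """Deterministic sample: keep fingerprints whose value is divisible by `modulus`.

    Because the rule depends only on the fingerprint, every peer samples the
    same chunks, so sample intersections approximate real overlap.
    (Assumed sampling rule, chosen for illustration.)
    """
    return {fp for fp in fps if int(fp, 16) % modulus == 0}


def pick_partners(me, others, max_partners=5, modulus=4):
    """Greedily choose up to `max_partners` peers covering this peer's sample.

    `me` is this peer's fingerprint set; `others` maps peer name -> fingerprint set.
    Each round picks the peer sharing the most still-uncovered sampled fingerprints.
    """
    uncovered = sample(me, modulus)
    sampled = {name: sample(fps, modulus) for name, fps in others.items()}
    chosen = []
    while uncovered and len(chosen) < max_partners:
        best = max(sampled, key=lambda n: len(sampled[n] & uncovered), default=None)
        if best is None or not (sampled[best] & uncovered):
            break  # no remaining peer shares any uncovered fingerprint
        chosen.append(best)
        uncovered -= sampled.pop(best)
    return chosen
```

With `modulus=1` every fingerprint is sampled, which makes the greedy choice easy to follow on toy data; a larger `modulus` trades estimation accuracy for less fingerprint exchange.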
Keywords
RAM usage, random-access storage, data deduplication, PeerDedupe, storage management, data compression, inter-peer duplication account, peer-assisted sampling deduplication, mainstream server-side deduplication, peer-to-peer computing, sampling methods, digital data, greedy algorithms, data storage, servers, Weibull distribution, redundancy, estimation, accuracy