Nondeterminism In Mapreduce Considered Harmful? An Empirical Study On Non-Commutative Aggregators In Mapreduce Programs

ICSE '14: 36th International Conference on Software Engineering Hyderabad India May, 2014(2014)

引用 50|浏览92
暂无评分
摘要
The simplicity of MapReduce introduces unique subtleties that cause hard-to-detect bugs; in particular, the unfixed order of reduce function input is a source of nondeterminism that is harmful if the reduce function is not commutative and sensitive to input order. Our extensive study of production MapReduce programs reveals interesting findings on commutativity, nondeterminism, and correctness. Although non-commutative reduce functions lead to five bugs in our sample of well-tested production programs, we surprisingly have found that many non-commutative reduce functions are mostly harmless due to, for example, implicit data properties. These findings are instrumental in advancing our understanding of MapReduce program correctness.
更多
查看译文
关键词
MapReduce,nondeterminism,commutativity,bug
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要