Partially redundant fence elimination for x86, ARM, and power processors.

CC(2017)

引用 7|浏览14
暂无评分
摘要
We show how partial redundancy elimination (PRE) can be instantiated to perform provably correct fence elimination for multi-threaded programs running on top of the x86, ARM and IBM Power relaxed memory models. We have implemented our algorithm in the backends of the
LLVM
compiler infrastructure. The optimisation does not induce an observable overhead at compile-time and can result in up-to 10% speedup on some benchmarks.
更多
查看译文
关键词
Compiler optimisations, Shared-memory concurrency, Weak-memory models
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要