EXPERT: Effective and flexible error protection by redundant multithreading

2018 Design, Automation & Test in Europe Conference & Exhibition (DATE)(2018)

引用 27|浏览32
暂无评分
摘要
Resiliency is a first-order design concern in modern microprocessor design. Compiler-level Redundant MultiThreading (RMT) schemes are promising because of their capability to detect the manifestation of hardware transient and permanent faults. In this work, we propose EXPERT, a compiler-level RMT scheme which can detect the manifestation of hardware faults in all hardware components. EXPERT transformation generates a checker thread for program main execution thread. These redundant threads execute simultaneously on two physically different cores of a multi-core processor. They perform mostly same computations, however, after each memory write operation committed by the main thread, the checker thread loads back the written data from the memory and checks it against its own locally computed values. If they match, execution continues. Otherwise, the error flag will be raised. Our processor-wide statistical transient and permanent fault injection experiments show that EXPERT error coverage is ~65x better than the state-of-the-art scheme.
更多
查看译文
关键词
EXPERT error coverage,flexible error protection,first-order design concern,modern microprocessor design,compiler-level Redundant MultiThreading schemes,compiler-level RMT scheme,EXPERT transformation,program main execution thread,multicore processor,error flag,permanent fault injection experiments
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要