RTailor: Parameterizing Soft Error Resilience for Mixed-Criticality Real-Time Systems.

Shaoyu Huang,Jianping Zeng, Xuanliang Deng, Sen Wang, Ashrarul H. Sifat, Burhanuddin Bharmal, Jia-Bin Huang,Ryan K. Williams,Haibo Zeng,Changhee Jung

2023 IEEE Real-Time Systems Symposium (RTSS)(2023)

引用 0|浏览0
暂无评分
摘要
Equipping real-time systems with soft error resilience can be challenging due to the tradeoff of the timing and failure requirements for mixed-criticality tasks. Violation of these requirements yields failed task scheduling in one way or another. However, not every task requires the same degree of soft error resilience. For example, low-criticality tasks can run with low or even no soft error resilience, whereas mid- or highcriticality tasks may require relatively high resilience depending on their inherent failure requirement. Unfortunately, existing soft error resilience schemes do not have the ability to control the degree of their resilience in a fine-grained way, i.e., they can only be turned on or off as a whole during task execution. To this end, this paper presents RTailor (Resilience Tailor), a compiler-directed parameterized soft error resilience scheme that achieves the desired level of soft error protection according to the demand of each task. The key idea is that for a given protection ratio, compilers can transform a hot loop such that the number of its iterations protected over the total iterations matches the ratio. Compared to full resilience protecting every iteration, RTailor's parameterized soft error resilience significantly reduces the performance overhead of tasks, thereby improving their real-time schedulability. The experimental results highlight that for four representative fault rates, RTailor achieves 15%~average schedulability improvements over the state-of-the-art work that lacks parameterized soft error resilience.
更多
查看译文
关键词
Soft Errors,Task Execution,Task Scheduling,Defect Rate,Total Iterations,Performance Overhead,Replica,Random Generation,Unmanned Aerial Vehicles,Error Detection,Optimal Sequence,Levels Of Resilience,Forward Error Correction,Program Execution,Idempotent,Total Execution Time,Real-time Tasks,Knapsack Problem,Code Size,Benchmark Suite,Recovery Error,Checkpointing,Runtime Overhead,Transient Faults,Voltage Scaling,Cache Misses,Version Of Task,Entailment,Back Edge,Compile Time
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要