Average Task Execution Time Minimization under (m, k) Soft Error Constraint.

RTAS(2023)

引用 0|浏览13
暂无评分
摘要
Safety-critical systems are often subjected to transient faults. Since these transient faults may lead to soft errors that cause catastrophic consequences, error-handling must be addressed by design. Full-protection against faults is too costly in terms of resource usage. A common approach to relax the resource demands and limit the impact of errors is to consider (m, k)-constraints, which requires that at least m jobs out of any k consecutive jobs are error-free. To assure (m, k)-compliance, static patterns are widely used to select the job execution modes, i.e., either in an error-free mode at the cost of increased worst-case execution time or in an error-prone mode with the advantage of less execution time. Although static patterns have been shown to be effective in energy-aware designs, resource over-provision is inevitable due to the relatively low rate of error probability. In this work, we propose two dynamic (and adaptive) approaches that allow the scheduler to opportunistically select execution modes based on the error-history of the past jobs and the actual error probability. We firstly propose a Markov chain based solution if the error-probability is known and static and secondly a reinforcement learning-based approach that can handle unknown error probabilities. Experimental evaluations show that our approaches outperform the state-of-the-art in most of the evaluated cases in terms of average utilization for each task and the overall utilization for multitask systems.
更多
查看译文
关键词
actual error probability,average task execution time minimization,catastrophic consequences,energy-aware designs,error-free mode,error-handling,error-history,error-probability,error-prone mode,increased worst-case execution time,job execution modes,k consecutive jobs,m jobs,reinforcement learning-based approach,resource demands,resource over-provision,resource usage,safety-critical systems,soft errors,static patterns,transient faults,unknown error probabilities
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要