Runtime Verification of Learning Properties for Reinforcement Learning Algorithms.
CoRR(2023)
摘要
Reinforcement learning (RL) algorithms interact with their environment in a
trial-and-error fashion. Such interactions can be expensive, inefficient, and
timely when learning on a physical system rather than in a simulation. This
work develops new runtime verification techniques to predict when the learning
phase has not met or will not meet qualitative and timely expectations. This
paper presents three verification properties concerning the quality and
timeliness of learning in RL algorithms. With each property, we propose design
steps for monitoring and assessing the properties during the system's
operation.
更多查看译文
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要