Protecting Reward Function of Reinforcement Learning via Minimal and Non-catastrophic Adversarial Trajectory

2021 40th International Symposium on Reliable Distributed Systems (SRDS)(2021)

引用 0|浏览34
暂无评分
摘要
Reward functions are critical hyperparameters with commercial values for individual or distributed reinforcement learning (RL), as slightly different reward functions result in significantly different performance. However, existing inverse reinforcement learning (IRL) methods can be utilized to approximate reward functions just based on collected expert trajectories through observing. Thus, in the...
更多
查看译文
关键词
Measurement,Costs,Perturbation methods,Clustering algorithms,Reinforcement learning,Predictive models,Prediction algorithms
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要