CrystalBox: Future-Based Explanations for Input-Driven Deep RL Systems

Sagar Patel, Sangeetha Abdu Jyothi, Nina Narodytska

AAAI 2024 (2024)

Abstract

We present CrystalBox, a novel, model-agnostic, post-hoc explainability framework for Deep Reinforcement Learning (DRL) controllers in the large family of input-driven environments, which includes computer systems. We combine the natural decomposability of reward functions in input-driven environments with the explanatory power of decomposed returns. We propose an efficient algorithm to generate future-based explanations across both discrete and continuous control environments. Using applications such as adaptive bitrate streaming and congestion control, we demonstrate CrystalBox's capability to generate high-fidelity explanations. Across three practical use cases (contrastive explanations, network observability, and guided reward design), we further show that it offers higher utility than prior explainability techniques that identify salient features.
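
As a rough illustration of what "decomposed returns" means, the sketch below computes a separate discounted return for each component of a reward function and checks that the components sum to the ordinary return. It is not the CrystalBox algorithm. The function name `decomposed_returns`, the component names (`quality`, `stall_penalty`, `smoothness_penalty`), the reward values, and the discount factor are all assumptions chosen for illustration.

```python
from typing import Dict, List


def decomposed_returns(
    reward_components: Dict[str, List[float]], gamma: float = 0.9
) -> Dict[str, float]:
    """Discounted return of each reward component, computed separately."""
    returns = {}
    for name, rewards in reward_components.items():
        # Sum of gamma^t * r_t over the trajectory, for this component only.
        returns[name] = sum((gamma ** t) * r for t, r in enumerate(rewards))
    return returns


# Hypothetical three-step trajectory for a video-streaming controller.
# Component names and values are illustrative assumptions.
trajectory = {
    "quality": [1.0, 1.0, 2.0],
    "stall_penalty": [0.0, -1.0, 0.0],
    "smoothness_penalty": [0.0, 0.0, -0.5],
}

per_component = decomposed_returns(trajectory, gamma=0.5)

# Because the reward is a sum of components, the total return is the
# sum of the per-component returns.
total_return = sum(per_component.values())
```

Reading the per-component values side by side shows which part of the reward drives the expected outcome of an action, which is the kind of information a single scalar return hides.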

Keywords

ML: Transparent, Interpretable, Explainable ML; ML: Reinforcement Learning; APP: Web; APP: Other Applications
