Curiosity-Driven Reinforcement Learning For Dialogue Management

Paula Wesselmann,Yen-Chen Wu,Milica Gasic

2019 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP)（2019）

引用 4|浏览56

暂无评分

摘要

In this paper we describe the use of curiosity rewards for dialogue policy learning of goal oriented dialogues via reinforcement learning. Using curiosity improves state-action space exploration and helps overcome reward sparsity. Additionally, for goal oriented dialogues it makes sense to perform inherently curious actions in order to gain knowledge about the user goal. We show that intrinsic curiosity rewards can replace random epsilon-greedy exploration and stabilize training. The best results are achieved when curiosity rewards are combined with epsilon-greedy exploration.

查看译文

关键词

curiosity-driven, reinforcement learning, dialogue management, intrinsic rewards, exploration

AI 理解论文

溯源树

样例

生成溯源树，研究论文发展脉络

Chat Paper

正在生成论文摘要