Two Approaches to Building Collaborative, Task-Oriented Dialog Agents through Self-Play

arxiv(2021)

引用 0|浏览26
暂无评分
摘要
Task-oriented dialog systems are often trained on human/human dialogs, such as collected from Wizard-of-Oz interfaces. However, human/human corpora are frequently too small for supervised training to be effective. This paper investigates two approaches to training agent-bots and user-bots through self-play, in which they autonomously explore an API environment, discovering communication strategies that enable them to solve the task. We give empirical results for both reinforcement learning and game-theoretic equilibrium finding.
更多
查看译文
关键词
dialog agents,building collaborative,task-oriented,self-play
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要