ACCNet: Actor-Coordinator-Critic Net for "Learning-to-Communicate" with Deep Multi-agent Reinforcement Learning.

Hangyu Mao,Zhibo Gong,Yan Ni,Xiangyu Liu, Quanbin Wang, Weichen Ke, Chao Ma, Yiping Song,Zhen Xiao

arXiv: Artificial Intelligence(2017)

引用 26|浏览60
暂无评分
摘要
Communication is a critical factor for the big multi-agent world to stay organized and productive. Typically, most previous multi-agent learning-to-communicate studies try to predefine the communication protocols or use technologies such as tabular reinforcement learning and evolutionary algorithm, which can not generalize to changing environment or large collection of agents. In this paper, we propose an Actor-Coordinator-Critic Net (ACCNet) framework for solving learning-to-communicate problem. The ACCNet naturally combines the powerful actor-critic reinforcement learning technology with deep learning technology. It can efficiently learn the communication protocols even from scratch under partially observable environment. We demonstrate that the ACCNet can achieve better results than several baselines under both continuous and discrete action space environments. We also analyse the learned protocols and discuss some design considerations.
更多
查看译文
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要