A Value Factorization Method for MARL Based on Correlation between Individuals

Mathematical Problems in Engineering (2022)

Abstract
Value factorization is a popular method for cooperative multi-agent deep reinforcement learning that addresses both the explosion of the joint state-action space and the partial observability problem. However, most existing algorithms consider only the contribution of each individual agent and ignore the correlation between individuals, which leads to poor coordination between agents in complex environments. To resolve this problem, this paper proposes CI-VF, a value factorization method for multi-agent deep reinforcement learning based on the correlation between individuals, which promotes coordination between agents effectively. First, the individual value function vectors are obtained from the outputs of the individual networks in each round. Second, a Spearman correlation coefficient matrix is calculated from these vectors to measure the degree of correlation between agents, and a joint correlation coefficient is derived from it to optimize the joint value function. Finally, the optimized joint value function is used to train the individual networks. Experimental results show that the method outperforms QMIX and other baselines in various scenarios of the StarCraft Multi-Agent Challenge environment.
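The correlation step described in the abstract can be illustrated with a minimal sketch. It computes a Spearman correlation matrix over per-agent value vectors and then reduces it to a single number. The abstract does not say how the joint correlation coefficient is derived from the matrix, so the reduction used here (the mean of the off-diagonal entries) is purely an assumption for illustration, as are all function names and the choice to treat a constant vector as uncorrelated. This is not the authors' implementation.

```python
from math import sqrt


def average_ranks(values):
    """Return 1-based ranks; tied values share the mean of their positions."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    start = 0
    while start < len(order):
        end = start
        # Extend the run while the next sorted value equals the current one.
        while end + 1 < len(order) and values[order[end + 1]] == values[order[start]]:
            end += 1
        mean_rank = (start + end) / 2 + 1
        for k in range(start, end + 1):
            ranks[order[k]] = mean_rank
        start = end + 1
    return ranks


def pearson(x, y):
    """Pearson correlation of two equal-length sequences."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    norm_x = sqrt(sum((a - mean_x) ** 2 for a in x))
    norm_y = sqrt(sum((b - mean_y) ** 2 for b in y))
    if norm_x == 0 or norm_y == 0:
        return 0.0  # assumption: a constant vector is treated as uncorrelated
    return cov / (norm_x * norm_y)


def spearman_matrix(q_vectors):
    """Spearman matrix: Pearson correlation applied to the rank-transformed vectors."""
    ranked = [average_ranks(q) for q in q_vectors]
    n = len(ranked)
    return [
        [1.0 if i == j else pearson(ranked[i], ranked[j]) for j in range(n)]
        for i in range(n)
    ]


def joint_coefficient(matrix):
    """Hypothetical reduction: mean of the off-diagonal correlations."""
    n = len(matrix)
    if n < 2:
        return 1.0
    total = sum(matrix[i][j] for i in range(n) for j in range(n) if i != j)
    return total / (n * (n - 1))


# Toy per-agent value vectors (one row per agent); the numbers are made up.
q_vectors = [
    [1.0, 2.0, 3.0, 4.0],
    [10.0, 20.0, 30.0, 40.0],
    [4.0, 3.0, 2.0, 1.0],
]
rho = spearman_matrix(q_vectors)
joint = joint_coefficient(rho)
```

In this toy input the first two agents rank their values identically and the third ranks them in reverse, so the pairwise coefficients sit at the extremes of the range. How such a coefficient would enter the joint value function is specified in the paper itself, not in this sketch.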