Mastering Complex Coordination Through Attention-Based Dynamic Graph

NEURAL INFORMATION PROCESSING, ICONIP 2023, PT I(2024)

引用 0|浏览1
暂无评分
摘要
The coordination between agents in multi-agent systems has become a popular topic in many fields. To catch the inner relationship between agents, the graph structure is combined with existing methods and improves the results. But in large-scale tasks with numerous agents, an overly complex graph would lead to a boost in computational cost and a decline in performance. Here we present DAGMIX, a novel graph-based value factorization method. Instead of a complete graph, DAGMIX generates a dynamic graph at each time step during training, on which it realizes a more interpretable and effective combining process through the attention mechanism. Experiments show that DAGMIX significantly outperforms previous SOTA methods in large-scale scenarios, as well as achieving promising results on other tasks.
更多
查看译文
关键词
Multi-Agent Reinforcement Learning,Coordination,Value Factorization,Dynamic Graph,Attention
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要