Team formation through an assessor: choosing MARL agents in pursuit–evasion games

Yue Zhao, Lushan Ju,Josè Hernández-Orallo

Complex & Intelligent Systems(2024)

引用 0|浏览3
暂无评分
摘要
Team formation in multi-agent systems usually assumes the capabilities of each team member are known, and the best formation can be derived from that information. As AI agents become more sophisticated, this characterisation is becoming more elusive and less predictive about the performance of a team in cooperative or competitive situations. In this paper, we introduce a general and flexible way of anticipating the outcome of a game for any lineups (the agents, sociality regimes and any other hyperparameters for the team). To this purpose, we simply train an assessor using an appropriate team representation and standard machine learning techniques. We illustrate how we can interrogate the assessor to find the best formations in a pursuit–evasion game for several scenarios: offline team formation, where teams have to be decided before the game and not changed afterwards, and online team formation, where teams can see the lineups of the other teams and can be changed at any time.
更多
查看译文
关键词
Team formation,Multi-agent reinforcement learning,Pursuit–evasion Games,Multi-agent systems
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要