Reinforcement Learning in Swarms that Learn

James F. Peters,Christopher Henry,Sheela Ramanna

IAT（2005）

引用 18|浏览14

暂无评分

摘要

This paper introduces an approach to reinforcement learning by cooperating agents using a variation of the actor critic method. This is made possible by considering behavior patterns of swarms in the context of approximation spaces. Rough set theory introduced by Zdzis - law Pawlak in 1982 provides a ground for deriving pattern-based rewards within approximation spaces. The framework provided by an approximation space makes it possible to derive pattern-based reference rewards used to estimate action preferences. Approximation spaces are used to derive action-based reference rewards at the swarm intelligence level. Two different forms of the actor critic reinforcement learning method are considered as a part of a study of learning in realtime by a swarm. The contribution of this article is the presentation of a new actor critic method defined in the context of approximation spaces. An ecosystem designed to facilitate study of reinforcement learning by swarms is briefly described. In addition, the results of ecosystem experiments for two forms of the actor critic method are given.

查看译文

关键词

swarm intelligence level,new actor critic method,pattern-based reference,ecosystem experiment,approximation space,reinforcement learning,actor critic reinforcement,actor critic method,action preference,pattern-based reward,action-based reference reward,learning artificial intelligence,swarm intelligence,rough set theory,real time,multi agent systems

AI 理解论文

溯源树

样例

生成溯源树，研究论文发展脉络

Chat Paper

正在生成论文摘要