Brain-Machine Interface control of a robot arm using actor-critic rainforcement learning.

EMBC(2012)

引用 25|浏览28
暂无评分
摘要
Here we demonstrate how a marmoset monkey can use a reinforcement learning (RL) Brain-Machine Interface (BMI) to effectively control the movements of a robot arm for a reaching task. In this work, an actor-critic RL algorithm used neural ensemble activity in the monkey's motor cortext to control the robot movements during a two-target decision task. This novel approach to decoding offers unique advantages for BMI control applications. Compared to supervised learning decoding methods, the actor-critic RL algorithm does not require an explicit set of training data to create a static control model, but rather it incrementally adapts the model parameters according to its current performance, in this case requiring only a very basic feedback signal. We show how this algorithm achieved high performance when mapping the monkey's neural states (94%) to robot actions, and only needed to experience a few trials before obtaining accurate real-time control of the robot arm. Since RL methods responsively adapt and adjust their parameters, they can provide a method to create BMIs that are robust against perturbations caused by changes in either the neural input space or the output actions they generate under different task requirements or goals.
更多
查看译文
关键词
motion control,robots,real-time robot arm control,neurophysiology,supervised learning decoding methods,brain-machine interface control,learning (artificial intelligence),brain-computer interfaces,two-target decision task,training,monkey neural states,neural ensemble activity,feedback signal,feedback,static control model,actor-critic rl algorithm,bmi control applications,robot actions,marmoset monkey,monkey motor cortex,actor-critic reinforcement learning,robot arm movements control,decoding,neural input space,real-time systems,training data,brain computer interfaces,real time systems,learning artificial intelligence
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要