Multi-Skill Policy Transfer by Option-based Deep Reinforcement Learning for Autonomous Driving.

Bo Wei,Jianxin Zhao,Yinuo Zhao, Feng Tian

International Conference on Big Data Computing and Communications(2023)

引用 0|浏览5
暂无评分
摘要
Autonomous Driving presents a promising solution to the issue of road accidents, which are mostly caused by human errors. The use of artificial intelligence technologies in this field has resulted in significant advancements in tasks such as object detection, path planning, and obstacle avoidance, leading to safer and more efficient transportation. Reinforcement learning (RL) is a powerful machine learning algorithm that has demonstrated effectiveness in various autonomous driving applications. However, the vanilla single RL policy is inadequate when faced with more complex transportation scenarios involving heavy and dynamic traffic. In this paper, we propose a novel OPtion-based multi-skill policy Transfer method with deep RL for autonomous driving, called "Opt-RL", to learn a more complex target policy by integrating basic skills from multiple source policies. An adaptive option learning module is designed to efficiently use learned skills in higher-level target domains, determining when and where to distil policies from different sources. We conduct experiments on challenging tasks in the Mujoco Maze2D benchmark and a simulated highway environment. Experimental results demonstrate that Opt-RL can achieve knowledge transfer among different levels of policies and successfully train a complex high-level decision-making policy by reasonably integrating multiple basic skills; it also achieves a longer safe driving distance 16% higher than the baseline DQN.
更多
查看译文
关键词
reinforcement learning,policy distillation,highway traffic management
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要