Long-Term Occupancy Grid Prediction Using Recurrent Neural Networks

2019 International Conference on Robotics and Automation (ICRA)(2019)

引用 68|浏览17
暂无评分
摘要
We tackle the long-term prediction of scene evolution in a complex downtown scenario for automated driving based on Lidar grid fusion and recurrent neural networks (RNNs). A bird's eye view of the scene, including occupancy and velocity, is fed as a sequence to a RNN which is trained to predict future occupancy. The nature of prediction allows generation of multiple hours of training data without the need of manual labeling. Thus, the training strategy and loss function is designed for long sequences of real-world data (unbalanced, continuously changing situations, false labels, etc.). The deep CNN architecture comprises convolutional long short-term memories (ConvLSTMs) to separate static from dynamic regions and to predict dynamic objects in future frames. Novel recurrent skip connections show the ability to predict small occluded objects, i.e. pedestrians, and occluded static regions. Spatio-temporal correlations between grid cells are exploited to predict multimodal future paths and interactions between objects. Experiments also quantify improvements to our previous network, a Monte Carlo approach, and literature.
更多
查看译文
关键词
long-term occupancy grid prediction,recurrent neural networks,scene evolution,automated driving,Lidar grid fusion,birds eye view,RNNs,CNN architecture,convolutional long short-term memories,ConvLSTMs,Monte Carlo approach
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要