LiDAR-based Online 3D Video Object Detection with Graph-based Message Passing and Spatiotemporal Transformer Attention

CVPR(2020)

引用 152|浏览265
暂无评分
摘要
Existing LiDAR-based 3D object detectors usually focus on the single-frame detection, while ignoring the spatiotemporal information in consecutive point cloud frames. In this paper, we propose an end-to-end online 3D video object detector that operates on point cloud sequences. The proposed model comprises a spatial feature encoding component and a spatiotemporal feature aggregation component. In the former component, a novel Pillar Message Passing Network (PMPNet) is proposed to encode each discrete point cloud frame. It adaptively collects information for a pillar node from its neighbors by iterative message passing, which effectively enlarges the receptive field of the pillar feature. In the latter component, we propose an Attentive Spatiotemporal Transformer GRU (AST-GRU) to aggregate the spatiotemporal information, which enhances the conventional ConvGRU with an attentive memory gating mechanism. AST-GRU contains a Spatial Transformer Attention (STA) module and a Temporal Transformer Attention (TTA) module, which can emphasize the foreground objects and align the dynamic objects, respectively. Experimental results demonstrate that the proposed 3D video object detector achieves state-of-the-art performance on the large-scale nuScenes benchmark.
更多
查看译文
关键词
TTA,temporal transformer attention module,STA,attentive spatiotemporal transformer GRU,receptive field,PMPNet,pillar message passing network,end-to-end online 3D video object detector,spatial transformer attention module,graph-based message passing,LiDAR-based online 3D video object detection,dynamic objects,foreground objects,attentive memory gating mechanism,AST-GRU,pillar feature,iterative message passing,pillar node,discrete point cloud frame,spatiotemporal feature aggregation component,spatial feature encoding component,point cloud sequences,consecutive point cloud frames,spatiotemporal information,single-frame detection
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要