ARFA: Adaptive Reception Field Aggregation for 3-D Detection From LiDAR Point Cloud

IEEE Sensors Journal (2023)

Abstract
Submanifold convolution is widely used in 3-D detection. However, it brings different receptive fields to voxels due to the nonuniform distribution in Light Detection and Ranging (LiDAR) point clouds, resulting in degradation of the feature extraction ability for distant voxels and the performance of detectors. We propose a solution, adaptive receptive field aggregation (ARFA) network, an end-to-end two-stage LiDAR 3-D object detection architecture. ARFA searches the top-K nearest neighbors (KNNs) to adaptively adjust the receptive field of sparse voxels, followed by a self-attention aggregation (SA) module with density feature embedding (DE) to aggregate the semantic information in the receptive field. In order to further strengthen the detection performance for small objects, we also propose an upsampling bird's-eye view (U-BEV) backbone and an Intersection over Union (IoU)-aware head to enhance the quality of the proposals and rectify the confidence of the predicted bounding boxes. ARFA outperforms the state-of-the-art methods on the Waymo Open dataset and achieves competitive results on the popular KITTI dataset.
Keywords
Convolution, Feature extraction, Three-dimensional displays, Point cloud compression, Proposals, Detectors, Sensors, 3-D object detection, Light Detection and Ranging (LiDAR), point cloud data processing, receptive field
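
The abstract describes a two-step idea: pick the K nearest voxels for each voxel so that every voxel sees the same number of neighbors regardless of local density, then aggregate those neighbors with self-attention after adding a density embedding. The following is a minimal NumPy sketch of that idea under stated assumptions, not the authors' implementation. The brute-force neighbor search, the single attention head, the use of a per-voxel point count as the density signal, and all names (`knn_indices`, `aggregate`, `w_q`, `w_k`, `w_v`, `w_d`) are hypothetical choices made for illustration.

```python
import numpy as np

def knn_indices(coords, k):
    """Indices of the k nearest voxels for every voxel (the voxel itself included)."""
    diff = coords[:, None, :] - coords[None, :, :]      # (N, N, 3)
    dist2 = np.sum(diff * diff, axis=-1)                # squared distances, (N, N)
    return np.argsort(dist2, axis=1, kind="stable")[:, :k]

def softmax(x, axis=-1):
    x = x - x.max(axis=axis, keepdims=True)             # subtract max for stability
    e = np.exp(x)
    return e / e.sum(axis=axis, keepdims=True)

def aggregate(coords, feats, density, k, w_q, w_k, w_v, w_d):
    """Attention-weighted aggregation over a fixed-size KNN neighborhood.

    coords:  (N, 3) voxel centers
    feats:   (N, C) voxel features
    density: (N,)   assumed density signal, e.g. points per voxel
    w_q, w_k, w_v: (C, C) projection matrices; w_d: (C,) density projection
    """
    idx = knn_indices(coords, k)                        # (N, k)
    # Assumed density embedding: a linear projection added to the features.
    emb = feats + density[:, None] * w_d[None, :]       # (N, C)
    q = emb @ w_q                                       # (N, C)
    keys = emb[idx] @ w_k                               # (N, k, C)
    vals = emb[idx] @ w_v                               # (N, k, C)
    scores = np.einsum("nc,nkc->nk", q, keys) / np.sqrt(q.shape[-1])
    attn = softmax(scores, axis=1)                      # each row sums to 1
    out = np.einsum("nk,nkc->nc", attn, vals)           # (N, C)
    return out, idx, attn

if __name__ == "__main__":
    rng = np.random.default_rng(0)
    n, c, k = 32, 8, 4
    coords = rng.normal(size=(n, 3))
    feats = rng.normal(size=(n, c))
    density = rng.integers(1, 10, size=n).astype(float)
    w_q, w_k, w_v = (rng.normal(size=(c, c)) for _ in range(3))
    w_d = rng.normal(size=c)
    out, idx, attn = aggregate(coords, feats, density, k, w_q, w_k, w_v, w_d)
    print(out.shape, idx.shape)
```

Because every voxel attends to exactly `k` neighbors, a distant voxel in a sparse region gets a spatially wider neighborhood than a nearby voxel in a dense region, which is the adaptive behavior the abstract attributes to the KNN search. The all-pairs distance matrix costs O(N²) memory, so a spatial index would be needed at real point-cloud scale.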