TorchSparse++: Efficient Point Cloud Engine

Haotian Tang, Shang Yang, Zhijian Liu, Ke Hong, Zhongming Yu, Xiuyu Li, Guohao Dai, Yu Wang, Song Han

CVPR Workshops (2023)

Abstract

Point cloud computation has become an increasingly important workload for autonomous driving and other applications. Unlike dense 2D computation, point cloud convolution has sparse and irregular computation patterns and thus requires dedicated inference system support with specialized high-performance kernels. While existing point cloud deep learning libraries have developed different dataflows for convolution on point clouds, they assume a single dataflow throughout the execution of the entire model. In this work, we systematically analyze and improve existing dataflows. Our resulting system, TorchSparse++, achieves 2.9x, 3.3x, 2.2x, and 1.8x measured end-to-end inference speedup on an NVIDIA A100 GPU over the state-of-the-art MinkowskiEngine, SpConv 1.2, TorchSparse, and SpConv v2, respectively. Furthermore, TorchSparse++ is the only system to date that supports all necessary primitives for 3D segmentation, detection, and reconstruction workloads in autonomous driving. Code is publicly released at https://github.com/mit-han-lab/torchsparse.

Keywords

autonomous driving, dataflows, deep learning libraries, dense 2D computation, efficient point cloud engine, high-performance kernels, inference system support, irregular computation patterns, measured end-to-end speedup, point cloud computation, point cloud convolution, point clouds, reconstruction workloads, single dataflow, sparse computation patterns, SpConv, TorchSparse
