WALT3D: Generating Realistic Training Data from Time-Lapse Imagery for Reconstructing Dynamic Objects under Occlusion
arxiv(2024)
摘要
Current methods for 2D and 3D object understanding struggle with severe
occlusions in busy urban environments, partly due to the lack of large-scale
labeled ground-truth annotations for learning occlusion. In this work, we
introduce a novel framework for automatically generating a large, realistic
dataset of dynamic objects under occlusions using freely available time-lapse
imagery. By leveraging off-the-shelf 2D (bounding box, segmentation, keypoint)
and 3D (pose, shape) predictions as pseudo-groundtruth, unoccluded 3D objects
are identified automatically and composited into the background in a clip-art
style, ensuring realistic appearances and physically accurate occlusion
configurations. The resulting clip-art image with pseudo-groundtruth enables
efficient training of object reconstruction methods that are robust to
occlusions. Our method demonstrates significant improvements in both 2D and 3D
reconstruction, particularly in scenarios with heavily occluded objects like
vehicles and people in urban scenes.
更多查看译文
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要