No More Ambiguity in 360$^\circ$ Room Layout via Bi-Layout Estimation
CVPR 2024(2024)
摘要
Inherent ambiguity in layout annotations poses significant challenges to
developing accurate 360 room layout estimation models. To address this
issue, we propose a novel Bi-Layout model capable of predicting two distinct
layout types. One stops at ambiguous regions, while the other extends to
encompass all visible areas. Our model employs two global context embeddings,
where each embedding is designed to capture specific contextual information for
each layout type. With our novel feature guidance module, the image feature
retrieves relevant context from these embeddings, generating layout-aware
features for precise bi-layout predictions. A unique property of our Bi-Layout
model is its ability to inherently detect ambiguous regions by comparing the
two predictions. To circumvent the need for manual correction of ambiguous
annotations during testing, we also introduce a new metric for disambiguating
ground truth layouts. Our method demonstrates superior performance on benchmark
datasets, notably outperforming leading approaches. Specifically, on the
MatterportLayout dataset, it improves 3DIoU from 81.70
full test set and notably from 54.80
ambiguity. Project page: https://liagm.github.io/Bi_Layout/
更多查看译文
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要