3UR-LLM: an End-to-End Multimodal Large Language Model for 3D Scene Understanding
CoRR (2025)
Keywords
3D scene understanding, multi-modal large language models, visual question answering