A Local-to-Global Approach to Multi-modal Movie Scene Segmentation
CVPR, pp. 10143-10152, 2020.
We propose a local-to-global scene segmentation framework to cover a hierarchical temporal and semantic information
Scene, as the crucial unit of storytelling in movies, contains complex activities of actors and their interactions in a physical environment. Identifying the composition of scenes serves as a critical step towards semantic understanding of movies. This is very challenging -- compared to the videos studied in conventional vision problems...更多
下载 PDF 全文