On the use of audio events for improving video scene segmentation.

Image Analysis for Multimedia Interactive Services (2010)

Abstract
This work deals with the problem of automatic temporal segmentation of a video into elementary semantic units known as scenes. Its novelty lies in the use of high-level audio information, in the form of audio events, to improve scene segmentation performance. More specifically, the proposed technique builds upon a recently proposed audio-visual scene segmentation approach that constructs multiple scene transition graphs (STGs), each separately exploiting information from a different modality. In the extension of that approach presented in this work, audio event detection results are incorporated into the definition of an audio-based scene transition graph, while a visual-based scene transition graph is defined independently. The results of these two types of STGs are subsequently combined. The application of the proposed technique to broadcast videos demonstrates the usefulness of audio events for scene segmentation.
Keywords
graph theory, image segmentation, object detection, video signal processing, audio event detection, audio-based scene transition graph, automatic temporal segmentation, high-level audio information, multiple scene transition graph, video scene segmentation, visual-based scene transition graph
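
The abstract states that the results of the visual-based and audio-based STGs are combined, but it does not give the combination rule. The sketch below illustrates one plausible scheme under stated assumptions: each STG outputs a set of shot-boundary indices that it marks as scene boundaries, each modality's graphs vote, and a weighted sum of the two vote fractions is thresholded. The function names, `visual_weight`, and `threshold` are hypothetical and are not taken from the paper.

```python
from typing import List, Set


def boundary_votes(stg_results: List[Set[int]], num_boundaries: int) -> List[float]:
    """For each shot boundary, return the fraction of STGs that mark it as a scene boundary."""
    if not stg_results:
        return [0.0] * num_boundaries
    total = len(stg_results)
    return [
        sum(1 for result in stg_results if b in result) / total
        for b in range(num_boundaries)
    ]


def combine_stg_results(
    visual_results: List[Set[int]],
    audio_results: List[Set[int]],
    num_boundaries: int,
    visual_weight: float = 0.5,  # assumed weighting, not from the paper
    threshold: float = 0.5,      # assumed decision threshold, not from the paper
) -> List[int]:
    """Keep the shot boundaries whose weighted vote across both modalities reaches the threshold."""
    visual = boundary_votes(visual_results, num_boundaries)
    audio = boundary_votes(audio_results, num_boundaries)
    audio_weight = 1.0 - visual_weight
    return [
        b
        for b in range(num_boundaries)
        if visual_weight * visual[b] + audio_weight * audio[b] >= threshold
    ]


if __name__ == "__main__":
    # Toy data: four visual-based and four audio-based STGs over 8 shot boundaries.
    visual_stgs = [{2, 5}, {2}, {2, 5}, {2}]
    audio_stgs = [{2, 5}, {2, 6}, {5}, {2}]
    print(combine_stg_results(visual_stgs, audio_stgs, num_boundaries=8))  # [2, 5]
```

In this toy run, boundary 6 is proposed by a single audio-based graph and is discarded, while boundaries 2 and 5 receive enough support from both modalities to be kept. Raising `visual_weight` would make the decision lean more heavily on the visual-based graphs.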