Browsing videos by automatically detected audio events

EUROCON - International Conference Computer as a Tool(2011)

引用 5|浏览13
暂无评分
摘要
This paper focuses on Audio Event Detection (AED), a research area which aims to substantially enhance the access to audio in multimedia content. With the ever-growing quantity of multimedia documents uploaded on the Web, automatic description of the audio content of videos can provide very useful information, to index, archive and search multimedia documents. Preliminary experiments with a sound effects corpus showed good results for training models. However, the performance on the real data test set, where there are overlapping audio events and continuous background noise is lower. This paper describes the AED framework and methodologies used to build 6 Audio Event detectors, based on statistical machine learning tools (Support Vector Machines). The detectors showed some promising improvements achieved by adding background noises to the training data, comprised of clean sound effects that are quite different from the real audio events in real life videos and movies. A graphical interface prototype is also presented, that allows browsing a movie by its content and provides an audio event description with time codes.
更多
查看译文
关键词
audio signal processing,cinematography,multimedia communication,statistical analysis,support vector machines,video retrieval,video signal processing,AED framework,World Wide Web,audio access,audio event description,audio event detection,clean sound effect,continuous background noise,graphical interface prototype,movie browsing,multimedia content,multimedia document archive,multimedia document index,multimedia document search,overlapping audio event,real audio event,real life movies,real life video,sound effect corpus,statistical machine learning tool,support vector machine,time code,video audio content,video browsing
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要