A LDA-based method for automatic tagging of Youtube videos

WIAMIS(2013)

引用 2|浏览27
暂无评分
摘要
This article presents a method for automatic tagging of Youtube videos. The proposed method combines an automatic speech recognition (ASR) system, that extracts the spoken contents, and a keyword extraction component that aims at finding a small set of tags representing a video. In order to improve the robustness of the tagging system to the recognition errors, a video transcription is represented in a topic space obtained by a Latent Dirichlet Allocation (LDA), in which each dimension is automatically characterized by a list of weighted terms. Tags are extracted by combining the weighted word list of the best LDA classes. We evaluate this method by employing the user-provided tags of Youtube videos as reference and we investigate the impact of the topic model granularity. The obtained results demonstrate the interest of such model to improve the robustness of the tagging system.
更多
查看译文
关键词
weighted word list,lda based method,video signal processing,multimedia systems,speech recognition,weighted term,spoken content extraction,video transcription,keyword extraction,structuring multimedia collection,youtube video,recognition error,tagging system,automatic speech recognition system,audio categorization,social networking (online),video retrieval,latent dirichlet allocation,automatic tagging,robustness,speech,semantics,acoustics
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要