Knowledge Discovery via Content Indexing of Multimedia and Text

msra(2005)

引用 23|浏览8
暂无评分
摘要
Indexing and retrieving audio or video content presents challenges specific to the nature of these media. Two primary difficulties are the inaccuracy of speech recognition and the timed nature of streaming media — that is, the property that words and other information in audio/video are tied to times, and are not readily accessed and scanned at arbitrary positions. StreamSage's approach to resolving these problems exploits statistical properties of language to determine whether an occurrence of a query term is likely to signal a truly relevant interval, and where the segment should begin and end. We describe here the steps required to process large bodies of information in order to make them useful for intelligence analysts or other searchers. These include a set of content- specific capabilities, such as categorization and classification, large coverage word- sense disambiguation, topic detection, and
更多
查看译文
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要