Metadata for mixed-media access

Multimedia Data Management (1994)

Abstract
In this paper, we discuss mixed-media access, an information access paradigm for multimedia data in which the media type of a query may differ from that of the data. The types of media considered in this paper are speech, images of text, and full-length text. Some examples of metadata for mixed-media access are locations of keywords in speech and images, identification of speakers, locations of emphasized regions in speech, and locations of topic boundaries in text. Algorithms for automatically generating this metadata are described, including word spotting, speaker segmentation, emphatic speech detection, and subtopic boundary location. We illustrate queries composed of diverse media types in an example of access to recorded meetings, via speaker and keyword location.
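
As a rough illustration of the recorded-meeting example, the sketch below assumes that speaker segmentation yields time-stamped speaker segments and that word spotting yields time-stamped keyword hits. The record types, field names, function name, and sample values are all hypothetical and are not taken from the paper; the sketch only shows how the two kinds of metadata might be combined to answer a query such as "where did speaker A say 'budget'?".

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class SpeakerSegment:
    """Hypothetical output of speaker segmentation."""
    speaker: str
    start: float  # seconds from the start of the recording
    end: float


@dataclass(frozen=True)
class KeywordHit:
    """Hypothetical output of word spotting."""
    keyword: str
    time: float  # seconds from the start of the recording


def find_keyword_by_speaker(segments, hits, speaker, keyword):
    """Return sorted times at which `keyword` was spotted inside a
    segment attributed to `speaker`."""
    spans = [(s.start, s.end) for s in segments if s.speaker == speaker]
    return sorted(
        h.time
        for h in hits
        if h.keyword == keyword and any(a <= h.time < b for a, b in spans)
    )


# Made-up metadata for a two-minute recording (assumption, for illustration only).
segments = [
    SpeakerSegment("A", 0.0, 30.0),
    SpeakerSegment("B", 30.0, 75.0),
    SpeakerSegment("A", 75.0, 120.0),
]
hits = [
    KeywordHit("budget", 12.5),
    KeywordHit("budget", 41.0),
    KeywordHit("schedule", 80.0),
    KeywordHit("budget", 98.2),
]

print(find_keyword_by_speaker(segments, hits, "A", "budget"))  # [12.5, 98.2]
```

Keeping each kind of metadata as an independent list of time-stamped records means a query can combine them by simple interval tests, regardless of which algorithm produced each list.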
Keywords
information access paradigm, mixed-media access, subtopic boundary location, multimedia data, keyword location, speaker segmentation, media type, emphatic speech detection, diverse media type, full-length text, speech detection