Parameter-Free Audio Motif Discovery in Large Data Archives

ICDM(2013)

引用 24|浏览18
暂无评分
摘要
The discovery of repeated structure, i.e. motifs/near-duplicates, is often the first step in exploratory data mining. As such, the last decade has seen extensive research efforts in motif discovery algorithms for text, DNA, time series, protein sequences, graphs, images, and video. Surprisingly, there has been less attention devoted to finding repeated patterns in audio sequences, in spite of their ubiquity in science and entertainment. While there is significant work for the special case of motifs in music, virtually all this work makes many assumptions about data (often to the point of being genre specific) and thus these algorithms do not generalize to audio sequences containing animal vocalizations, industrial processes, or a host of other domains that we may wish to explore. In this work we introduce a novel technique for finding audio motifs. Our method does not require any domain-specific tuning and is essentially parameter-free. We demonstrate our algorithm on very diverse domains, finding audio motifs in laboratory mice vocalizations, wild animal sounds, music, and human speech. Our experiments demonstrate that our ideas are effective in discovering objectively correct or subjectively plausible motifs. Moreover, we show our novel probabilistic early abandoning approach is efficient, being two to three orders of magnitude faster than brute-force search, and thus faster than real-time for most problems.
更多
查看译文
关键词
large data archives,repeated structure discovery,audio motif finding technique,exploratory data mining,audio motif,music,spectrogram,parameter-free audio motif discovery,human speech,laboratory mice vocalizations,wild animal sounds,industrial processes,animal vocalizations,probabilistic early abandoning approach,audio sequence repeated pattern finding,anytime algorithm,data mining,domain-specific tuning,audio signal processing,probability,brute-force search
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要