On the Use of Sparce Time Relative Auditory Codes for Music

ISMIR 2013(2008)

引用 36|浏览94
暂无评分
摘要
Many if not most audio features used in MIR research are inspired by work done in speech recognition and are varia- tions on the spectrogram. Recently, much attention has been given to new representations of audio that are sparse and time-relative. These representations are efficient and able to avoid the time-frequency trade-off of a spectrogram. Yet lit- tle work with music streams has been conducted and these features remain mostly unused in the MIR community. In this paper we further explore the use of these features for musical signals. In particular, we investigate their use on realistic music examples (i.e. released commercial music) and their use as input features for supervised learning. Fur- thermore, we identify three specific issues related to these features which will need to be further addressed in order to obtain the full benefit for MIR applications.
更多
查看译文
关键词
time frequency,speech recognition,supervised learning
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要