Sound Event Localization and Detection Based on Iterative Separation in Embedding Space

Zeyu Yuan, Donghang Wu, Xihong Wu, Tianshu Qu

2023 6th International Conference on Information Communication and Signal Processing (ICICSP) (2023)

Abstract
Current Sound Event Localization and Detection (SELD) methods mainly adopt the output format of SELDnet, in which the Direction Of Arrival (DOA) is predicted for each category rather than for each event; these methods therefore cannot handle simultaneous occurrences of the same type of sound event in different directions. Although track-wise methods can detect such homogeneous overlap, they still require the maximum number of overlapping sound sources to be known in advance. To solve these problems, we propose Sep-SELD, a SELD method based on iterative separation in embedding space. Localization and detection are performed on each single event, instead of on all events at the same time, by introducing separation in the embedding space. To deal with the inconsistent and potentially unknown number of active events in different frames, the separation is performed iteratively. We conduct experiments on the DCASE 2020 Task 3 dataset, and the results show that the proposed method performs comparably to track-wise methods and has the flexibility to handle overlapping events without retraining from scratch.
Keywords
Sound event localization and detection, iterative separation, embedding space