Spherical Convolutional Recurrent Neural Network for Real-Time Sound Source Tracking.

IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP)(2022)

引用 3|浏览36
暂无评分
摘要
Neural networks have been widely applied in direction-of-arrival (DOA) estimation and source tracking systems. In this paper, we introduce a spherical convolutional recurrent neural network that utilizes Deepsphere, a graph-based spherical convolutional neural network, employing the steered response power with phase transform (SRP-PHAT) power maps as input features for real-time robust sound source DOA estimation and tracking applications. The proposed method achieves a performance similar to that of state-of-the-art 3D convolutional neural networks (3D-CNNs) method and reduces the processing time by 88.6%, the parameter count by 85.5%, and the training memory usage by 54.0% respectively. The shallow structure of proposed network demonstrates effectiveness and efficiency.
更多
查看译文
关键词
Microphone arrays,direction-of-arrival,SRP-PHAT,spherical CNNs,recurrent neural networks
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要