Multimodal fusion for indoor sound source localization

Pattern Recognition (2021)

Abstract
• We propose a novel solution that fuses visual and acoustic models to accurately estimate the location of a sound source.
• We develop an HMM-based method for separating the acoustic transfer function (ATF) to describe clean speech sound.
• We propose a new Fourier-domain method for fast implementation of the HOG-type polar feature descriptor.
• The proposed method is rotation-invariant and preserves the discriminative power of the extracted features.
Keywords
Sound source localization, Acoustic transfer function, HMM, Polar HOG, SVM