Deep Segment-Attentive Network for Altered-Engine Recognition.

Chong-Xin Gan,Man-Wai Mak,Ivan Wang Hei Ho, Steven Wing-Chi Lee

2023 IEEE 26th International Conference on Intelligent Transportation Systems (ITSC)(2023)

引用 0|浏览5
暂无评分
摘要
Altered engine recognition is of significant research and application prospects since it is a sub-task of environmental sound classification and helps detect altered vehicles for law enforcement. In this work, we propose a simple but effective framework based on a sound embedding network with an attention mechanism to identify noise emitted from vehicles with altered engines. Real-world vehicle sound data were collected at a number of streets with different amounts of traffic in Hong Kong. In particular, we developed a proprietary dataset consisting of the environmental and engine sounds through manual segmentation and annotation, and we found that the attention mechanism can help emphasize the informative segments in the frame-level embeddings outputted by the embedding network. We also demonstrate the effectiveness of the proposed attention mechanism on the ESC-50 and UrbanSound8K datasets. The experimental results show that our method achieves remarkable performance on UrbanSound8K and outperforms the baseline on ESC-50 by more than 10%.
更多
查看译文
关键词
Attention Mechanism,Sound Detection,Network Embedding,Neural Network,Training Set,Convolutional Neural Network,Deep Neural Network,Convolutional Layers,Audio Recordings,Air Conditioning,Recurrent Neural Network,Attention Network,Mean Vector,Environmental Noise,Max-pooling Layer,Number Of Filters,Sound Effects,Attention Scores,SVM Classifier,Audio Clips,Statistical Pooling,Dog Barking,Visual Channels,Audio Files,Feature Maps,Representation Of Sounds,Large Amount Of Data,Self-attention Network
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要