An Analysis of Audio Classification Techniques using Deep Learning Architectures

Mohammed Safwat Imran, Afia Fahmida Rahman,Sifat Tanvir, Hamim Hassan Kadir,Junaid Iqbal,Moin Mostakim

2021 6th International Conference on Inventive Computation Technologies (ICICT)(2021)

引用 6|浏览1
暂无评分
摘要
Failure to classify audio data with high efficiency causes major setbacks in audio processing, voice recognition, and noise cancellation. To find the best possible neural network models for audio classification, this article shows the steps in the experiments performed in the newly designed CF Model and CF Clean Model in both CNN and RNN and compares the results with some existing models such as DCNN and PiczakCNN. To get a clear view of the consistency of the results, three different datasets have been experimented on, which are UrbanSound8k, FSDKaggle2018 and ESC-50. It also performs best in terms of the training and testing dataset based on accuracy and loss percentage. Finally, this article shows influences about the envelope function, normalization, segmentation, regularization techniques, and dropout layers in the overall progress.
更多
查看译文
关键词
Audio Classification,Neural Network,Deep Learning,Convolutional Neural Network,Recurrent Neural Network
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要