Identifying Low-Resource Languages in Speech Recordings through Deep Learning

2022 International Conference on Software, Telecommunications and Computer Networks (SoftCOM)(2022)

引用 0|浏览0
暂无评分
摘要
The aim of this paper is to build a system that identifies a low resource language, like the Albanian language, in speech recordings. Our proposed system is based on the conversion of audio signals into spectrograms. We have built 2 models for the identification of spoken language based on spectrograms images using Artificial Neural Networks (ANN) and Convolutional Neural Networks (CNN). The dataset with spoken audio signals in the Albanian language, we have built manually. The results are taken based on two languages, but the system works if other languages are added. Both models have shown good capabilities to learn Albanian language patterns from spectrograms and the achieved accuracies are 85% (ANN) and 94% (CNN) respectively. We have studied different cases how spectrograms' color and size impact the performance of our models.
更多
查看译文
关键词
language identification in speeches,low-resource language,Albanian language,deep learning,Convolutional Neural Network,Artificial Neural Network
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要