Hybrid approach for speaker recognition based on formant and pitch extraction

2023 International Conference on Cyberworlds (CW), 2023

Abstract
The human voice is a convenient data source for identifying people in many applications. With the growing need for security in public places, voice biometrics is an attractive option because voice recordings are easy to capture. This paper gives a brief overview of existing speaker recognition approaches and then presents a novel approach for recognizing speakers in degraded smart-home conditions. The proposed approach comprises a pre-processing phase, a feature extraction phase, and a classification phase. The feature extraction phase consists of formant extraction, which obtains the spectral energy maxima of the speech audio; dynamic time warping (DTW), which finds an optimal alignment between two given temporal sequences under given constraints; and a refinement process that improves the output of the DTW system. To validate the approach, experiments are carried out on a database of 1,248 samples. The approach achieves 94.5% accuracy, which compares well with the state of the art.
Keywords
Formant, pitch, DTW, speaker recognition, degraded conditions, smart home
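
The DTW step mentioned in the abstract can be illustrated with a minimal sketch. This is the textbook dynamic-programming formulation, not the authors' implementation: the abstract does not specify the features, the local distance, or the constraints used. The function name `dtw_distance`, the absolute-difference local cost, and the optional band parameter `window` (a Sakoe-Chiba-style constraint) are all assumptions made for illustration.

```python
def dtw_distance(a, b, window=None):
    """Cumulative DTW cost between two 1-D numeric sequences.

    `window` (hypothetical parameter) limits how far the alignment may
    stray from the diagonal; None means no constraint.
    """
    n, m = len(a), len(b)
    if window is None:
        window = max(n, m)
    # The band must be at least |n - m| wide, or no path reaches the end.
    window = max(window, abs(n - m))

    inf = float("inf")
    # cost[i][j] = best cumulative cost aligning a[:i] with b[:j].
    cost = [[inf] * (m + 1) for _ in range(n + 1)]
    cost[0][0] = 0.0

    for i in range(1, n + 1):
        lo = max(1, i - window)
        hi = min(m, i + window)
        for j in range(lo, hi + 1):
            d = abs(a[i - 1] - b[j - 1])  # assumed local distance
            cost[i][j] = d + min(
                cost[i - 1][j],      # a[i-1] repeats against b[j-1]
                cost[i][j - 1],      # b[j-1] repeats against a[i-1]
                cost[i - 1][j - 1],  # both sequences advance
            )
    return cost[n][m]
```

For instance, `dtw_distance([1, 2, 3], [1, 2, 2, 3])` returns `0.0`, because the repeated `2` in the second sequence can be absorbed by warping. In a speaker recognition setting the sequences would more plausibly be per-frame feature tracks (such as formant or pitch contours) with a vector distance in place of the absolute difference.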