Representation learning strategies to model pathological speech: Effect of multiple spectral resolutions

Computer Speech and Language (2024)

Abstract
This paper considers a representation learning strategy to model speech signals from patients with Parkinson's disease, with the goals of predicting the presence of the disease and evaluating the level of degradation of a patient's speech. In particular, we propose a novel fusion strategy, called the multi-spectral autoencoder, that combines wideband and narrowband spectral resolutions through autoencoder-based representation learning. The proposed model classifies speech from Parkinson's disease patients with an accuracy of up to 97% and assesses their dysarthria severity with a Spearman correlation of up to 0.79. These results outperform those reported in the literature for the same problem on the same corpus.
Keywords
Parkinson's disease, Representation learning, Dysarthria
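
The abstract refers to wideband and narrowband spectral resolutions without giving analysis settings. As a rough illustration only, the sketch below computes two log-magnitude spectrograms of the same signal with a short and a long analysis window, which is the usual way the two resolutions are obtained in speech analysis. The 5 ms and 30 ms window lengths, the 2 ms hop, the 16 kHz sampling rate, and the helper name `spectrogram` are assumptions made for this example and are not taken from the paper; the autoencoder fusion itself is not shown.

```python
import numpy as np


def spectrogram(signal, sample_rate, window_ms, hop_ms=2.0):
    """Log-magnitude STFT using a Hann window of `window_ms` milliseconds.

    Hypothetical helper for illustration; not the paper's feature extractor.
    """
    win_len = int(round(sample_rate * window_ms / 1000.0))
    hop_len = int(round(sample_rate * hop_ms / 1000.0))
    window = np.hanning(win_len)
    n_frames = 1 + (len(signal) - win_len) // hop_len
    # Slice the signal into overlapping, windowed frames.
    frames = np.stack([
        signal[i * hop_len : i * hop_len + win_len] * window
        for i in range(n_frames)
    ])
    magnitude = np.abs(np.fft.rfft(frames, axis=1))
    # Small constant avoids log(0); output shape is (n_frames, win_len // 2 + 1).
    return np.log(magnitude + 1e-8)


# Synthetic stand-in for a speech recording: one second of a 200 Hz tone.
sample_rate = 16000  # assumed sampling rate
t = np.arange(sample_rate) / sample_rate
signal = np.sin(2 * np.pi * 200 * t)

# Short window: fine time resolution, coarse frequency resolution (wideband).
wideband = spectrogram(signal, sample_rate, window_ms=5.0)
# Long window: fine frequency resolution, coarse time resolution (narrowband).
narrowband = spectrogram(signal, sample_rate, window_ms=30.0)

print(wideband.shape, narrowband.shape)
```

With these assumed settings the wideband view has far fewer frequency bins per frame than the narrowband view, so a model that fuses both sees the same utterance at two different time-frequency trade-offs.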