Acoustic Analysis And Digital Signal Processing For The Assessment Of Voice Quality

BIOMEDICAL SIGNAL PROCESSING AND CONTROL(2021)

引用 5|浏览13
暂无评分
摘要
Purpose: This paper addresses the application of digital signal processing (DSP) techniques to the robust measurement of acoustical features of the human voice. It then addresses the use of regression based techniques for the estimation of grade, roughness, breathiness, asthenia and strain, from these acoustical features. These five properties of voice are the basis of the widely used 'GRBAS' characterisation of voice disorders. Method: A well-known cross-correlation technique has been enhanced for more reliably measuring the fundamental frequency of vowels which is crucial for the derivation of acoustic features such as the harmonic to-noise-ratio, jitter and shimmer. Regression techniques including K-Nearest Neighbour Regression and Multiple Linear Regression are employed for derivation of GRBAS properties. Results: Validation of the enhanced cross-correlation technique against well established published or commercially available techniques has been carried out by analysing synthetic sustained vowels. It was found that the enhanced method is capable of producing more reliable and robust measurements, in the context of our experiments, than the well-established Praat technique and Multi-Dimensional-Voice-Program (MDVP) software, especially in cases where the signal to noise ratio is low. Estimation of GRBAS components using our methods has been found to be in good agreement with traditional GRBAS scoring by speech and language therapists (SLTs). Conclusion: Voice analysis using DSP to extract acoustic features has the potential for objective and computerised GRBAS voice assessment. Such assessment can usefully augment GRBAS assessment as traditionally carried out subjectively by SLTs.
更多
查看译文
关键词
Praat, MDVP, Speech, Acoustic, HNR, SNR, Shimmer, Jitter, Fundamental frequency (f(o))
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要