Subjective quality ratings and physiological correlates of synthesized speech.

QoMEX(2013)

引用 9|浏览24
暂无评分
摘要
Evaluating the quality of text-to-speech systems (TTS) is usually achieved by subjective methods where participants have to rate the stimulus on multiple scales, such as naturalness, prosody, and overall quality. In the present study, we aim towards evaluating TTS system quality using not only conventional subjective methods, but also via a neurophysiological approach based on obtaining neural correlates of TTS quality perception using electroencephalography (EEG). Such an approach allows for better insight into the perception processes involved during the human quality judgement process, and may open doors to innovative subjective testing methods and/or objective measurement tools. In our experiments, we have shown an inverse relationship between TTS speech quality and the amplitude of an EEG evoked response called the 'P300,' suggesting an increase in cognitive load as TTS quality decreases, likely due to reduction in speech intelligibility.
更多
查看译文
关键词
electroencephalography,medical signal processing,speech intelligibility,speech synthesis,EEG,P300,TTS evaluation,cognitive load,electroencephalography,human quality judgement process,innovative subjective testing methods,neurophysiological approach,objective measurement tools,perception processes,physiological correlates,quality of text-to-speech systems,speech intelligibility,speech synthesis,subjective quality ratings,Audio,Electroencephalography,Mean Opinion Score,Quality of Experience,Text-to-Speech
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要