Speech-Based Stress Classification Based On Modulation Spectral Features And Convolutional Neural Networks
2019 27TH EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO)(2019)
摘要
Interest in stress recognition has notably increased over the past few years. In this work, we focus on recognizing stress from speech. We propose the use of modulation spectral features as input to a convolutional neural network (CNN) for classifying stress. As benchmark, the OpenSMILE features used in the IN 1LRSPEECH 2010 Paralinguistic Challenge is adopted and evaluated with a support vector machine (SVM) and a deep neural network (DNN) based backends. Experiments are performed with the well-known Speech Under Simulated and Actual Stress (SUSAS) database. Performances are investigated considering 2-class, 4-class and 9-class classification problems. Results show that the proposed approach outperforms the benchmark on a challenging 9-class classification task with accuracy as high as 70% representing gains of roughly 18% over the benchmark.
更多查看译文
关键词
Stress detection, modulation spectrum, convolutional neural network
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络