MNITJ-SEHSD: A Hindi Emotional Speech Database

2023 International Conference on Communication, Circuits, and Systems (IC3S)(2023)

引用 0|浏览9
暂无评分
摘要
This paper introduces an emotional speech database in Hindi to study the emotions embedded in speech signals for speech emotion recognition. The proposed database was created with the help of students from Malaviya National Institute of Technology, Jaipur, India. It is compiled employing five distinct emotions using some emotionless texts. Anger, Fear, Happy, Neutral, and Sad are the emotions presented in the database, considering these are the most common emotions in daily life. The speech corpus is developed at MNIT, Jaipur, named Malaviya National Institute of Technology Jaipur Simulated Emotion Hindi Speech Database (MNITJ-SEHSD). The built database’s emotion classification used various well-known audio features. Mel frequency cepstral coefficients (MFCCs), chromagram, and Mel-spectrogram extract spectral information. Prosody information is represented by pitch, the standard deviation of pitch, energy, duration and zero-crossing rate. The performance of emotion recognition using the aforementioned audio representations is around 57% and 90% for prosodic and spectral features, respectively. The database is also validated on our previously developed CNN model. The development, collection, processing, and evaluation of the proposed speech database are all described in this work. Subjective listening methods evaluate the overall quality of the emotions stored in the database.
更多
查看译文
关键词
MNITJSEHSD,Speech emotion recognition,Spectral features,Emotional speech database,MFCC,CNN
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要