Detection of Emotion Categories' Change in Speeches

ICAART: PROCEEDINGS OF THE 14TH INTERNATIONAL CONFERENCE ON AGENTS AND ARTIFICIAL INTELLIGENCE - VOL 3(2022)

引用 2|浏览1
暂无评分
摘要
In the past few years, a lot of research has been conducted to predict emotions from speech. The majority of the studies aim to recognize emotions from pre-segmented data with one global label (category). Despite the fact that emotional states are constantly changing and evolving across time, the emotion change has gotten less attention. Mainly, the exiting studies focus either on predicting arousal-valence values or on detecting the instant of the emotion change. To the best of the authors knowledge, this is the first paper that addresses the emotion category change (i.e., predicts the classes existing in a signal such as angry, happy, sad etc.). As a result of that, we propose a model based on the Connectionist Temporal Classification (CTC) loss, along with new evaluation metrics.
更多
查看译文
关键词
Connectionist Temporal Classification, Emotion Recognition, Neural Networks, Spectrograms
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要