Refining Valence-Arousal Estimation with Dual-Stream Label Density Smoothing.

Hongxia Xie, I-Hsuan Li,Ling Lo,Hong-Han Shuai,Wen-Huang Cheng

IEEE International Conference on Consumer Electronics（2024）

引用 0|浏览1

暂无评分

摘要

Emotion recognition through facial expressions remains a long-standing research pursuit, yet the challenges persist, particularly in dynamic real-world scenarios. In-the-wild datasets are hampered by limited emotion annotations due to resource constraints, hindering multi-task methodology advancements. Recent years have witnessed a surge of approaches addressing the valence-arousal problem. However, data imbalance, especially in valence-arousal annotation, persists. This work proposes a novel two-stream valence-arousal estimation network, inspired by MIMAMO Net, leveraging spatial and temporal learning to enhance emotion recognition. Label Density Smoothing (LDS) is introduced to counter skewed distributions. Experimental results showcase the approach’s efficacy, achieving a Concordance Correlation Coefficient (CCC) of 0.591 for valence and 0.617 for arousal on the Aff-Wild2 validation set. This work contributes to the advancement of valence-arousal modeling in facial expression recognition.

查看译文

关键词

Labeling Density,Valence-arousal Estimation,Validation Set,Facial Expressions,Face Recognition,Emotion Recognition,Imbalanced Data,Facial Expression Recognition,Concordance Correlation Coefficient,Generative Adversarial Networks,Label Distribution,Action Units,Empirical Density,Valence Values,Facial Action Units,Arousal Values,Two-stream Network,YouTube

AI 理解论文

溯源树

样例

生成溯源树，研究论文发展脉络

Chat Paper

正在生成论文摘要