Modeling Hierarchical Uncertainty for Multimodal Emotion Recognition in Conversation

IEEE TRANSACTIONS ON CYBERNETICS(2024)

引用 2|浏览163
暂无评分
摘要
Approximating the uncertainty of an emotional AI agent is crucial for improving the reliability of such agents and facilitating human-in-the-loop solutions, especially in critical scenarios. However, none of the existing systems for emotion recognition in conversation (ERC) has attempted to estimate the uncertainty of their predictions. In this article, we present HU-Dialogue, which models hierarchical uncertainty for the ERC task. We perturb contextual attention weight values with source-adaptive noises within each modality, as a regularization scheme to model context-level uncertainty and adapt the Bayesian deep learning method to the capsule-based prediction layer to model modality-level uncertainty. Furthermore, a weight-sharing triplet structure with conditional layer normalization is introduced to detect both invariance and equivariance among modalities for ERC. We provide a detailed empirical analysis for extensive experiments, which shows that our model outperforms previous state-of-the-art methods on three popular multimodal ERC datasets.
更多
查看译文
关键词
Uncertainty,Emotion recognition,Predictive models,Context modeling,Reliability,Bayes methods,Adaptation models,Bayesian deep learning,capsule network (CapsNet),conditional layer normalization (CLN),emotion recognition in conversation (ERC),uncertainty
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要