LBVCNN: Local Binary Volume Convolutional Neural Network for Facial Expression Recognition From Image Sequences

2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops (CVPRW)(2019)

引用 42|浏览0
暂无评分
摘要
Recognizing facial expressions is one of the central problems in computer vision. Temporal image sequences have useful spatio-temporal features for recognizing expressions. In this paper, we propose a new 3D Convolution Neural Network (CNN) that can be trained end-to-end for facial expression recognition on temporal image sequences without using facial landmarks. More specifically, a novel 3D convolutional layer that we call Local Binary Volume (LBV) layer is proposed. The LBV layer, when used with our newly proposed LBVCNN network, achieve comparable results compared to state-of-the-art landmark-based or without landmark-based models on image sequences from CK+, Oulu-CASIA, and UNBC McMaster shoulder pain datasets. Furthermore, our LBV layer reduces the number of trainable parameters by a significant amount when compared to a conventional 3D convolutional layer. As a matter of fact, when compared to a 3 × 3 × 3 conventional 3D convolutional layer, the LBV layer uses 27 times less trainable parameters.
更多
查看译文
关键词
LBV layer,LBVCNN,local binary volume convolutional neural network,facial expression recognition,facial expressions,temporal image sequences,spatio-temporal features,3D convolution neural network,facial landmarks,landmark-based models,3D convolutional layer,local binary volume layer,expression recognition,trainable parameters
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要