Grassmannian learning mutual subspace method for image set recognition

Neurocomputing (2023)

Abstract
This paper addresses the problem of object recognition given a set of images as input (e.g., multiple camera sources and video frames). Convolutional neural network (CNN)-based frameworks do not exploit these sets effectively: they process each pattern as observed and do not capture the underlying feature distribution, because they do not consider the variance of the images in the set. To address this issue, we propose the Grassmannian learning mutual subspace method (G-LMSM), a neural network (NN) layer embedded on top of CNNs that can process image sets more effectively and can be trained in an end-to-end manner. The image set is first represented by a low-dimensional input subspace, and this input subspace is then matched with dictionary subspaces by a similarity of their canonical angles, an interpretable and easy-to-compute metric. The key idea of G-LMSM is that the dictionary subspaces are learned as points on the Grassmann manifold, optimized with Riemannian stochastic gradient descent. This learning is stable, efficient, and theoretically well-grounded. We demonstrate the effectiveness of our proposed method on hand shape recognition, face identification, and facial emotion recognition. © 2022 Elsevier B.V. All rights reserved.
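The matching and learning steps described in the abstract can be illustrated with a minimal NumPy sketch. It is not the authors' implementation. It assumes the similarity is the mean squared cosine of the canonical angles and that the Riemannian update uses a tangent-space projection followed by a QR retraction; the abstract specifies neither choice. The function names `subspace_basis`, `canonical_angle_similarity`, and `riemannian_sgd_step` are hypothetical.

```python
import numpy as np


def subspace_basis(features, dim):
    """Orthonormal basis (d x dim) for a set of feature vectors.

    features has shape (n_images, d). The basis is taken from the leading
    left singular vectors of the d x n data matrix (an assumed construction).
    """
    u, _, _ = np.linalg.svd(features.T, full_matrices=False)
    return u[:, :dim]


def canonical_angle_similarity(a, b):
    """Mean squared cosine of the canonical angles between span(a) and span(b).

    The singular values of a.T @ b are the cosines of the canonical angles
    when a and b have orthonormal columns.
    """
    cosines = np.linalg.svd(a.T @ b, compute_uv=False)
    cosines = np.clip(cosines, 0.0, 1.0)  # guard against rounding above 1
    return float(np.mean(cosines ** 2))


def riemannian_sgd_step(w, euclid_grad, lr):
    """One gradient-descent step for a point w on the Grassmann manifold.

    The Euclidean gradient is projected onto the tangent space at w, and the
    updated matrix is mapped back to an orthonormal basis with a QR
    retraction (an assumed choice of retraction).
    """
    riem_grad = euclid_grad - w @ (w.T @ euclid_grad)
    q, _ = np.linalg.qr(w - lr * riem_grad)
    return q


# Toy usage: pull a dictionary subspace toward an input subspace.
rng = np.random.default_rng(0)
input_basis = subspace_basis(rng.normal(size=(20, 64)), dim=5)
dict_basis = subspace_basis(rng.normal(size=(20, 64)), dim=5)

sim_before = canonical_angle_similarity(input_basis, dict_basis)
# Gradient of the loss -similarity with respect to the dictionary basis.
loss_grad = -(2.0 / 5) * input_basis @ (input_basis.T @ dict_basis)
dict_basis = riemannian_sgd_step(dict_basis, loss_grad, lr=0.1)
sim_after = canonical_angle_similarity(input_basis, dict_basis)
print(sim_before, sim_after)
```

In this toy setting the dictionary basis stays orthonormal after the update, which is the practical reason for optimizing on the manifold rather than updating the matrix entries freely.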
Keywords
Grassmannian learning mutual subspace method, Learning subspace methods, Subspace learning, Image recognition, Deep neural networks, Manifold optimization