Multi-view Subspace Learning with Diversity Enforced Skeleton Embedding

2017 IEEE Third International Conference on Multimedia Big Data (BigMM), 2017

Abstract
We consider the task of multi-view subspace learning, which integrates multi-view information to learn a unified representation for multimedia data. In real-world scenarios, we encounter views whose semantic levels differ widely. Neglecting this semantic inconsistency, existing graph-based methods directly convert heterogeneous information into local affinity matrices and then fuse them, which inevitably destroys the valuable high-semantic-level structure. To address semantic inconsistency, we propose Multi-view Subspace Skeleton Embedding (MSSE), in which the high-level semantic structure is explicitly taken as the skeleton of the learned subspace. Specifically, in cooperation with a set of anchor points, the high-level semantic structure is adopted as a set of semantic constraints that guide a multi-graph learning process based on RESCAL tensor factorization. To guarantee sufficient geometric coverage of the skeleton in the learned subspace, we enforce the diversity of the anchor points with a Determinantal Point Process (DPP) regularizer. Compared with traditional methods, the learned subspace has higher semantic consistency and is more robust to noisy views. Experiments on real-world image datasets demonstrate promising performance compared with state-of-the-art graph-based methods.
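As a rough illustration of how a DPP-style regularizer can reward diverse anchor points, the sketch below scores a set of anchors by the log-determinant of a similarity kernel over them. This is not the MSSE formulation: the abstract does not give the kernel or the exact regularizer, so the RBF kernel, the `gamma` bandwidth, the `eps` jitter, and the function names `rbf_kernel` and `dpp_diversity` are all assumptions chosen for illustration.

```python
import numpy as np


def rbf_kernel(anchors, gamma=1.0):
    """Pairwise RBF similarity between anchor points (assumed kernel choice)."""
    sq_norms = np.sum(anchors ** 2, axis=1)
    # Squared Euclidean distances via the expansion ||a - b||^2 = ||a||^2 + ||b||^2 - 2 a.b
    sq_dists = sq_norms[:, None] + sq_norms[None, :] - 2.0 * anchors @ anchors.T
    sq_dists = np.maximum(sq_dists, 0.0)  # clip tiny negatives from round-off
    return np.exp(-gamma * sq_dists)


def dpp_diversity(anchors, gamma=1.0, eps=1e-6):
    """Log-determinant of the anchor kernel: larger means more diverse anchors.

    Under a DPP, the probability of selecting a set is proportional to the
    determinant of its kernel submatrix, so near-duplicate anchors (nearly
    identical kernel rows) drive this score toward minus infinity.
    """
    kernel = rbf_kernel(anchors, gamma)
    kernel = kernel + eps * np.eye(kernel.shape[0])  # jitter for numerical stability
    _, logdet = np.linalg.slogdet(kernel)
    return logdet


# Hypothetical anchor sets in a 2-D subspace.
spread = np.array([[0.0, 0.0], [3.0, 0.0], [0.0, 3.0]])
clustered = np.array([[0.0, 0.0], [0.1, 0.0], [0.0, 0.1]])

print("spread   :", dpp_diversity(spread))
print("clustered:", dpp_diversity(clustered))
```

Under these assumptions, the well-separated anchors score higher than the tightly clustered ones. In a training objective, a term like this would typically be subtracted from the loss, so that maximizing it pushes the anchors apart.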
Keywords
multi-view subspace learning, diversity enforced skeleton embedding, multi-view information, multimedia data representation, graph-based methods, local affinity matrices, fusion process, semantic inconsistency, MSSE, multi-graph learning process, RESCAL tensor factorization, determinantal point process regularizer, DPP regularizer