Multimodal visual dictionary learning via heterogeneous latent semantic sparse coding

Proceedings of SPIE (2014)

Abstract
Visual dictionary learning, a crucial task in image representation, has gained increasing attention. Sparse coding in particular is widely used for this task because of its intrinsic advantages. In this paper, we propose a novel heterogeneous latent semantic sparse coding model. The central idea is to bridge heterogeneous modalities by capturing their common sparse latent semantic structure, so that the learned visual dictionary can describe both the visual and textual properties of the training data. Experiments on image categorization and retrieval tasks demonstrate that our model outperforms several recent methods such as K-means and Sparse Coding.
Keywords
Information retrieval, Image representation, Multimodal visual dictionary learning, Bag-of-visual-words