Image pseudo tag generation with Deep Boltzmann machine anc topic-concept similarity map.

Satoru Ishikawa,Jorma Laaksonen,Juha Karhunen

IJCNN（2017）

引用 22|浏览19

暂无评分

摘要

General purpose search engines are used for searching not only plain text but also multimedia information. In multimodal search, it is common to use multiple queries to find the demanded information in the different media modalities. In most cases, however, it is hard to prepare such multimodal search queries. In addition, the semantic connection between the individual modalities is often weak or totally lacking in such multimodal search. Hence, single modality searching makes it hard to find the searched for information in the multimodal domain. In this paper we improve the Deep Boltzmann Machine applied to multimodal search by using GoogLeNet deep convolutional neural network and semantic concept features. We also propose a supervised method to produce a similarity map between hidden topics in text documents and the visual concepts in corresponding images, and an unsupervised method which uses the hidden topics in the documents as pseudo labels. The model can be used to infer and generate pseudo tags for untagged input query images in order to complement an image-only query to a multimodal one. The classification results with pseudo tag inputs show in our experiments improvement compared to the original tag inputs.

查看译文

关键词

image pseudo tag generation,deep Boltzmann machine,topic-concept similarity map,general purpose search engines,multimedia information,multimodal search queries,media modalities,GoogLeNet deep convolutional neural network,semantic concept features,supervised method,text documents,unsupervised method,image-only query,untagged input query images

AI 理解论文

溯源树

样例

生成溯源树，研究论文发展脉络

Chat Paper

正在生成论文摘要