ManiGAN: Text-Guided Image Manipulation

Li Bowen,Qi Xiaojuan,Lukasiewicz Thomas,Torr Philip H. S.

CVPR（2020）

引用 290|浏览454

暂无评分

摘要

The goal of our paper is to semantically edit parts of an image to match a given text that describes desired attributes (e.g., texture, colour, and background), while preserving other contents that are irrelevant to the text. To achieve this, we propose a novel generative adversarial network (ManiGAN), which contains two key components: text-image affine combination module (ACM) and detail correction module (DCM). The ACM selects image regions relevant to the given text and then correlates the regions with corresponding semantic words for effective manipulation. Meanwhile, it encodes original image features to help reconstruct text-irrelevant contents. The DCM rectifies mismatched attributes and completes missing contents of the synthetic image. Finally, we suggest a new metric for evaluating image manipulation results, in terms of both the generation of new attributes and the reconstruction of text-irrelevant contents. Extensive experiments on the CUB and COCO datasets demonstrate the superior performance of the proposed method.

查看译文

关键词

image region selection,affine combination module,image matching,text guided image manipulation,generative adversarial network,text irrelevant contents,image features,semantic words,detail correction module,ManiGAN

AI 理解论文

溯源树

样例

生成溯源树，研究论文发展脉络

Chat Paper

正在生成论文摘要