Boundary-aware GAN for multiple overlapping objects in layout-to-image generation

Multimedia Systems(2024)

引用 0|浏览3
暂无评分
摘要
Existing layout-to-image generation methods based on generative adversarial networks (GANs) have made great progress. However, a common problem that has not been effectively solved is region missing; that is, as the number of target objects in an image increases, multiple overlapping objects cannot be accurately generated at the boundary. The generation of the overlapped object parts directly determines the quality difference between the generated image and the real image. To solve this problem, we propose the Boundary-Aware GAN model (BAGAN). We adopt a strategy of separately generating the foreground and background. In the foreground generation process, BAGAN first uses attention regularization to accurately locate the boundaries of overlapping objects and then generates a clear foreground image by sharing the parameters of two transfer networks. During background generation, BAGAN uses conditional normalization to improve the quality of the generated background image. To better judge the overlap generation quality, we design the BoundaryFID evaluation metric. We validate and test our model on the COCO-Stuff and Visual Genome datasets. The experimental results show that according to the general evaluation metrics and BoundaryFID, our model can achieve the best results compared with the state-of-the-art methods.
更多
查看译文
关键词
Layout-to-image generation,Generative adversarial network,Attention regularization,Transformation network,Conditional normalization
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要