Dressing in the Wild by Watching Dance Videos

IEEE Conference on Computer Vision and Pattern Recognition (2022)

Abstract
While significant progress has been made in garment transfer, one of the most applicable directions of human-centric image generation, existing works overlook in-the-wild imagery, presenting severe garment-person misalignment as well as noticeable degradation in fine texture details. This paper therefore attends to virtual try-on in real-world scenes and brings essential improvements in authenticity and naturalness, especially for loose garments (e.g., skirts, formal dresses), challenging poses (e.g., crossed arms, bent legs), and cluttered backgrounds. Specifically, we find that the pixel flow excels at handling loose garments whereas the vertex flow is preferred for hard poses, and by combining their advantages we propose a novel generative network called wFlow that can effectively push garment transfer up to the in-the-wild context. Moreover, former approaches require paired images for training. Instead, we cut down the laboriousness by working on a newly constructed large-scale video dataset named Dance50k with self-supervised cross-frame training and an online cycle optimization. The proposed Dance50k can boost real-world virtual dressing by covering a wide variety of garments under dancing poses. Extensive experiments demonstrate the superiority of our wFlow in generating realistic garment transfer results for in-the-wild images without resorting to expensive paired datasets.

Xiaodan Liang is the corresponding author. The project page of wFlow is https://awesome-wflow.github.io.
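To make the flow-fusion idea concrete, below is a minimal, hypothetical sketch (not the authors' wFlow implementation) of warping a garment image with two complementary dense flow fields, a 2D pixel flow and a vertex flow (which would be rasterized from a 3D body mesh), then blending the two warps with a learned soft mask. All tensor names, shapes, and the mask-based fusion scheme are illustrative assumptions.

```python
# Minimal sketch of fusing a pixel-flow warp and a vertex-flow warp.
# The random inputs stand in for outputs of learned networks; only the
# warping/fusion mechanics are shown here.
import torch
import torch.nn.functional as F

def warp(image, flow):
    """Backward-warp `image` (N,C,H,W) by a dense flow field (N,2,H,W) in pixels."""
    n, _, h, w = image.shape
    # Base sampling grid in pixel coordinates (channel 0 = x, channel 1 = y).
    ys, xs = torch.meshgrid(torch.arange(h), torch.arange(w), indexing="ij")
    base = torch.stack((xs, ys), dim=0).float().to(image.device)   # (2,H,W)
    coords = base.unsqueeze(0) + flow                              # (N,2,H,W)
    # Normalize coordinates to [-1, 1] for grid_sample.
    coords_x = 2.0 * coords[:, 0] / (w - 1) - 1.0
    coords_y = 2.0 * coords[:, 1] / (h - 1) - 1.0
    grid = torch.stack((coords_x, coords_y), dim=-1)               # (N,H,W,2)
    return F.grid_sample(image, grid, align_corners=True)

# Toy inputs: a garment image and two (here random) flow fields.
garment = torch.rand(1, 3, 256, 192)
pixel_flow = torch.randn(1, 2, 256, 192)   # would come from a 2D flow network
vertex_flow = torch.randn(1, 2, 256, 192)  # would come from a rasterized body mesh
mask = torch.sigmoid(torch.randn(1, 1, 256, 192))  # learned fusion mask

# Soft fusion: pixel flow handles loose fabric, vertex flow handles hard poses.
warped = mask * warp(garment, pixel_flow) + (1 - mask) * warp(garment, vertex_flow)
print(warped.shape)  # torch.Size([1, 3, 256, 192])
```

In this sketch the mask lets the network favor the pixel-flow warp in regions of loose fabric and the vertex-flow warp where body geometry dominates, which mirrors the complementary strengths the abstract describes.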
Keywords
Image and video synthesis and generation, Vision applications and systems