PoT-GAN: Pose Transform GAN for Person Image Synthesis,IEEE Transactions on Image Processing

当前位置： X-MOL 学术 › IEEE Trans. Image Process. › 论文详情

Our official English website, www.x-mol.net, welcomes your feedback! (Note: you will need to create a separate account there.)

PoT-GAN: Pose Transform GAN for Person Image Synthesis
IEEE Transactions on Image Processing ( IF 10.6 ) Pub Date : 2021-08-27 , DOI: 10.1109/tip.2021.3104183
Tianjiao Li , Wei Zhang , Ran Song , Zhiheng Li , Jun Liu , Xiaolei Li , Shijian Lu

Pose-based person image synthesis aims to generate a new image containing a person with a target pose conditioned on a source image containing a person with a specified pose. It is challenging as the target pose is arbitrary and often significantly differs from the specified source pose, which leads to large appearance discrepancy between the source and the target images. This paper presents the Pose Transform Generative Adversarial Network (PoT-GAN) for person image synthesis where the generator explicitly learns the transform between the two poses by manipulating the corresponding multi-scale feature maps. By incorporating the learned pose transform information into the multi-scale feature maps of the source image in a GAN architecture, our method reliably transfers the appearance of the person in the source image to the target pose with no need for any hard-coded spatial information depicting the change of pose. According to both qualitative and quantitative results, the proposed PoT-GAN demonstrates a state-of-the-art performance on three publicly available datasets for person image synthesis.

中文翻译：

PoT-GAN：用于人物图像合成的姿势变换 GAN

基于姿势的人物图像合成旨在以包含具有指定姿势的人物的源图像为条件，生成包含具有目标姿势的人物的新图像。这是具有挑战性的，因为目标姿态是任意的，并且通常与指定的源姿态显着不同，这导致源图像和目标图像之间的外观差异很大。本文介绍了用于人物图像合成的姿势变换生成对抗网络 (PoT-GAN)，其中生成器通过操纵相应的多尺度特征图显式地学习两个姿势之间的变换。通过将学习到的姿态变换信息合并到 GAN 架构中源图像的多尺度特征图中，我们的方法可靠地将源图像中人物的外观转移到目标姿势，而无需任何描述姿势变化的硬编码空间信息。根据定性和定量结果，所提出的 PoT-GAN 在用于人物图像合成的三个公开可用数据集上展示了最先进的性能。

更新日期：2021-09-10

点击分享查看原文

点击收藏

阅读更多本刊最新论文本刊介绍/投稿指南

全部期刊列表>>