Unsupervised multi-domain multimodal image-to-image translation with explicit domain-constrained disentanglement.,Neural Networks

当前位置： X-MOL 学术 › Neural Netw. › 论文详情

Our official English website, www.x-mol.net, welcomes your feedback! (Note: you will need to create a separate account there.)

Unsupervised multi-domain multimodal image-to-image translation with explicit domain-constrained disentanglement.
Neural Networks ( IF 7.8 ) Pub Date : 2020-07-25 , DOI: 10.1016/j.neunet.2020.07.023
Weihao Xia ₁ , Yujiu Yang ₂ , Jing-Hao Xue ₃

Affiliation

Image-to-image translation has drawn great attention during the past few years. It aims to translate an image in one domain to a target image in another domain. However, three big challenges remain in image-to-image translation: (1) the lack of large amounts of aligned training pairs for various tasks; (2) the ambiguity of multiple possible outputs from a single input image; and (3) the lack of simultaneous training for multi-domain translation with a single network. Therefore in this paper, we propose a unified framework for learning to generate diverse outputs using unpaired training data and allow for simultaneous multi-domain translation via a single model. Moreover, we also observed from experiments that the implicit disentanglement of content and style could lead to undesirable results. Thus we investigate how to extract domain-level signal as explicit supervision so as to achieve better image-to-image translation. Extensive experiments show that the proposed method outperforms or is comparable with the state-of-the-art methods for various applications.

中文翻译：

具有显式域约束解缠结的无监督多域多模态图像到图像转换。

图像到图像的翻译在过去几年中引起了极大的关注。它旨在将一个域中的图像转换为另一域中的目标图像。但是，图像到图像的翻译仍然面临三大挑战：（1）缺乏针对各种任务的大量对齐培训对；（2）来自单个输入图像的多个可能输出的歧义；（3）缺乏同时使用单个网络进行多域翻译的培训。因此，在本文中，我们提出了一个统一的框架，用于学习使用不成对的训练数据生成不同的输出，并允许通过单个模型同时进行多域翻译。此外，我们还从实验中观察到内容和样式的隐式分离可能导致不良结果。因此，我们研究了如何提取域级信号作为显式监督，以实现更好的图像到图像翻译。大量实验表明，所提出的方法优于或可与各种应用的最新方法相媲美。

更新日期：2020-08-03

点击分享查看原文

点击收藏

公开下载

阅读更多本刊最新论文本刊介绍/投稿指南

全部期刊列表>>