当前位置: X-MOL 学术Int. J. Intell. Syst. › 论文详情
Our official English website, www.x-mol.net, welcomes your feedback! (Note: you will need to create a separate account there.)
Fine-grained semantic ethnic costume high-resolution image colorization with conditional GAN
International Journal of Intelligent Systems ( IF 5.0 ) Pub Date : 2021-11-12 , DOI: 10.1002/int.22726
Di Wu 1 , Jianhou Gan 1, 2 , Juxiang Zhou 1, 2 , Jun Wang 1, 2 , Wei Gao 1, 3
Affiliation  

Grayscale image colorization, especially for ethnic costume images, is highly challenging due to its rich and complex color features. The existing image colorization methods usually take the costume image as a whole in practical applications that lead to the ignorance of the semantic information of different parts of the costume. It is known that each part's color distribution of the ethnic costume is different. So, the color mapping of other parts is also diverse, which is determined by distinctive ethnic characteristics. This study introduces fine-grained level semantic information and proposes a high-resolution image colorization model for ethnic costumes targeting enhancement, inspired by semantic-level colorization. The semantic information of different regions of ethnic costumes has a significant impact on the performance of the coloring task. Using Pix2PixHD as the backbone network, we create a new network architecture that maintains color distribution correspondence and spatial consistency of costume images using fine-grained semantic information. In our network, we take the splice result of fine-grained semantic for ethnic costume and grayscale image as the conditions and then feed them into the generative adversarial networks. We also discuss and analyze the influences of the grayscale channel and fine-grained semantics on discriminator. Extensive experiments demonstrate that our method performs well compared with other state-of-the-art automatic colorization methods.

中文翻译:

带有条件GAN的细粒度语义民族服装高分辨率图像着色

灰度图像着色,尤其是民族服饰图像,由于其丰富而复杂的色彩特征,极具挑战性。现有的图像着色方法在实际应用中往往将服装图像视为一个整体,导致忽视了服装不同部位的语义信息。据了解,民族服饰各部位的色彩分布是不同的。所以,其他部分的颜色映射也是多种多样的,这是由鲜明的民族特征决定的。本研究引入细粒度级语义信息,并在语义级着色的启发下,提出了一种用于民族服饰靶向增强的高分辨率图像着色模型。民族服饰不同区域的语义信息对着色任务的表现有显着影响。使用 Pix2PixHD 作为骨干网络,我们创建了一个新的网络架构,使用细粒度的语义信息来保持服装图像的颜色分布对应和空间一致性。在我们的网络中,我们以民族服装和灰度图像的细粒度语义拼接结果为条件,然后将它们输入到生成对抗网络中。我们还讨论和分析了灰度通道和细粒度语义对判别器的影响。大量实验表明,与其他最先进的自动着色方法相比,我们的方法表现良好。我们创建了一个新的网络架构,使用细粒度的语义信息来保持服装图像的颜色分布对应和空间一致性。在我们的网络中,我们以民族服装和灰度图像的细粒度语义拼接结果为条件,然后将它们输入到生成对抗网络中。我们还讨论和分析了灰度通道和细粒度语义对判别器的影响。大量实验表明,与其他最先进的自动着色方法相比,我们的方法表现良好。我们创建了一个新的网络架构,使用细粒度的语义信息来保持服装图像的颜色分布对应和空间一致性。在我们的网络中,我们以民族服装和灰度图像的细粒度语义拼接结果为条件,然后将它们输入到生成对抗网络中。我们还讨论和分析了灰度通道和细粒度语义对判别器的影响。大量实验表明,与其他最先进的自动着色方法相比,我们的方法表现良好。我们以民族服装和灰度图像的细粒度语义拼接结果为条件,然后将它们输入到生成对抗网络中。我们还讨论和分析了灰度通道和细粒度语义对判别器的影响。大量实验表明,与其他最先进的自动着色方法相比,我们的方法表现良好。我们以民族服装和灰度图像的细粒度语义拼接结果为条件,然后将它们输入到生成对抗网络中。我们还讨论和分析了灰度通道和细粒度语义对判别器的影响。大量实验表明,与其他最先进的自动着色方法相比,我们的方法表现良好。
更新日期:2021-11-12
down
wechat
bug