UDBNET: Unsupervised Document Binarization Network via Adversarial Game,arXiv - CS - Computer Vision and Pattern Recognition

当前位置： X-MOL 学术 › arXiv.cs.CV › 论文详情

Our official English website, www.x-mol.net, welcomes your feedback! (Note: you will need to create a separate account there.)

UDBNET: Unsupervised Document Binarization Network via Adversarial Game
arXiv - CS - Computer Vision and Pattern Recognition Pub Date : 2020-07-14 , DOI: arxiv-2007.07075
Amandeep Kumar, Shuvozit Ghose, Pinaki Nath Chowdhury, Partha Pratim Roy, Umapada Pal

Degraded document image binarization is one of the most challenging tasks in the domain of document image analysis. In this paper, we present a novel approach towards document image binarization by introducing three-player min-max adversarial game. We train the network in an unsupervised setup by assuming that we do not have any paired-training data. In our approach, an Adversarial Texture Augmentation Network (ATANet) first superimposes the texture of a degraded reference image over a clean image. Later, the clean image along with its generated degraded version constitute the pseudo paired-data which is used to train the Unsupervised Document Binarization Network (UDBNet). Following this approach, we have enlarged the document binarization datasets as it generates multiple images having same content feature but different textual feature. These generated noisy images are then fed into the UDBNet to get back the clean version. The joint discriminator which is the third-player of our three-player min-max adversarial game tries to couple both the ATANet and UDBNet. The three-player min-max adversarial game stops, when the distributions modelled by the ATANet and the UDBNet align to the same joint distribution over time. Thus, the joint discriminator enforces the UDBNet to perform better on real degraded image. The experimental results indicate the superior performance of the proposed model over existing state-of-the-art algorithm on widely used DIBCO datasets. The source code of the proposed system is publicly available at https://github.com/VIROBO-15/UDBNET.

中文翻译：

UDBNET：通过对抗博弈的无监督文档二值化网络

降级文档图像二值化是文档图像分析领域最具挑战性的任务之一。在本文中，我们通过引入三人最小最大对抗博弈来提出一种新的文档图像二值化方法。通过假设我们没有任何配对训练数据，我们在无监督设置中训练网络。在我们的方法中，对抗性纹理增强网络 (ATANet) 首先将退化的参考图像的纹理叠加在干净的图像上。后来，干净的图像及其生成的降级版本构成了用于训练无监督文档二值化网络 (UDBNet) 的伪配对数据。按照这种方法，我们扩大了文档二值化数据集，因为它生成了多个具有相同内容特征但不同文本特征的图像。然后将这些生成的噪声图像输入 UDBNet 以获取干净的版本。联合鉴别器是我们的三人最小-最大对抗游戏的第三个玩家，它试图将 ATANet 和 UDBNet 结合起来。当 ATANet 和 UDBNet 建模的分布随着时间的推移与相同的联合分布对齐时，三人最小-最大对抗游戏停止。因此，联合鉴别器强制 UDBNet 在真实降级图像上表现更好。实验结果表明，所提出的模型在广泛使用的 DIBCO 数据集上优于现有的最先进算法。拟议系统的源代码可在 https://github.com/VIROBO-15/UDBNET 上公开获得。联合鉴别器是我们的三人最小-最大对抗游戏的第三个玩家，它试图将 ATANet 和 UDBNet 结合起来。当 ATANet 和 UDBNet 建模的分布随着时间的推移与相同的联合分布对齐时，三人最小-最大对抗游戏停止。因此，联合鉴别器强制 UDBNet 在真实降级图像上表现更好。实验结果表明，所提出的模型在广泛使用的 DIBCO 数据集上优于现有的最先进算法。拟议系统的源代码可在 https://github.com/VIROBO-15/UDBNET 上公开获得。联合鉴别器是我们的三人最小-最大对抗游戏的第三个玩家，它试图将 ATANet 和 UDBNet 结合起来。当 ATANet 和 UDBNet 建模的分布随着时间的推移与相同的联合分布对齐时，三人最小-最大对抗游戏停止。因此，联合鉴别器强制 UDBNet 在真实降级图像上表现更好。实验结果表明，所提出的模型在广泛使用的 DIBCO 数据集上优于现有的最先进算法。拟议系统的源代码可在 https://github.com/VIROBO-15/UDBNET 上公开获得。当 ATANet 和 UDBNet 建模的分布随着时间的推移与相同的联合分布对齐时。因此，联合鉴别器强制 UDBNet 在真实降级图像上表现更好。实验结果表明，所提出的模型在广泛使用的 DIBCO 数据集上优于现有的最先进算法。拟议系统的源代码可在 https://github.com/VIROBO-15/UDBNET 上公开获得。当 ATANet 和 UDBNet 建模的分布随着时间的推移与相同的联合分布对齐时。因此，联合鉴别器强制 UDBNet 在真实降级图像上表现更好。实验结果表明，所提出的模型在广泛使用的 DIBCO 数据集上优于现有的最先进算法。拟议系统的源代码可在 https://github.com/VIROBO-15/UDBNET 上公开获得。

更新日期：2020-10-28

点击分享查看原文

点击收藏

阅读更多本刊最新论文