当前位置: X-MOL 学术Int. J. Doc. Anal. Recognit. › 论文详情
Our official English website, www.x-mol.net, welcomes your feedback! (Note: you will need to create a separate account there.)
Hyperkernel-based intuitionistic fuzzy c-means for denoising color archival document images
International Journal on Document Analysis and Recognition ( IF 2.3 ) Pub Date : 2020-03-10 , DOI: 10.1007/s10032-020-00352-2
Walid Elhedda , Maroua Mehri , Mohamed Ali Mahjoub

In this article, we have addressed the problem of denoising and enhancement of color archival handwritten document images by separating noise from text and background. Indeed, archival document images that originated from scanning or photographing paper documents are mainly digitized in full color mode. Thus, it is necessary to preserve and exploit color information when applying an enhancement method or a denoising technique. Thus, the focus of our work has been to model a color image using a hyperspace. The defined hyperspace formed by the image pixels is obtained by using both topological and color spaces. The novelty of our work lies in exploiting the obtained hyperspace to cluster the extracted low-level features (topological and color) and, thereafter, to separate noise from text and background. Indeed, based on combining the obtained hyperspace with an adapted kernel-based intuitionistic fuzzy c-means (KIFCM) algorithm we have proposed a novel hyper-KIFCM (HKIFCM) method for denoising color historical document images. To illustrate the effectiveness of the HKIFCM method, a thorough experimental study has been firstly conducted with qualitative and quantitative observations obtained from color archival handwritten document images collected from both the Tunisian national archives and two datasets provided in the context of open competitions at ICDAR and ICFHR conferences. Then, we have compared the results achieved with those obtained using the state-of-the-art methods.

中文翻译:

基于超核的直觉模糊c均值对彩色档案文档图像进行降噪

在本文中,我们通过将噪声从文本和背景中分离出来,解决了彩色档案手写文档图像的降噪和增强问题。实际上,源自扫描或照相纸质文档的档案文档图像主要以全色模式数字化。因此,当应用增强方法或去噪技术时,有必要保存和利用颜色信息。因此,我们的工作重点是使用超空间对彩色图像进行建模。通过使用拓扑空间和颜色空间来获得由图像像素形成的定义的超空间。我们工作的新颖性在于利用获得的超空间对提取的低级特征(拓扑和颜色)进行聚类,然后将噪声从文本和背景中分离出来。确实,在将获得的超空间与基于核的直觉模糊c均值(KIFCM)算法相结合的基础上,我们提出了一种用于对彩色历史文档图像进行去噪的新颖的超KIFCM(HKIFCM)方法。为了说明HKIFCM方法的有效性,首先进行了一次彻底的实验研究,从在突尼斯国家档案馆和在ICDAR和ICFHR公开比赛的背景下提供的两个数据集收集的彩色档案手写文档图像中获得了定性和定量观察结果。会议。然后,我们将获得的结果与使用最新方法获得的结果进行了比较。为了说明HKIFCM方法的有效性,首先进行了一次彻底的实验研究,从在突尼斯国家档案馆和在ICDAR和ICFHR公开比赛的背景下提供的两个数据集收集的彩色档案手写文档图像中获得了定性和定量观察结果。会议。然后,我们将获得的结果与使用最新方法获得的结果进行了比较。为了说明HKIFCM方法的有效性,首先进行了一次彻底的实验研究,从在突尼斯国家档案馆和在ICDAR和ICFHR公开比赛的背景下提供的两个数据集收集的彩色档案手写文档图像中获得了定性和定量观察结果。会议。然后,我们将获得的结果与使用最新方法获得的结果进行了比较。
更新日期:2020-03-10
down
wechat
bug