

Automatic tracing and extraction of text-line and word segments directly in JPEG compressed document images
IET Image Processing (IF 2.0), Pub Date: 2020-07-27, DOI: 10.1049/iet-ipr.2019.1437
Bulla Rajesh, Mohammed Javed, P. Nagabhushan


JPEG is one of the most popular and efficient compression algorithms supported in consumer electronics. Heavy use of mobile phones and e-governance applications has produced a huge collection of JPEG compressed document images. The major challenge with these images is that processing them is expensive, because it requires repeated decompression and recompression. Recent work has shown that developing algorithms that operate directly on the compressed data is one way to overcome this issue. This study investigates a novel algorithm for segmenting text-lines and words directly from JPEG compressed handwritten document images. Segmenting a handwritten document is challenging because of uneven spacing, variable font sizes, and overlapping and touching components, and it becomes much more challenging when done directly in the compressed image. The proposed technique virtually fixes a vertical stripe at the beginning of the document to detect the starting points of text-lines. A moving window-based space penetration algorithm then traces the exact boundary between two text-lines, resolving the issues of space and font variations and of touching and overlapping components. Subsequently, a word boundary tracing algorithm is used to segment words.
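The two-stage idea in the abstract (seed points from a left-hand stripe, then a window that slides rightward through the inter-line space) can be illustrated with a rough sketch. This is not the authors' algorithm: it assumes the page has already been reduced to a hypothetical occupancy grid with one cell per block (1 if the block contains ink, 0 otherwise), and the names `find_gap_rows`, `nearest_empty`, `trace_boundary`, `stripe_width`, and `reach` are invented for illustration. How such a grid would be derived from JPEG coefficients is left out entirely.

```python
from typing import List

Grid = List[List[int]]  # assumed input: 1 = block contains ink, 0 = empty block


def find_gap_rows(grid: Grid, stripe_width: int) -> List[int]:
    """Return the middle row of each empty run inside the left stripe
    that has text both above and below it (one seed per line boundary)."""
    occupied = [any(row[:stripe_width]) for row in grid]
    seeds: List[int] = []
    r, n = 0, len(occupied)
    while r < n:
        if occupied[r]:
            r += 1
            continue
        start = r
        while r < n and not occupied[r]:
            r += 1
        # r is now one past the end of the empty run
        if start > 0 and r < n:
            seeds.append((start + r - 1) // 2)
    return seeds


def nearest_empty(grid: Grid, r: int, c: int, reach: int) -> int:
    """Look up to `reach` rows above and below (r, c) for an empty cell.
    If none is found, return r unchanged, i.e. cut through the component."""
    for d in range(1, reach + 1):
        for cand in (r - d, r + d):
            if 0 <= cand < len(grid) and not grid[cand][c]:
                return cand
    return r


def trace_boundary(grid: Grid, seed_row: int, reach: int = 1) -> List[int]:
    """Walk left to right from the seed, returning one boundary row per column."""
    path: List[int] = []
    r = seed_row
    for c in range(len(grid[0])):
        if grid[r][c]:  # blocked by ink: try to step around it
            r = nearest_empty(grid, r, c, reach)
        path.append(r)
    return path
```

Calling `find_gap_rows` once and then `trace_boundary` for each seed yields one boundary path per pair of adjacent text-lines. The fallback of cutting straight through when no empty cell is within reach is a deliberately crude stand-in for the handling of touching components described in the abstract.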


Updated: 2020-07-28
