PROCESSING OF BINARY IMAGES OF HANDWRITTEN TEXT DOCUMENTS

Abuhaiba ISI.; Datta S.; Holt MJJ.

首页> 外文期刊>Pattern Recognition: The Journal of the Pattern Recognition Society >PROCESSING OF BINARY IMAGES OF HANDWRITTEN TEXT DOCUMENTS

【24h】

PROCESSING OF BINARY IMAGES OF HANDWRITTEN TEXT DOCUMENTS

机译：手写文本文档的二进制图像处理

获取原文

获取原文并翻译 | 示例

获取外文期刊封面封底 >>

开具论文收录证明 >>

文献代查 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

This paper deals with three different problems in the processing of binary images of handwritten text documents. Firstly, an integrated algorithm that finds a straight line approximation of a textual stroke is described. It has the advantage of using the distance transform of thinned binary images to identify spurious bifurcation points, which are unavoidable when thinning algorithms are used, remove them and recover the original ones. The obtained straight line approximations preserve the structural information of the original pattern. The algorithm does not resort to distortable geometrical properties. Secondly, a method is presented to recover]oops that become blobs due to blotting. The method depends on removing the pixels whose distance transform exceeds a calculated threshold. Unfortunately, it seems that it is not possible to recover such loops with a high rate of success. The authors suggest that the inclusion of thickness information, in the line segments that connect the vertices of the straight line approximations produced by the previous algorithm, is a step towards a solution of this problem. Finally, a method is developed to extract lines from pages of handwritten text, by finding the shortest spanning tree of a graph Formed from the set of main strokes. Then, main strokes of extracted lines are arranged in the same order as they were written by following the path in which they are contained. Then, every secondary stroke is assigned to the closest main stroke. At the end, an ordered list of main strokes, each with the corresponding number of assigned secondary strokes, is obtained. Each combination of main-secondary strokes can be the input to a subsequent recognition stage. The method proved to be powerful and more suited to variable handwriting. Copyright (C) 1996 Pattern Recognition Society. [References: 21]

机译：本文处理手写文本文档的二进制图像中的三个不同问题。首先，描述了找到文本笔划的直线近似的综合算法。它具有使用稀疏二值图像的距离变换来识别虚假分叉点的优势，当使用稀疏算法时，这是不可避免的，将其删除并恢复原始分叉点。所获得的直线近似值保留了原始图案的结构信息。该算法不求助于可变形的几何特性。其次，提出了一种方法来回收由于印迹而变成斑点的环。该方法取决于去除距离变换超过计算的阈值的像素。不幸的是，似乎不可能以很高的成功率恢复这样的循环。作者建议，在连接先前算法产生的直线近似值的顶点的线段中包含厚度信息，是朝着解决此问题迈出的一步。最后，开发了一种方法，该方法可以通过从主要笔划集中形成的图形的最短生成树来从手写文本的页面中提取行。然后，通过遵循包含笔画的路径，以与提取笔画相同的顺序排列提取笔画的主笔画。然后，每个次要笔划都分配给最接近的主笔划。最后，获得主笔划的有序列表，每个主笔划具有对应数量的分配的第二笔划。主次笔画的每种组合都可以作为后续识别阶段的输入。该方法被证明是强大的，并且更适合于可变笔迹。版权所有（C）1996模式识别学会。 [参考：21]

著录项

来源
《Pattern Recognition: The Journal of the Pattern Recognition Society》 |1996年第7期|共17页
作者
Abuhaiba ISI.; Datta S.; Holt MJJ.;
展开▼
作者单位

展开▼
收录信息
原文格式 PDF
正文语种 eng
中图分类自动化技术及设备;
关键词
Handwritten text; Printed text; Main stroke; Secondary stroke; Thinning; Straight line approximation; Distance transform; Blotting; Blobs; Cost matrix; Shortest spanning tree; Curves;

机译：手写文本;打印文本;主笔画;二次笔画;细化;直线逼近;距离变换;印迹;斑点;成本矩阵;最短的生成树;曲线;

相似文献

外文文献
中文文献
专利

1. PROCESSING OF BINARY IMAGES OF HANDWRITTEN TEXT DOCUMENTS [J] . Abuhaiba ISI., Datta S., Holt MJJ. Pattern Recognition: The Journal of the Pattern Recognition Society . 1996,第7期

机译：手写文本文档的二进制图像处理
2. Segmentation of text lines using multi-scale CNN from warped printed and handwritten document images [J] . Dutta Arpita, Garai Arpan, Biswas Samit, International Journal on Document Analysis and Recognition . 2021,第4期

机译：使用来自翘曲的打印和手写文档图像的多尺度CNN的文本线的分割
3. Text-line extraction from handwritten document images using GAN [J] . Kundu Soumyadeep, Paul Sayantan, Bera Suman Kumar, Expert Systems with Application . 2020,第Feba期

机译：使用GAN从手写文档图像中提取文本行
4. Text and Non-text Separation in Handwritten Document Images Using Local Binary Pattern Operator [C] . Showmik Bhowmik, Ram Sarkar, Mita Nasipuri International Conference on Intelligent Computing and Communication . 2017

机译：使用本地二进制模式操作员手写文档图像中的文本和非文本分离
5. Document image analysis techniques for handwritten text segmentation, document image rectification and digital collation. [D] . Salvi, Dhaval. 2014

机译：用于手写文本分割，文档图像校正和数字整理的文档图像分析技术。
6. Ancient administrative handwritten documents: X-ray analysis and imaging [O] . F. Albertin, A. Astolfo, M. Stampanoni, -1

机译：古代行政手写文件：X射线分析和成像
7. Approach for Arabic Handwritten Image Processing: Case of Text Detection in Degraded Documents [O] . Youssef Boulid, Mohamed Youssfi Elkettani 2014

机译：阿拉伯语手写图像处理方法：降级文档中的文本检测情况

PROCESSING OF BINARY IMAGES OF HANDWRITTEN TEXT DOCUMENTS

摘要

著录项

相似文献

相关主题

期刊订阅