...
首页> 外文期刊>Pattern Recognition: The Journal of the Pattern Recognition Society >PROCESSING OF BINARY IMAGES OF HANDWRITTEN TEXT DOCUMENTS
【24h】

PROCESSING OF BINARY IMAGES OF HANDWRITTEN TEXT DOCUMENTS

机译:手写文本文档的二进制图像处理

获取原文
获取原文并翻译 | 示例
   

获取外文期刊封面封底 >>

       

摘要

This paper deals with three different problems in the processing of binary images of handwritten text documents. Firstly, an integrated algorithm that finds a straight line approximation of a textual stroke is described. It has the advantage of using the distance transform of thinned binary images to identify spurious bifurcation points, which are unavoidable when thinning algorithms are used, remove them and recover the original ones. The obtained straight line approximations preserve the structural information of the original pattern. The algorithm does not resort to distortable geometrical properties. Secondly, a method is presented to recover]oops that become blobs due to blotting. The method depends on removing the pixels whose distance transform exceeds a calculated threshold. Unfortunately, it seems that it is not possible to recover such loops with a high rate of success. The authors suggest that the inclusion of thickness information, in the line segments that connect the vertices of the straight line approximations produced by the previous algorithm, is a step towards a solution of this problem. Finally, a method is developed to extract lines from pages of handwritten text, by finding the shortest spanning tree of a graph Formed from the set of main strokes. Then, main strokes of extracted lines are arranged in the same order as they were written by following the path in which they are contained. Then, every secondary stroke is assigned to the closest main stroke. At the end, an ordered list of main strokes, each with the corresponding number of assigned secondary strokes, is obtained. Each combination of main-secondary strokes can be the input to a subsequent recognition stage. The method proved to be powerful and more suited to variable handwriting. Copyright (C) 1996 Pattern Recognition Society. [References: 21]
机译:本文处理手写文本文档的二进制图像中的三个不同问题。首先,描述了找到文本笔划的直线近似的综合算法。它具有使用稀疏二值图像的距离变换来识别虚假分叉点的优势,当使用稀疏算法时,这是不可避免的,将其删除并恢复原始分叉点。所获得的直线近似值保留了原始图案的结构信息。该算法不求助于可变形的几何特性。其次,提出了一种方法来回收由于印迹而变成斑点的环。该方法取决于去除距离变换超过计算的阈值的像素。不幸的是,似乎不可能以很高的成功率恢复这样的循环。作者建议,在连接先前算法产生的直线近似值的顶点的线段中包含厚度信息,是朝着解决此问题迈出的一步。最后,开发了一种方法,该方法可以通过从主要笔划集中形成的图形的最短生成树来从手写文本的页面中提取行。然后,通过遵循包含笔画的路径,以与提取笔画相同的顺序排列提取笔画的主笔画。然后,每个次要笔划都分配给最接近的主笔划。最后,获得主笔划的有序列表,每个主笔划具有对应数量的分配的第二笔划。主次笔画的每种组合都可以作为后续识别阶段的输入。该方法被证明是强大的,并且更适合于可变笔迹。版权所有(C)1996模式识别学会。 [参考:21]

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号