首页> 外文期刊>International Journal of Applied Engineering Research >Detection and Removal of Graphical Components in Pre-Printed Documents
【24h】

Detection and Removal of Graphical Components in Pre-Printed Documents

机译:预打印文档中图形组件的检测和删除

获取原文
获取原文并翻译 | 示例
           

摘要

Pre-processing of document images is one of the most intensive operations for pre-printed document images. The recognition of text in pre-printed documents is most sensitive to graphical components coexisting with it. In this paper we address the problem of detection and removal of graphical components like logos, emblems and other symbolic entities, which leads to an error free document processing in the subsequent stages of Optical Character Recognition. The detection of graphical entities is performed by employing Zernike moments and histogram of gradient features, followed by which the line detection and removal is accomplished by masking the image with a vertical line structuring element by computation of region covered by convex hull within the area by structuring element in the image. The detection of line structuring element also addresses the problem of characters overlapping with lines leading to retention of the character during erosion of lines from the image. The experimental outcomes produced by emblem detection of algorithm are appreciable with accuracy of around 97% for the emblem detection and 92% accurate outcomes in case of line detection and removal.
机译:文档图像的预处理是用于预打印文档图像的最密集的操作之一。预打印文档中的文本识别对与其共存的图形组件最为敏感。在本文中,我们解决了检测和删除徽标,标志和其他符号实体之类的图形组件的问题,该问题导致在光学字符识别的后续阶段实现无错误的文档处理。图形实体的检测是通过使用Zernike矩和梯度特征的直方图执行的,然后通过垂直线构造元素对图像进行遮罩来实现线条的检测和去除,方法是通过结构化计算区域内凸包所覆盖的区域图片中的元素。线结构元素的检测还解决了字符与线重叠的问题,从而导致在从图像中腐蚀线的过程中保留了字符。通过标志检测算法产生的实验结果是可观的,标志检测的准确度约为97%,而在进行线检测和去除的情况下,准确度为92%。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号