首页> 外文会议>IAPR International Conference on Document Analysis and Recognition >Selecting Automatically Pre-Processing Methods to Improve OCR Performances
【24h】

Selecting Automatically Pre-Processing Methods to Improve OCR Performances

机译:选择自动预处理方法以提高OCR性能

获取原文

摘要

In this paper, we propose an approach that automatically selects suitable document pre-processing algorithms to increase OCR performances. We first provide an experimental evaluations protocol to study effects of document pre-processing methods on different OCR engines for document images that have different type of distorsions. We remark that, when distortions on the document image is unknown, a pre-processing methods does not always improve but sometimes decreases the OCR performance. We conclude that the effectiveness of a pre-processing algorithm depends on the nature of the OCR and type of distorsions. In the context that distortions on the document and information about OCR system's mechanism are unknown, we propose an automatic pre-processing selection method based on a convolutional neural network with 15 layers and where the last layer contains neurons representing our different pre-processing algorithms. Experimental results show the effectiveness of our approach to improve OCR performances in a mobile-captured document images framework.
机译:在本文中,我们提出了一种自动选择合适的文档预处理算法以提高OCR性能的方法。我们首先提供一个实验评估协议,以研究文档预处理方法对具有不同类型失真的文档图像在不同OCR引擎上的影响。我们注意到,当文档图像上的变形未知时,预处理方法并不总是会改善,但有时会降低OCR性能。我们得出结论,预处理算法的有效性取决于OCR的性质和失真的类型。在文件的失真和有关OCR系统机理的信息未知的情况下,我们提出了一种基于卷积神经网络的自动预处理选择方法,该算法有15层,最后一层包含代表我们不同预处理算法的神经元。实验结果表明,我们的方法在移动捕获的文档图像框架中改善OCR性能的有效性。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号