Selecting Automatically Pre-Processing Methods to Improve OCR Performances

Abstract

In this paper, we propose an approach that automatically selects suitable document pre-processing algorithms to improve OCR performance. We first provide an experimental evaluation protocol to study the effects of document pre-processing methods on different OCR engines, for document images that have different types of distortion. We observe that, when the distortions on the document image are unknown, a pre-processing method does not always improve OCR performance and sometimes decreases it. We conclude that the effectiveness of a pre-processing algorithm depends on the nature of the OCR engine and the type of distortion. For the setting in which both the distortions on the document and the OCR system's internal mechanism are unknown, we propose an automatic pre-processing selection method based on a 15-layer convolutional neural network whose last layer contains neurons representing our different pre-processing algorithms. Experimental results show that our approach improves OCR performance on mobile-captured document images.
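
The selection step described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the three candidate methods, the names `METHODS`, `select_method` and `preprocess`, and the score vector standing in for the network's last-layer output are all hypothetical. The only idea taken from the abstract is that each output neuron corresponds to one pre-processing algorithm, so the highest-scoring neuron decides which algorithm is applied before OCR.

```python
from typing import Callable, List, Sequence, Tuple

Image = List[List[int]]  # grayscale pixels in 0..255, row-major


def identity(img: Image) -> Image:
    """Leave the image untouched (the 'no pre-processing' option)."""
    return [row[:] for row in img]


def binarize(img: Image, threshold: int = 128) -> Image:
    """Global threshold: pixels at or above the threshold become 255, others 0."""
    return [[255 if p >= threshold else 0 for p in row] for row in img]


def stretch_contrast(img: Image) -> Image:
    """Linearly rescale pixel values so they span the full 0..255 range."""
    flat = [p for row in img for p in row]
    lo, hi = min(flat), max(flat)
    if hi == lo:  # flat image: nothing to stretch
        return [row[:] for row in img]
    return [[(p - lo) * 255 // (hi - lo) for p in row] for row in img]


# Hypothetical candidate set. The order is assumed to match the order of
# the classifier's output neurons.
METHODS: List[Tuple[str, Callable[[Image], Image]]] = [
    ("none", identity),
    ("binarize", binarize),
    ("stretch_contrast", stretch_contrast),
]


def select_method(scores: Sequence[float]) -> int:
    """Return the index of the highest-scoring output neuron."""
    if len(scores) != len(METHODS):
        raise ValueError("expected one score per pre-processing method")
    return max(range(len(scores)), key=lambda i: scores[i])


def preprocess(img: Image, scores: Sequence[float]) -> Tuple[str, Image]:
    """Apply the method chosen by the scores and return its name and output."""
    name, method = METHODS[select_method(scores)]
    return name, method(img)


if __name__ == "__main__":
    # Stand-in for the network output on one image (assumed values).
    example_scores = [0.1, 0.7, 0.2]
    example_image = [[10, 200], [130, 90]]
    print(preprocess(example_image, example_scores))
```

In a real pipeline the scores would come from a trained classifier applied to the input image, and the selected output would then be passed to the OCR engine.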

Bibliographic details

Source
IAPR International Conference on Document Analysis and Recognition, 2017, pp. 169-174 (6 pages)
Authors
Quang Anh Bui; David Mollard; Salvatore Tabbone
Original format: PDF
Keywords
Optical character recognition software; Distortion; Noise reduction; Training; Text recognition; Image edge detection;
