首页> 外文会议>Asian Conference on Pattern Recognition >Automatic Annotation Method for Document Image Binarization in Real Systems
【24h】

Automatic Annotation Method for Document Image Binarization in Real Systems

机译:真实系统中文档图像二值化的自动注释方法

获取原文

摘要

The accuracy of optical character recognition (OCR) has significantly improved recently through the use of deep learning. However, when OCR is used in real applications, the shortage of annotated images often makes training difficult. To solve this problem, there are automatic annotation methods. However, many of these methods are based on active learning, and operators need to confirm generated annotation candidates. I propose a practical automatic annotation method for binarization, which is one of the components of OCR. The purpose with the proposed method is to automatically confirm the quality of annotation candidates. This method consists of three simple processes to achieve this. First, cropping a text from a whole image. Second, applying binarization to the cropped image at all thresholds. Third, recognizing all binarized cropped images and matching the recognition results and correct character database. If the characters match, the cropped binary image is correctly binarized. The method selects that cropped binarized image as an annotation for binarization. Cropping coordinates and the correct character database (DB) can be obtained from a practical OCR system. Because users of such a system usually input corrections for misrecognition of OCR to the system, the system can obtain the correct characters and coordinates. The experimental results indicate that the annotations generated with the proposed method can improve the performance of deep-learning-based binarization. As a result, the normalized edit distance between the recognized text and grand truth text can be reduced by 38.56% on the Find it! receipt image dataset.
机译:通过使用深度学习,光学字符识别(OCR)的准确性最近得到了显着提高。但是,在实际应用中使用OCR时,带注释的图像不足常常使训练变得困难。为了解决这个问题,有自动注释方法。但是,这些方法中的许多方法都是基于主动学习的,操作员需要确认生成的注释候选者。我提出了一种实用的用于二值化的自动注释方法,它是OCR的组成部分之一。所提出的方法的目的是自动确认候选注释的质量。该方法由三个简单的过程组成,可以实现这一目标。首先,从整个图像中裁剪文本。第二,在所有阈值下将二值化应用于裁剪后的图像。第三,识别所有二值化的裁剪图像并匹配识别结果和正确的字符数据库。如果字符匹配,则裁剪的二进制图像将正确地进行二值化处理。该方法选择该裁剪的二值化图像作为二值化的注释。可以从实际的OCR系统获取裁剪坐标和正确的字符数据库(DB)。因为这种系统的用户通常向系统输入错误识别OCR的更正,所以系统可以获得正确的字符和坐标。实验结果表明,该方法生成的注释可以提高基于深度学习的二值化的性能。结果,在“查找它”上,可以将识别的文本和真实的文本之间的规范化编辑距离减少38.56%。收据图像数据集。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号