Automatic Annotation Method for Document Image Binarization in Real Systems

机译：真实系统中文档图像二值化的自动注释方法

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

The accuracy of optical character recognition (OCR) has significantly improved recently through the use of deep learning. However, when OCR is used in real applications, the shortage of annotated images often makes training difficult. To solve this problem, there are automatic annotation methods. However, many of these methods are based on active learning, and operators need to confirm generated annotation candidates. I propose a practical automatic annotation method for binarization, which is one of the components of OCR. The purpose with the proposed method is to automatically confirm the quality of annotation candidates. This method consists of three simple processes to achieve this. First, cropping a text from a whole image. Second, applying binarization to the cropped image at all thresholds. Third, recognizing all binarized cropped images and matching the recognition results and correct character database. If the characters match, the cropped binary image is correctly binarized. The method selects that cropped binarized image as an annotation for binarization. Cropping coordinates and the correct character database (DB) can be obtained from a practical OCR system. Because users of such a system usually input corrections for misrecognition of OCR to the system, the system can obtain the correct characters and coordinates. The experimental results indicate that the annotations generated with the proposed method can improve the performance of deep-learning-based binarization. As a result, the normalized edit distance between the recognized text and grand truth text can be reduced by 38.56% on the Find it! receipt image dataset.

机译：通过使用深度学习，光学字符识别（OCR）的准确性最近得到了显着提高。但是，在实际应用中使用OCR时，带注释的图像不足常常使训练变得困难。为了解决这个问题，有自动注释方法。但是，这些方法中的许多方法都是基于主动学习的，操作员需要确认生成的注释候选者。我提出了一种实用的用于二值化的自动注释方法，它是OCR的组成部分之一。所提出的方法的目的是自动确认候选注释的质量。该方法由三个简单的过程组成，可以实现这一目标。首先，从整个图像中裁剪文本。第二，在所有阈值下将二值化应用于裁剪后的图像。第三，识别所有二值化的裁剪图像并匹配识别结果和正确的字符数据库。如果字符匹配，则裁剪的二进制图像将正确地进行二值化处理。该方法选择该裁剪的二值化图像作为二值化的注释。可以从实际的OCR系统获取裁剪坐标和正确的字符数据库（DB）。因为这种系统的用户通常向系统输入错误识别OCR的更正，所以系统可以获得正确的字符和坐标。实验结果表明，该方法生成的注释可以提高基于深度学习的二值化的性能。结果，在“查找它”上，可以将识别的文本和真实的文本之间的规范化编辑距离减少38.56％。收据图像数据集。

著录项

来源
《Asian Conference on Pattern Recognition》|2020年|464-477|共14页
会议地点
作者
Ryosuke Odate;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类
关键词
Character recognition; OCR; Binarization; Annotation; Practical systems;

机译：字符识别; OCR;二值化;注解;实用系统;

相似文献

外文文献
中文文献
专利

1. An Adaptive Binarization Method for Cost-efficient Document Image System in Wavelet Domain [J] . Hsia Chih-Hsien, Lin Ting-Yu, Chiang Jen-Shiun Journal of Imaging Science and Technology . 2020,第3期

机译：小波域中成本高效文档图像系统的自适应二值化方法
2. An adaptive local binarization method for document images based on a novel thresholding method and dynamic windows [J] . Bilal Bataineh, Siti Norul Huda Sheikh Abdullah, Khairuddin Omar Pattern recognition letters . 2011,第14期

机译：基于新颖阈值法和动态窗口的文档图像自适应局部二值化方法
3. Evaluation of binarization methods for document images [J] . Trier O.D., Taxt T. IEEE Transactions on Pattern Analysis and Machine Intelligence . 1995,第3期

机译：评估文档图像的二值化方法
4. Research on automatic block binarization method of stained Tibetan ancient document image based on Lab color space [C] . Yuehui Han, Weilan Wang, Yiqun Wang International Forum on Management, Education and Information Technology Application . 2018

机译：基于Lab色彩空间的染色藏古文档图像自动阻滞二值化方法研究
5. Automatic mesh generation for radiosity method in realistic image synthesis. [D] . Yi, Xin. 1996

机译：用于真实图像合成中的光能传递方法的自动网格生成。
6. Robust Combined Binarization Method of Non-Uniformly Illuminated Document Images for Alphanumerical Character Recognition [O] . Hubert Michalak, Krzysztof Okarma 2020

机译：非均匀照明文档图像的鲁棒组合二值化方法用于字母数字字符识别

Automatic Annotation Method for Document Image Binarization in Real Systems

摘要

著录项

相似文献

相关主题

期刊订阅