An automatic closed-loop methodology for generating character groundtruth for scanned documents

Kanungo T.; Haralick R.M.

首页> 外文期刊>IEEE Transactions on Pattern Analysis and Machine Intelligence >An automatic closed-loop methodology for generating character groundtruth for scanned documents

【24h】

An automatic closed-loop methodology for generating character groundtruth for scanned documents

机译：一种自动闭环方法，用于为扫描的文档生成字符基础

获取原文

获取原文并翻译 | 示例

开具论文收录证明 >>

页面导航

摘要
著录项
引文网络
相似文献
相关主题

摘要

Character groundtruth for real, scanned document images is crucial for evaluating the performance of OCR systems, training OCR algorithms, and validating document degradation models. Unfortunately, manual collection of accurate groundtruth for characters in a real (scanned) document image is not practical because (i) accuracy in delineating groundtruth character bounding boxes is not high enough, (ii) it is extremely laborious and time consuming, and (iii) the manual labor required for this task is prohibitively expensive. Ee describe a closed-loop methodology for collecting very accurate groundtruth for scanned documents. We first create ideal documents using a typesetting language. Next we create the groundtruth for the ideal document. The ideal document is then printed, photocopied and then scanned. A registration algorithm estimates the global geometric transformation and then performs a robust local bitmap match to register the ideal document image to the scanned document image. Finally, groundtruth associated with the ideal document image is transformed using the estimated geometric transformation to create the groundtruth for the scanned document image. This methodology is very general and can be used for creating groundtruth for documents in typeset in any language, layout, font, and style. We have demonstrated the method by generating groundtruth for English, Hindi, and FAX document images. The cost of creating groundtruth using our methodology is minimal. If character, word or zone groundtruth is available for any real document, the registration algorithm can be used to generate the corresponding groundtruth for a rescanned version of the document.

机译：真实，已扫描文档图像的特征基础对于评估OCR系统的性能，训练OCR算法以及验证文档降级模型至关重要。不幸的是，手动收集真实的（扫描的）文档图像中的字符的准确的地面真相是不切实际的，因为（i）描绘地面真相字符边界框的准确性不够高；（ii）这非常费力且费时，并且（iii））执行此任务所需的体力劳动非常昂贵。 Ee描述了一种闭环方法，用于收集扫描文档的非常准确的地面真相。我们首先使用排版语言创建理想的文档。接下来，我们为理想文档创建基础。然后打印，复印和扫描理想文档。配准算法估计全局几何变换，然后执行鲁棒的局部位图匹配，以将理想文档图像配准到扫描的文档图像。最后，使用估计的几何变换对与理想文档图像关联的地面信息进行变换，以创建扫描文档图像的地面信息。这种方法非常通用，可用于为任何语言，布局，字体和样式的排版文档创建基础。我们已经通过为英语，印地语和传真文档图像生成groundtruth演示了该方法。使用我们的方法创建地面真理的成本是最小的。如果字符，单词或区域groundtruth可用于任何真实文档，则可使用注册算法为文档的重新扫描版本生成相应的groundtruth。

著录项

来源
《IEEE Transactions on Pattern Analysis and Machine Intelligence》 |1999年第2期|P.179-183|共5页
作者
Kanungo T.; Haralick R.M.;
展开▼
作者单位

展开▼
收录信息
原文格式 PDF
正文语种 eng
中图分类计算技术、计算机技术;
关键词

相似文献

外文文献
中文文献
专利

1. Speed-up ellipse enclosing character detection approach for large-size document images by parallel scanning and Hough transform [J] . Premachandra H. Waruna H., Premachandra Chinthaka, Parape Chandana Dinesh, International journal of machine learning and cybernetics . 2017,第1期

机译：通过并行扫描和霍夫变换加快大尺寸文档图像的椭圆包围字符检测方法
2. SCALE-SPACE APPROACH FOR CHARACTER SEGMENTATION IN SCANNED IMAGES OF ARABIC DOCUMENTS [J] . NOUREDDINE EL MAKHFI, OMAR EL BANNAY Journal of Theoretical and Applied Information Technology . 2016,第2期

机译：阿拉伯文文档倾斜图像中的字符分割的尺度空间方法
3. Automatic classification of scanned electronic health record documents [J] . Goodrum Heath, Roberts Kirk, Bernstam Elmer V International journal of medical informatics . 2020,第Deca期

机译：扫描电子健康记录文件的自动分类
4. Automatic generation of character groundtruth for scanned documents: a closed-loop approach [C] . Kanungo, T., Haralick, . 1996

机译：自动为扫描的文档生成字符基础：闭环方法
5. Groundtruth generation and document image degradation. [D] . Zi, Gang. 2005

机译：Groundtruth生成和文档图像质量下降。
6. An extensible six-step methodology to automatically generate fuzzy DSSs for diagnostic applications [O] . Antonio dAcierno, Massimo Esposito, Giuseppe De Pietro 2013

机译：一种可扩展的六步方法可自动生成用于诊断应用的模糊DSS
7. An Automatic Closed-Loop Methodology for Generating Character Groundtruth for Scanned Documents [O] . Tapas Kanungo, Robert M. Haralick 1998

机译：一种自动闭环方法，用于生成扫描文档的字符基础
8. Automatic Closed-Loop Methodology for Generating Character Groundtruth for Scanned Documents [R] . Kanungo, T. 1998

机译：用于生成扫描文档字符地面真实的自动闭环方法

An automatic closed-loop methodology for generating character groundtruth for scanned documents

摘要

著录项

引文网络

相似文献

相关主题

期刊订阅