【24h】

Document Image De-warping Based on Detection of Distorted Text Lines

机译:基于文本行失真检测的文档图像变形

获取原文
获取原文并翻译 | 示例

摘要

Image warping caused by scanning, photocopying or photographing a document is a common problem in the field of document processing and understanding. Distortion within the text documents impairs OCRability and thus strongly decreases the usability of the results. This is one of the major obstacles for automating the process of digitizing printed documents. In this paper we present a novel algorithm which is able to correct document image warping based on the detection of distorted text lines. The proposed solution is used in a recent project of digitizing old, poor quality manuscripts. The algorithm is compared to other published approaches. Experiments with various document samples and the resulting improvements of the text recognition rate achieved by a commercial OCR engine are also presented.
机译:由扫描,影印或照相文档引起的图像变形是文档处理和理解领域中的普遍问题。文本文档中的失真会损害OCR能力,因此会大大降低结果的可用性。这是使打印文档数字化过程自动化的主要障碍之一。在本文中,我们提出了一种新颖的算法,该算法能够基于扭曲文本行的检测来纠正文档图像变形。拟议的解决方案用于将旧的,质量较差的手稿数字化的最新项目。该算法与其他已发布的方法进行了比较。还介绍了各种文档样本的实验以及通过商用OCR引擎实现的文本识别率的改进。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号