首页> 外文会议>International Conference on Document Analysis and Recognition >Towards Document Image Quality Assessment: A Text Line Based Framework and a Synthetic Text Line Image Dataset
【24h】

Towards Document Image Quality Assessment: A Text Line Based Framework and a Synthetic Text Line Image Dataset

机译:迈向文档图像质量评估:基于文本行的框架和合成文本行图像数据集

获取原文

摘要

Since the low quality of document images will greatly undermine the chances of success in automatic text recognition and analysis, it is necessary to assess the quality of document images uploaded in online business process, so as to reject those images of low quality. In this paper, we attempt to achieve document image quality assessment and our contributions are twofold. Firstly, since document image quality assessment is more interested in text, we propose a text line based framework to estimate document image quality, which is composed of three stages: text line detection, text line quality prediction, and overall quality assessment. Text line detection aims to find potential text lines with a detector. In the text line quality prediction stage, the quality score is computed for each text line with a CNN-based prediction model. The overall quality of document images is finally assessed with the ensemble of all text line quality. Secondly, to train the prediction model, a large-scale dataset, comprising 52,094 text line images, is synthesized with diverse attributes. For each text line image, a quality label is computed with a piecewise function. To demonstrate the effectiveness of the proposed framework, comprehensive experiments are evaluated on two popular document image quality assessment benchmarks. Our framework significantly outperforms the state-of-the-art methods by large margins on the large and complicated dataset.
机译:由于文档图像质量低下会大大破坏自动文本识别和分析的成功机会,因此有必要评估在线业务流程中上载的文档图像质量,以拒绝那些质量低下的图像。在本文中,我们尝试实现文档图像质量评估,并且我们的贡献是双重的。首先,由于文档图像质量评估对文本更感兴趣,因此我们提出了一种基于文本行的框架来估计文档图像质量,该框架包括三个阶段:文本行检测,文本行质量预测和总体质量评估。文本行检测旨在通过检测器查找潜在的文本行。在文本行质量预测阶段,使用基于CNN的预测模型为每个文本行计算质量得分。最后,以所有文本行质量为整体评估文档图像的整体质量。其次,为了训练预测模型,合成了包含52,094个文本行图像的大规模数据集,并具有多种属性。对于每个文本行图像,使用分段函数计算质量标签。为了证明所提出框架的有效性,在两个流行的文档图像质量评估基准上对综合实验进行了评估。在庞大而复杂的数据集上,我们的框架大大优于最新方法。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号