首页> 外文会议>ACM conference on digital libraries >Quality of OCR for Degraded Text Images
【24h】

Quality of OCR for Degraded Text Images

机译:退化文本图像的OCR的质量

获取原文

摘要

Commercial OCR packages work best with high-quality scanned images. They often produce poor results when the image is degraded, either because the original itself was poor quality, or because of excessive photocopying. The ability to predict the word failure rate of OCR from a statistical analysis of the image can help in making decisions in the trade-off between the success rate of OCR and the cost of human correction of errors. This paper describes an investigation of OCR of degraded text images using a standard OCR engine (Adobe Capture). By introducing noise in a controlled manner into perfect documents, we show how the quality of OCR can be predicted from the nature of the noise.
机译:商业OCR包装最优质的是高质量的扫描图像。当图像劣化时,它们通常会产生差的结果,因为原始本身质量差,或由于过度的复印。从图像的统计分析中预测OCR的失败率的能力可以有助于在OCR成功率和人类校正的成本之间进行折衷的决策。本文介绍了使用标准OCR引擎(Adobe捕获)的降级文本图像OCR的调查。通过将受控方式引入完美文档的噪声,我们展示了如何从噪声的性质预测OCR的质量。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号