首页> 外文期刊>Elektronika ir Elektrotechnika >Prediction of the Optical Character Recognition Accuracy based on the Combined Assessment of Image Binarization Results
【24h】

Prediction of the Optical Character Recognition Accuracy based on the Combined Assessment of Image Binarization Results

机译:基于图像二值化结果综合评估的光学字符识别精度预测

获取原文
获取原文并翻译 | 示例
           

摘要

In the paper the problem of reliable evaluation of the effects of image binarization is discussed in view of image recognition accuracy. Considering the Optical Character Recognition methods, typically used for document images obtained by cameras or scanners, their accuracy is strongly dependent on the results of image binarization. Unfortunately, metrics typically used for the evaluation of binarization results, such as Peak Signal to Noise Ratio, Distance Reciprocal Distortion or Misclassification Penalty Metric, are not always well correlated with the recognition accuracy of individual characters. Therefore, a novel approach related to the use of combined metric for the assessment of binarization results is proposed and verified for the binary images obtained using some popular histogram-based methods from the original images with degraded quality. For the experimental prediction of the character recognition accuracy, the popular open source engine supported by Google, known as Tesseract, has been used.
机译:本文从图像识别的准确性出发,讨论了对图像二值化效果进行可靠评估的问题。考虑到通常用于照相机或扫描仪获得的文档图像的光学字符识别方法,其准确性在很大程度上取决于图像二值化的结果。不幸的是,通常用于评估二值化结果的度量标准,例如峰值信噪比,距离倒数失真或误分类罚分度量标准,并不总是与单个字符的识别准确度相关联。因此,提出了一种与使用组合度量来评估二值化结果有关的新颖方法,并针对使用某些基于直方图的流行方法从质量下降的原始图像中获得的二进制图像进行了验证。为了对字符识别精度进行实验性预测,已使用了由Google支持的流行的开源引擎Tesseract。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号