首页> 美国卫生研究院文献>Sensors (Basel Switzerland) >Robust Combined Binarization Method of Non-Uniformly Illuminated Document Images for Alphanumerical Character Recognition
【2h】

Robust Combined Binarization Method of Non-Uniformly Illuminated Document Images for Alphanumerical Character Recognition

机译:非均匀照明文档图像的鲁棒组合二值化方法用于字母数字字符识别

代理获取
本网站仅为用户提供外文OA文献查询和代理获取服务,本网站没有原文。下单后我们将采用程序或人工为您竭诚获取高质量的原文,但由于OA文献来源多样且变更频繁,仍可能出现获取不到、文献不完整或与标题不符等情况,如果获取不到我们将提供退款服务。请知悉。

摘要

Image binarization is one of the key operations decreasing the amount of information used in further analysis of image data, significantly influencing the final results. Although in some applications, where well illuminated images may be easily captured, ensuring a high contrast, even a simple global thresholding may be sufficient, there are some more challenging solutions, e.g., based on the analysis of natural images or assuming the presence of some quality degradations, such as in historical document images. Considering the variety of image binarization methods, as well as their different applications and types of images, one cannot expect a single universal thresholding method that would be the best solution for all images. Nevertheless, since one of the most common operations preceded by the binarization is the Optical Character Recognition (OCR), which may also be applied for non-uniformly illuminated images captured by camera sensors mounted in mobile phones, the development of even better binarization methods in view of the maximization of the OCR accuracy is still expected. Therefore, in this paper, the idea of the use of robust combined measures is presented, making it possible to bring together the advantages of various methods, including some recently proposed approaches based on entropy filtering and a multi-layered stack of regions. The experimental results, obtained for a dataset of 176 non-uniformly illuminated document images, referred to as the WEZUT OCR Dataset, confirm the validity and usefulness of the proposed approach, leading to a significant increase of the recognition accuracy.
机译:图像二值化是减少用于图像数据进一步分析的信息量的关键操作之一,从而显着影响最终结果。尽管在某些应用中,可以轻松捕获照明良好的图像,确保高对比度,即使是简单的全局阈值也已足够,但是存在一些更具挑战性的解决方案,例如,基于对自然图像的分析或假定存在某些质量下降,例如历史文档图像中的质量下降。考虑到图像二值化方法的多样性以及它们的不同应用和图像类型,不能期望一种通用的阈值化方法将是所有图像的最佳解决方案。然而,由于二值化之前最常见的操作之一是光学字符识别(OCR),它也可用于安装在移动电话中的摄像头传感器捕获的非均匀照明图像,因此,开发出了更好的二值化方法OCR精度最大化的观点仍然值得期待。因此,在本文中,提出了使用健壮的组合度量的想法,从而有可能将各种方法的优点结合在一起,其中包括一些最近提出的基于熵滤波的方法和多层区域堆栈。从176个非均匀照亮文档图像的数据集(称为WEZUT OCR数据集)获得的实验结果证实了所提方法的有效性和实用性,从而导致识别精度的显着提高。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
代理获取

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号