首页> 美国卫生研究院文献>Sensors (Basel Switzerland) >Robust Combined Binarization Method of Non-Uniformly Illuminated Document Images for Alphanumerical Character Recognition

【2h】

Robust Combined Binarization Method of Non-Uniformly Illuminated Document Images for Alphanumerical Character Recognition

机译：非均匀照明文档图像的鲁棒组合二值化方法用于字母数字字符识别

代理获取

本网站仅为用户提供外文OA文献查询和代理获取服务，本网站没有原文。下单后我们将采用程序或人工为您竭诚获取高质量的原文，但由于OA文献来源多样且变更频繁，仍可能出现获取不到、文献不完整或与标题不符等情况，如果获取不到我们将提供退款服务。请知悉。

页面导航

摘要
著录项
相似文献
相关主题

摘要

Image binarization is one of the key operations decreasing the amount of information used in further analysis of image data, significantly influencing the final results. Although in some applications, where well illuminated images may be easily captured, ensuring a high contrast, even a simple global thresholding may be sufficient, there are some more challenging solutions, e.g., based on the analysis of natural images or assuming the presence of some quality degradations, such as in historical document images. Considering the variety of image binarization methods, as well as their different applications and types of images, one cannot expect a single universal thresholding method that would be the best solution for all images. Nevertheless, since one of the most common operations preceded by the binarization is the Optical Character Recognition (OCR), which may also be applied for non-uniformly illuminated images captured by camera sensors mounted in mobile phones, the development of even better binarization methods in view of the maximization of the OCR accuracy is still expected. Therefore, in this paper, the idea of the use of robust combined measures is presented, making it possible to bring together the advantages of various methods, including some recently proposed approaches based on entropy filtering and a multi-layered stack of regions. The experimental results, obtained for a dataset of 176 non-uniformly illuminated document images, referred to as the WEZUT OCR Dataset, confirm the validity and usefulness of the proposed approach, leading to a significant increase of the recognition accuracy.

机译：图像二值化是减少用于图像数据进一步分析的信息量的关键操作之一，从而显着影响最终结果。尽管在某些应用中，可以轻松捕获照明良好的图像，确保高对比度，即使是简单的全局阈值也已足够，但是存在一些更具挑战性的解决方案，例如，基于对自然图像的分析或假定存在某些质量下降，例如历史文档图像中的质量下降。考虑到图像二值化方法的多样性以及它们的不同应用和图像类型，不能期望一种通用的阈值化方法将是所有图像的最佳解决方案。然而，由于二值化之前最常见的操作之一是光学字符识别（OCR），它也可用于安装在移动电话中的摄像头传感器捕获的非均匀照明图像，因此，开发出了更好的二值化方法OCR精度最大化的观点仍然值得期待。因此，在本文中，提出了使用健壮的组合度量的想法，从而有可能将各种方法的优点结合在一起，其中包括一些最近提出的基于熵滤波的方法和多层区域堆栈。从176个非均匀照亮文档图像的数据集（称为WEZUT OCR数据集）获得的实验结果证实了所提方法的有效性和实用性，从而导致识别精度的显着提高。

著录项

期刊名称 Sensors (Basel Switzerland)
作者
Hubert Michalak; Krzysztof Okarma;
展开▼
作者单位

展开▼
年(卷),期 2020(20),10
年度 2020
页码 -1
总页数 23
原文格式 PDF
正文语种
中图分类
关键词
image binarization; optical character recognition; document images; local thresholding; image pre-processing; natural images;

机译：图像二值化;光学字符识别;文档图像;局部阈值;图像预处理;自然图像;
入库时间 2022-08-21 11:48:43

相似文献

外文文献
中文文献
专利

1. Fast Binarization of Unevenly Illuminated Document Images Based on Background Estimation for Optical Character Recognition Purposes [J] . Hubert Michalak, Krzysztof Okarma Journal of Universal Computer Science . 2019,第6期

机译：基于背景估计的不均匀照明文档图像的快速二值化，用于光学字符识别
2. Fast Binarization of Unevenly Illuminated Document Images Based on Background Estimation for Optical Character Recognition Purposes [J] . Hubert Michalak, Krzysztof Okarma Journal of Universal Computer Science . 2019,第6期

机译：基于背景估计的不均匀照明文档图像的快速二值化，用于光学字符识别
3. A new binarization method for non-uniform illuminated document images [J] . Wen J., Li S., Sun J. Pattern Recognition: The Journal of the Pattern Recognition Society . 2013,第6期

机译：一种非均匀照明文档图像的二值化新方法
4. A novel method for binarization of badly illuminated document images [C] . Tabatabaei S.A., Bohlool M. 17th IEEE International Conference on Image Processing . 2010

机译：一种不良光照文档图像二值化的新方法
5. Effective and efficient binarization of degraded document images. [D] . Parker, Jon Ivan. 2016

机译：对退化的文档图像进行有效和高效的二值化。
6. Effective and fast binarization method for combined degradation on ancient documents [O] . Khairun Saddami, Khairul Munadi, Yuwaldi Away, 2019

机译：有效快速的二值化方法对古代文献进行综合降解
7. Robust Combined Binarization Method of Non-Uniformly Illuminated Document Images for Alphanumerical Character Recognition [O] . Hubert Michalak, Krzysztof Okarma 2020

机译：非均匀照明文档图像的鲁棒组合二值化方法，用于字母数字识别
8. Some methods of encoding simple visual images for use with a sparse distributed memory, with applications to character recognition [R] . Jaeckel, Louis A. 1989

机译：一些编码简单视觉图像的方法，用于稀疏分布式存储器，应用于字符识别

Robust Combined Binarization Method of Non-Uniformly Illuminated Document Images for Alphanumerical Character Recognition

摘要

著录项

相似文献

相关主题

期刊订阅