首页> 外文期刊>Pattern Analysis and Applications >Document image binarization by two-stage block extraction and background intensity determination
【24h】

Document image binarization by two-stage block extraction and background intensity determination

机译:通过两阶段块提取和背景强度确定对文档图像进行二值化

获取原文
获取原文并翻译 | 示例
           

摘要

This paper presents a novel approach to bina-rizing document images. All blocks with individual background intensity values in a document image are first extracted using a two-stage extraction procedure. Then, the intensity distribution of each block is calculated to determine the variation ranges of background intensity. For each extracted block, interior pixels whose intensity values fall within these ranges are regarded as background pixels. For those pixels outside all extracted blocks, Otsu's global threshold method is applied to binarize them. To evaluate the developed system, 275 representative document images are collected to evaluate the binarization results by recognizing characters extracted from those binarized images. These binarized images are generated using the proposed and other existent approaches and fed into the same optical character recognition system to evaluate the practicability of each method. The proposed document binarization method obtains the highest recognition accuracy.
机译:本文提出了一种对文档图像进行二值化处理的新颖方法。首先使用两阶段提取程序来提取文档图像中具有各个背景强度值的所有块。然后,计算每个块的强度分布以确定背景强度的变化范围。对于每个提取的块,将强度值落入这些范围内的内部像素视为背景像素。对于所有提取的块之外的那些像素,将使用Otsu的全局阈值方法对它们进行二值化。为了评估开发的系统,收集了275个代表性文档图像,通过识别从那些二进制图像中提取的字符来评估二进制化结果。这些二值化图像是使用建议的方法和其他现有方法生成的,并馈入同一光学字符识别系统以评估每种方法的实用性。提出的文档二值化方法获得了最高的识别精度。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号