首页> 外文会议>2011 IEEE Workshop on Applications of Computer Vision >A recursive Otsu thresholding method for scanned document binarization
【24h】

A recursive Otsu thresholding method for scanned document binarization

机译:用于扫描文档二值化的递归Otsu阈值化方法

获取原文

摘要

The use of digital images of scanned handwritten historical documents has increased in recent years, especially with the online availability of large document collections. However, the sheer number of images in some of these collections makes them cumbersome to manually read and process, making the need for automated processing of increased importance. A key step in the recognition and retrieval of such documents is binarization, the separation of document text from the page's background. Binarization of images of historical documents that have been affected by degradation or are otherwise of poor image quality is difficult and continues to be a topic of research in the field of image processing. This paper presents a novel approach to this problem, including two primary variations. One combines a recursive extension of Otsu thresholding and selective bilateral filtering to allow automatic binarization and segmentation of handwritten text images. The other also builds on the recursive Otsu method and adds improved background normalization and a post-processing step to the algorithm to make it more robust and to perform adequately even for images that present bleed-through artifacts. Our results show that these techniques segment the text in historical documents comparable to and in some cases better than many state-of-the-art approaches based on their performance as evaluated using the dataset from the recent ICDAR 2009 Document Image Binarization Contest.
机译:近年来,特别是随着大型文档收藏的在线提供,对扫描的手写历史文档的数字图像的使用有所增加。但是,这些集合中某些图像的数量庞大,使得它们难以手动读取和处理,这使得对自动化处理的需求变得越来越重要。识别和检索此类文档的关键步骤是二进制化,即将文档文本与页面背景分离。已经受到降级影响或图像质量差的历史文档图像的二值化是困难的,并且继续成为图像处理领域的研究主题。本文提出了一种解决此问题的新颖方法,包括两个主要变体。一种结合了Otsu阈值的递归扩展和选择性双边过滤,以允许对手写文本图像进行自动二值化和分段。另一种也建立在递归Otsu方法的基础上,并在算法中增加了改进的背景标准化和后处理步骤,以使其更加健壮并甚至对存在渗出伪像的图像也能充分执行。我们的结果表明,根据使用最近的ICDAR 2009文档图像二值化竞赛数据集进行的性能评估,这些技术可以对历史文档中的文本进行细分,这些文本在性能上可与许多最新技术相媲美,并且在某些情况下要优于许多最新技术。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号