...
首页> 外文期刊>Multimedia Tools and Applications >A statistical tool based binarization method for document images
【24h】

A statistical tool based binarization method for document images

机译:基于统计工具的文档图像二值化方法

获取原文
获取原文并翻译 | 示例
   

获取外文期刊封面封底 >>

       

摘要

Binarization of document images has great importance in several applications like historical document restoration, Optical Character Recognition (OCR). It is a challenging task due to small difference between foreground and background pixel intensities, intricate font patterns and noisy background. In this article a binarization algorithm is presented for document images which has performed significantly well on handwritten document images as well as machine printed document images. First, the RGB document images are converted to a prominent gray-scale image using statistical tools like mean, variance and standard deviation. Next, the gray-scale images are binarized using edge detection. Further the noises are removed using connected component features analysis. The proposed method is experimented on publicly available DIBCO 2016 and DIBCO 2017 datasets. The performance of the proposed algorithm is satisfactory in terms of F-Measure (FM), Pseudo-FMeasure (F-ps), PSNR, Distance Reciprocal Distortion (DRD) and it also provides significant results on degraded document images.
机译:文档图像的二值化在诸如历史文档还原,光学字符识别(OCR)等多种应用中非常重要。由于前景和背景像素强度之间的细微差异,复杂的字体样式和嘈杂的背景,这是一项具有挑战性的任务。在本文中,提出了一种针对文档图像的二值化算法,该算法在手写文档图像以及机器打印的文档图像上均表现出色。首先,使用统计工具(例如均值,方差和标准差)将RGB文档图像转换为突出的灰度图像。接下来,使用边缘检测将灰度图像二值化。此外,使用连接的零部件特征分析可以消除噪声。在公开可用的DIBCO 2016和DIBCO 2017数据集上对提出的方法进行了实验。所提出算法的性能在F度量(FM),伪FMeasure(F-ps),PSNR,距离倒数失真(DRD)方面令人满意,并且在退化的文档图像上也提供了显着的结果。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号