首页> 外文期刊>Pattern recognition letters >Morphological preprocessing method to thresholding degraded word images
【24h】

Morphological preprocessing method to thresholding degraded word images

机译:阈值化退化词图像的形态学预处理方法

获取原文
获取原文并翻译 | 示例
       

摘要

This paper presents a novel preprocessing method based on mathematical morphology techniques to improve the subsequent thresholding quality of raw degraded word images. The raw degraded word images contain undesirable shapes called critical shadows on the background that cause noise in binary images. This noise constitutes obstacles to posterior segmentation of characters. Direct application of a thresholding method produces inadequate binary versions of these degraded word images. Our preprocessing method called Shadow Location and Lightening (SL~*L) adaptively, accurately and without manual fine-tuning of parameters locates these critical shadows on grayscale degraded images using morphological operations, and lightens them before applying eventual thresholding process. In this way, enhanced binary images without unpredictable and inappropriate noise can be provided to subsequent segmentation of characters. Then, adequate binary characters can be segmented and extracted as input data to optical character recognition (OCR) applications saving computational effort and increasing recognition rate. The proposed method is experimentally tested with a set of several raw degraded images extracted from real photos acquired by unsophisticated imaging systems. A qualitative analysis of experimental results led to conclusions that the thresholding result quality was significantly improved with the proposed preprocessing method. Also, a quantitative evaluation using a testing data of 1194 degraded word images showed the essentiality and effectiveness of the proposed preprocessing method to increase segmentation and recognition rates of their characters. Furthermore, an advantage of the proposed method is that Otsu's method as a simple and easily implementable global thresholding technique can be sufficient to reducing computational load.
机译:本文提出了一种基于数学形态学技术的预处理方法,以提高原始降级词图像的后续阈值质量。原始的降级词图像在背景上包含不希望的形状,称为临界阴影,会在二进制图像中引起噪声。这种噪声构成了字符后方分割的障碍。直接应用阈值方法会导致这些降级词图像的二进制版本不足。我们的预处理方法称为“阴影位置和减亮(SL〜* L)”,可通过形态学操作自适应,准确且无需手动微调参数,即可将这些关键阴影定位在灰度退化图像上,并在应用最终阈值处理之前将其减亮。以此方式,可以将没有不可预测的和不适当的噪声的增强的二进制图像提供给后续的字符分割。然后,可以对足够的二进制字符进行分段并将其提取为光学字符识别(OCR)应用程序的输入数据,从而节省了计算量并提高了识别率。所提出的方法是用一组从未经精密成像系统采集的真实照片中提取的原始图像进行实验测试的。对实验结果的定性分析得出的结论是,所提出的预处理方法显着提高了阈值结果的质量。此外,使用1194个退化词图像的测试数据进行的定量评估显示了所提出的预处理方法提高其字符的分割和识别率的必要性和有效性。此外,所提出的方法的优点在于,作为简单且易于实现的全局阈值技术的大津方法足以减少计算负荷。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号