首页> 外文期刊>Signal, Image and Video Processing >Foreground text segmentation in complex color document images using Gabor filters - Springer
【24h】

Foreground text segmentation in complex color document images using Gabor filters - Springer

机译:使用Gabor滤镜的复杂彩色文档图像中的前景文本分割-Springer

获取原文
获取原文并翻译 | 示例
       

摘要

Extraction of foreground contents in complex background document images is very difficult as background texture, color and foreground font, size, color, tilt are not known in advance. In this work, we propose a RGB color model for the input of complex color document images. An algorithm to detect the text regions using Gabor filters followed by extraction of text using color feature luminance is developed too. The proposed approach consists of three stages. Based on the Gabor features, the candidate image segments containing text are detected in stage-1. Because of complex background, certain amount of high frequency non-text objects in the background are also detected as text objects in stage-1. In stage-2, certain amount of false text objects is dropped by performing the connected component analysis. In stage-3, the image segments containing textual information, which are obtained from the previous stage are binarized to extract the foreground text. The color feature luminance is extracted from the input color document image. The threshold value is derived automatically using this color feature. The proposed approach handles both printed and handwritten color document images with foreground text in any color, font, size and orientation. For experimental evaluations, we have considered a variety of document images having non-uniform/uniform textured and multicolored background. Performance of segmentation of foreground text is evaluated on a commercially available OCR. Evaluation results show better recognition accuracy of foreground characters in the processed document images against unprocessed document images.
机译:由于背景纹理,颜色和前景字体,大小,颜色,倾斜度事先未知,因此很难提取复杂背景文档图像中的前景内容。在这项工作中,我们提出了用于输入复杂彩色文档图像的RGB颜色模型。还开发了一种算法,该算法使用Gabor滤波器检测文本区域,然后使用颜色特征亮度提取文本。提议的方法包括三个阶段。基于Gabor特征,在阶段1中检测包含文本的候选图像段。由于背景复杂,背景中的一定数量的高频非文本对象在阶段1中也被检测为文本对象。在阶段2中,通过执行连接的组件分析来丢弃一定数量的错误文本对象。在阶段3中,将从前一阶段获得的包含文本信息的图像段进行二值化以提取前景文本。从输入的彩色文档图像中提取颜色特征亮度。使用此颜色功能会自动得出阈值。所提出的方法可处理带有任何颜色,字体,大小和方向的前景文本的印刷和手写彩色文档图像。为了进行实验评估,我们考虑了具有不均匀/均匀纹理和彩色背景的各种文档图像。在商用OCR上评估前景文本的分割性能。评估结果表明,相对于未处理的文档图像,已处理的文档图像中前景字符的识别精度更高。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号