Conference: Pattern Recognition, 2008 19th International Conference on

Robust outdoor text detection using text intensity and shape features

Abstract

Recognizing text in camera images is a known hard problem because text is difficult to segment from varied and complicated backgrounds. In this paper, we propose an algorithm that combines two novel filters with a basic component-based text detection framework. The framework thresholds images with the Niblack algorithm and groups the resulting components into regions using commonly used geometric features. The intensity filter measures the overlap between the intensity histogram of a component and that of its adjoining area. We have found that this overlap is large for non-text regions, so components with large overlap values can be pruned. The shape filter, on the other hand, deletes regions whose constituent components come from the same object, since most words consist of different characters. The proposed method is evaluated on the text locating database of 249 images used in the ICDAR 2003 robust reading competition. The results show that the algorithm is robust on both indoor and outdoor images, even those with complex backgrounds, which are usually hard for traditional component-based algorithms to handle. On the ICDAR 2003 challenge experiment, the algorithm achieves a precision (p) of 66%, a recall (r) of 46%, and a combined rate (f) of 54%, which is the best reported in the literature on this dataset.
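The intensity filter and the combined rate can be illustrated with a minimal sketch. The abstract does not define the overlap measure, so the code below assumes a plain histogram intersection (sum of bin-wise minima of two normalized histograms); the bin count, the 0.5 pruning threshold, and all function names are hypothetical choices for illustration, not values from the paper. The combined rate is assumed to be the equally weighted harmonic mean of precision and recall, which is consistent with the reported 66%, 46%, and 54%.

```python
from typing import List, Sequence


def normalized_histogram(pixels: Sequence[int], bins: int = 32) -> List[float]:
    """Histogram of 8-bit intensities (0..255), normalized to sum to 1."""
    if not pixels:
        raise ValueError("pixels must not be empty")
    counts = [0] * bins
    bin_width = 256 / bins
    for value in pixels:
        index = min(int(value / bin_width), bins - 1)
        counts[index] += 1
    return [count / len(pixels) for count in counts]


def histogram_overlap(component: Sequence[int], surround: Sequence[int],
                      bins: int = 32) -> float:
    """Histogram intersection in [0, 1]; an assumed stand-in for the
    paper's overlap measure, which the abstract does not specify."""
    h_component = normalized_histogram(component, bins)
    h_surround = normalized_histogram(surround, bins)
    return sum(min(a, b) for a, b in zip(h_component, h_surround))


def keep_component(component: Sequence[int], surround: Sequence[int],
                   max_overlap: float = 0.5) -> bool:
    """Prune components whose intensities resemble their adjoining area.
    The 0.5 threshold is a hypothetical value, not taken from the paper."""
    return histogram_overlap(component, surround) <= max_overlap


def combined_rate(precision: float, recall: float, alpha: float = 0.5) -> float:
    """Weighted harmonic mean of precision and recall (assumed definition of f)."""
    return 1.0 / (alpha / precision + (1.0 - alpha) / recall)


if __name__ == "__main__":
    dark_text = [20] * 50            # pixels of a candidate component
    bright_background = [220] * 50   # pixels of its adjoining area
    print(histogram_overlap(dark_text, bright_background))  # disjoint histograms
    print(keep_component(dark_text, bright_background))     # kept as likely text
    print(round(combined_rate(0.66, 0.46), 2))               # matches the reported f
```

A component whose pixels are well separated from its surroundings gets a small overlap and is kept, while a component that blends into its adjoining area gets an overlap near 1 and is pruned.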
