首页> 外文会议>International Conference on Pattern Recognition >Robust Outdoor Text Detection Using Text Intensity and Shape Features
【24h】

Robust Outdoor Text Detection Using Text Intensity and Shape Features

机译:使用文本强度和形状功能的强大户外文本检测

获取原文

摘要

Recognizing texts from camera images is a known hard problem because of the difficulties in text segmentation from the varied and complicated backgrounds. In this paper, we propose an algorithm that employs two novel filters and a basic component-based text detection framework. The framework uses the Niblack algorithm to threshold images and groups components into regions with commonly used geometry features. The intensity filter considers the overlap between the intensity histogram of a component and that of its adjoining area. For non-text regions, we have found that this overlap is large, and so we can prune out components with large values of this measure. The shape filter, on the other hand, deletes regions whose constituent components come from a same object, as most words consist of different characters. The proposed method is evaluated with the text locating database with 249 images used in the ICDAR2003 robust reading competition. The result shows that the algorithm is robust to both indoor images and outdoor images, even for the images of complex background, which usually is a hard factor to overcome for traditional component-based algorithms. In terms of performance statistics, we tested the algorithm on the ICDAR 2003 challenge experiment, and the algorithm achieves 66% precision rate (p), 46% recall rate (r), and 54% the combined rate (f), which is the best reported in the literature on this dataset.
机译:由于文本分段来自各种复杂的背景,识别来自摄像机图像的文本是一个已知的难题。在本文中,我们提出了一种算法,该算法采用了两个新颖的滤波器和基于基于组件的文本检测框架。该框架使用NiBlack算法将阈值图像和组组件与常用的几何特征阈值映像和组组件。强度滤波器考虑组件的强度直方图与其相邻区域之间的重叠。对于非文本区域,我们发现这个重叠很大,所以我们可以用这种措施的大值修剪零件。另一方面,形状过滤器删除组成组件来自同一对象的区域,因为大多数单词由不同的字符组成。该提出的方法是用文本定位数据库进行评估,其中ICDAR2003强大的阅读竞争中使用的249个图像。结果表明,即使对于复杂背景的图像,该算法对于室内图像和户外图像也是强大的,这通常是对基于组件的算法克服的难以克服的难度因素。在性能统计方面,我们在ICDAR 2003挑战实验上测试了算法,算法达到66%的精密率(P),46%召回率(R)和54%的组合率(F),这是在这个数据集的文献中最好报道。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号