首页> 外文期刊>Pattern recognition letters >Detecting natural scenes text via auto image partition, two-stage grouping and two-layer classification
【24h】

Detecting natural scenes text via auto image partition, two-stage grouping and two-layer classification

机译:通过自动图像分割,两阶段分组和两层分类来检测自然场景文本

获取原文
获取原文并翻译 | 示例
           

摘要

Text detection in natural scene images is important and challenging work for image analysis. In this paper, we present a robust system to detect natural scene text according to text region appearances. The framework includes three parts: auto image partition, two-stage grouping and two-layer classification. The first part partitions images into unconstrained sub-images through statistical distribution of sampling points. The designed two-stage grouping method performs grouping in each sub-image in first stage and connects different partitioned image regions in second stage to group connected components (CCs) to text regions. Then a two-layer classification mechanism is designed for classifying candidate text regions. The first layer is to compute the similarity score of region blocks and the second layer is a SVM classifier using HOG features. We add a normalization step to rectify perspective distortion before candidate text region classification which improves the accuracy and robustness of the final output result. The proposed system is evaluated on four types datasets including two ICDAR Robust Reading Competition datasets, a born-digital image dataset, a video image dataset and a perspective distortion image dataset. The experimental results demonstrate that our proposed framework outperforms state-of-the-art localization algorithms and is robust in dealing with multiple background outliers. (C) 2015 Elsevier B.V. All rights reserved.
机译:自然场景图像中的文本检测对于图像分析而言是重要且具有挑战性的工作。在本文中,我们提出了一个强大的系统,可以根据文本区域的外观来检测自然场景文本。该框架包括三个部分:自动图像分区,两阶段分组和两层分类。第一部分通过采样点的统计分布将图像划分为不受约束的子图像。设计的两阶段分组方法在第一阶段对每个子图像执行分组,然后在第二阶段将不同的分区图像区域连接起来,以将连接的组件(CC)分组到文本区域。然后设计了一种两层分类机制来对候选文本区域进行分类。第一层是计算区域块的相似性得分,第二层是使用HOG特征的SVM分类器。我们在候选文本区域分类之前添加了归一化步骤来纠正透视失真,从而提高了最终输出结果的准确性和鲁棒性。该系统对四种类型的数据集进行了评估,包括两个ICDAR健壮的阅读比赛数据集,一个数字图像数据集,一个视频图像数据集和一个透视畸变图像数据集。实验结果表明,我们提出的框架优于最新的定位算法,并且在处理多个背景离群值方面具有鲁棒性。 (C)2015 Elsevier B.V.保留所有权利。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号