首页> 外文期刊>Journal of electronic imaging >Multiorientation/multiscript scene text detection based on projection profile analysis and graph segmentation
【24h】

Multiorientation/multiscript scene text detection based on projection profile analysis and graph segmentation

机译:基于投影轮廓分析和图分割的多向/多场景文本检测

获取原文
获取原文并翻译 | 示例
           

摘要

Textline detection in natural images has been an important problem and researchers have attempted to address this problem by grouping connected components (CCs) into clusters corresponding to textlines. However, developing bottom-up rules that work for multiorientation and/or multiscript textlines is not a simple task. In order to address this problem, we propose a framework that incorporates projection profile analysis (PPA) into the CC-based approach. Specifically, we build a graph of CCs and recursively partition the graph into subgraphs, until textline structures are detected by PPA. Although PPA has been a common technique in document image processing, it was developed for scanned documents, and we also propose a method to compute projection profiles for CCs. Experimental results show that our method is efficient and achieves better or comparable performance on conventional datasets (ICDAR 2011/2013 and MSRA-TD500), and shows promising results on a challenging dataset (ICDAR 2015 incidental text localization dataset). (C) 2016 SPIE and IS& T
机译:自然图像中的文本行检测一直是一个重要的问题,研究人员已尝试通过将连接的组件(CC)分组为与文本行相对应的群集来解决此问题。但是,开发适用于多方向和/或多脚本文本行的自下而上的规则并非易事。为了解决此问题,我们提出了一个将投影轮廓分析(PPA)纳入基于CC的方法的框架。具体来说,我们构建了一个CC图,并将该图递归地划分为子图,直到PPA检测到文本行结构为止。尽管PPA是文档图像处理中的常用技术,但它是为扫描文档开发的,我们还提出了一种计算CC投影轮廓的方法。实验结果表明,我们的方法是有效的,并且在常规数据集(ICDAR 2011/2013和MSRA-TD500)上可以达到更好或相当的性能,并在具有挑战性的数据集(ICDAR 2015附带文本本地化数据集)上显示出可喜的结果。 (C)2016 SPIE和IS&T

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号