首页> 外文会议>International conference on multimedia modeling >Robust Scene Text Detection for Multi-script Languages Using Deep Learning
【24h】

Robust Scene Text Detection for Multi-script Languages Using Deep Learning

机译:使用深度学习对多脚本语言进行可靠的场景文本检测

获取原文

摘要

Text detection in natural images has been a high demand for a lot real-life applications such as image retrieval and self-navigation. This work deals with the problem of robust text detection especially for multi-script in natural scene images. Unlike the existing works that consider multi-script characters as groups of text fragments, we consider them as non-connected components. Specifically, we firstly propose a novel representation named Linked Extremal Regions (LER) to extract full characters instead of fragments of scene characters. Secondly, we propose a two-stage convolution neural networks for discriminating multi-script texts in clutter background images for more robust text detection. Experimental results on three well-known datasets, namely, ICDAR 2011, 2013 and MSRA-TD500, demonstrate that the proposed method outperforms the state-of-the-art methods, and is also language independent.
机译:对于许多现实生活中的应用(例如图像检索和自我导航),自然图像中的文本检测一直是很高的要求。这项工作解决了鲁棒的文本检测问题,尤其是对于自然场景图像中的多脚本文本检测。与将多脚本字符视为文本片段组的现有作品不同,我们将它们视为非连接组件。具体来说,我们首先提出一种名为链接的极端区域(LER)的新颖表示形式,以提取完整的角色而不是场景角色的片段。其次,我们提出了一种两阶段卷积神经网络,用于区分杂乱背景图像中的多脚本文本,以实现更鲁棒的文本检测。在三个著名的数据集ICDAR 2011、2013和MSRA-TD500上的实验结果表明,该方法优于最新方法,并且与语言无关。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号