首页> 外文会议>IAPR International Conference on Document Analysis and Recognition >Cascaded Segmentation-Detection Networks for Word-Level Text Spotting
【24h】

Cascaded Segmentation-Detection Networks for Word-Level Text Spotting

机译:用于单词级文本点检的级联分段检测网络

获取原文

摘要

We introduce an algorithm for word-level text spotting that is able to accurately and reliably determine the bounding regions of individual words of text "in the wild". Our system is formed by the cascade of two convolutional neural networks. The first network is fully convolutional and is in charge of detecting areas containing text. This results in a very reliable but possibly inaccurate segmentation of the input image. The second network (inspired by the popular YOLO architecture) analyzes each segment produced in the first stage, and predicts oriented rectangular regions containing individual words. No post-processing (e.g. text line grouping) is necessary. With execution time of 450 ms for a 1000 by 560 image on a Titan X GPU, our system achieves good performance on the ICDAR 2013, 2015 benchmarks [2], [1].
机译:我们介绍了一种用于单词级文本识别的算法,该算法能够准确可靠地确定“野外”文本单个单词的边界区域。我们的系统由两个卷积神经网络的级联形成。第一个网络是完全卷积的,负责检测包含文本的区域。这导致输入图像的非常可靠但可能不准确的分割。第二个网络(受流行的YOLO架构的启发)分析了第一阶段中产生的每个片段,并预测了包含单个单词的定向矩形区域。无需后期处理(例如,文本行分组)。在Titan X GPU上执行1000 x 560图像的执行时间为450毫秒,我们的系统在ICDAR 2013、2015基准测试[2],[1]上实现了良好的性能。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号