首页> 外文会议>IAPR International Conference on Document Analysis and Recognition >Cascaded Segmentation-Detection Networks for Word-Level Text Spotting
【24h】

Cascaded Segmentation-Detection Networks for Word-Level Text Spotting

机译:用于字级文本拍摄的级联分段检测网络

获取原文

摘要

We introduce an algorithm for word-level text spotting that is able to accurately and reliably determine the bounding regions of individual words of text "in the wild". Our system is formed by the cascade of two convolutional neural networks. The first network is fully convolutional and is in charge of detecting areas containing text. This results in a very reliable but possibly inaccurate segmentation of the input image. The second network (inspired by the popular YOLO architecture) analyzes each segment produced in the first stage, and predicts oriented rectangular regions containing individual words. No post-processing (e.g. text line grouping) is necessary. With execution time of 450 ms for a 1000×560 image on a Titan X GPU, our system achieves good performance on the ICDAR 2013, 2015 benchmarks [2], [1].
机译:我们介绍了一种用于字级文本的算法,可以准确且可靠地确定文本中的文本的界限区域“在野外”。我们的系统由两个卷积神经网络的级联形成。第一网络是完全卷积的,负责检测包含文本的区域。这导致非常可靠但可能不准确的输入图像的分割。第二个网络(受欢迎的YOLO架构的启发)分析了第一阶段中产生的每个段,并预测包含单个单词的导向矩形区域。没有后处理(例如文本行分组)是必要的。在Titan X GPU上的一个1000×560图像的执行时间为450 ms,我们的系统在ICDAR 2015年的基准[2],[1]上实现了良好的性能。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号