首页> 外文会议>IAPR International Conference on Document Analysis and Recognition >A Unified Video Text Detection Method with Network Flow
【24h】

A Unified Video Text Detection Method with Network Flow

机译:具有网络流的统一视频文本检测方法

获取原文

摘要

Scene text detection in videos has many application needs but has drawn less attention than that in images. Existing methods for video text detection perform unsatisfactorily because of the insufficient utilization of spatial and temporal information. In this paper, we propose a novel video text detection method with network flow based tracking. The system first applies a newly proposed Fully Convolutional Neural Network (FCN) based scene text detection method to detect texts in individual frames and then track proposals in adjacent frames with a motion-based method. Next, the text association problem is formulated into a cost-flow network and text trajectories are derived from the network with a min-cost flow algorithm. At last, the trajectories are post-processed to improve the precision accuracy. The method can detect multi-oriented scene text in videos and incorporate spatial and temporal information efficiently. Experimental results show that the method improves the detection performance remarkably on benchmark datasets, e.g., by a 15.66% increase of ATA Average Tracking Accuracy) on ICDAR video scene text dataset.
机译:视频中的视频文本检测有许多应用需求,但绘制的注意力不如图像中的注意。由于空间和时间信息的利用不足,视频文本检测的现有方法表现不佳。在本文中,我们提出了一种新颖的视频文本检测方法,具有基于网络流的跟踪。该系统首先应用一种基于新提出的完全卷积神经网络(FCN)的场景文本检测方法,以检测单个帧中的文本,然后以基于运动的方法在相邻帧中跟踪提案。接下来,将文本关联问题配制成成本流网络,并且通过最小成本流算法从网络导出文本轨迹。最后,轨迹被处理后以提高精度准确性。该方法可以检测视频中的多面向场景文本,有效地合并空间和时间信息。实验结果表明,该方法在基准数据集中显着提高了检测性能,例如,在ICDAR视频场景文本数据集上的15.66 %增加ATA平均跟踪精度的增加。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号