A Unified Video Text Detection Method with Network Flow

机译：具有网络流的统一视频文本检测方法

获取原文

页面导航

摘要
著录项
引文网络
相似文献
相关主题

摘要

Scene text detection in videos has many application needs but has drawn less attention than that in images. Existing methods for video text detection perform unsatisfactorily because of the insufficient utilization of spatial and temporal information. In this paper, we propose a novel video text detection method with network flow based tracking. The system first applies a newly proposed Fully Convolutional Neural Network (FCN) based scene text detection method to detect texts in individual frames and then track proposals in adjacent frames with a motion-based method. Next, the text association problem is formulated into a cost-flow network and text trajectories are derived from the network with a min-cost flow algorithm. At last, the trajectories are post-processed to improve the precision accuracy. The method can detect multi-oriented scene text in videos and incorporate spatial and temporal information efficiently. Experimental results show that the method improves the detection performance remarkably on benchmark datasets, e.g., by a 15.66% increase of ATA Average Tracking Accuracy) on ICDAR video scene text dataset.

机译：视频中的视频文本检测有许多应用需求，但绘制的注意力不如图像中的注意。由于空间和时间信息的利用不足，视频文本检测的现有方法表现不佳。在本文中，我们提出了一种新颖的视频文本检测方法，具有基于网络流的跟踪。该系统首先应用一种基于新提出的完全卷积神经网络（FCN）的场景文本检测方法，以检测单个帧中的文本，然后以基于运动的方法在相邻帧中跟踪提案。接下来，将文本关联问题配制成成本流网络，并且通过最小成本流算法从网络导出文本轨迹。最后，轨迹被处理后以提高精度准确性。该方法可以检测视频中的多面向场景文本，有效地合并空间和时间信息。实验结果表明，该方法在基准数据集中显着提高了检测性能，例如，在ICDAR视频场景文本数据集上的15.66 ％增加ATA平均跟踪精度的增加。

著录项

来源
《IAPR International Conference on Document Analysis and Recognition》|2017年|732p|共6页
会议地点
作者
Xue-Hang Yang; Wenhao He; Fei Yin; Cheng-Lin Liu;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类 TP391.41-53;
关键词
Tracking; Trajectory; Proposals; Streaming media; Image edge detection; Benchmark testing; Text recognition;

机译：跟踪;轨迹;提案;流媒体;图像边缘检测;基准测试;文本识别;

相似文献

外文文献
中文文献
专利

1. Chinese text-line detection from web videos with fully convolutional networks [J] . Chun Yang, Wei-Yi Pei, Long-Huang Wu, Big Data Analytics . 2018,第1期

机译：具有完全卷积网络的网络视频中的中文文本行检测
2. Gradient Vector Flow and Grouping-Based Method for Arbitrarily Oriented Scene Text Detection in Video Images [J] . Shivakumara P., Phan T.Q., Lu S., IEEE Transactions on Circuits and Systems for Video Technology . 2013,第10期

机译：基于梯度矢量流和分组的视频图像场景文本任意检测方法
3. Robust detection of video text using an efficient hybrid method via key frame extraction and text localization [J] . Sravani Meesala, Maheswararao Aggala, Murthy Meesala Krishna Multimedia Tools and Applications . 2021,第6期

机译：使用键帧提取和文本本地化使用高效的混合方法鲁棒检测视频文本
4. A Unified Video Text Detection Method with Network Flow [C] . Xue-Hang Yang, Wenhao He, Fei Yin, IAPR International Conference on Document Analysis and Recognition . 2017

机译：具有网络流的统一视频文本检测方法
5. A Unified Framework based on Convolutional Neural Networks for Interpreting Carotid Intima-Media Thickness Videos [D] . Shin, Jaeyul 2016

机译：基于卷积神经网络的统一框架，用于解释颈动脉内膜介质厚度视频
6. Automatic Detection of the Pharyngeal Phase in Raw Videos for the Videofluoroscopic Swallowing Study Using Efficient Data Collection and 3D Convolutional Networks [O] . Jong Taek Lee, Eunhee Park, Tae-Du Jung 2019

机译：使用有效的数据收集和3D卷积网络自动检测原始视频中的咽相以便进行视频荧光吞咽研究
7. Chinese text-line detection from web videos with fully convolutional networks [O] . Chun Yang, Wei-Yi Pei, Long-Huang Wu, 2018

机译：中文文本线路从网络视频与完全卷积网络检测

A Unified Video Text Detection Method with Network Flow

摘要

著录项

引文网络

相似文献

相关主题

期刊订阅