IAPR International Conference on Document Analysis and Recognition

A Unified Video Text Detection Method with Network Flow


Abstract

Scene text detection in videos has many application needs but has drawn less attention than text detection in images. Existing methods for video text detection perform unsatisfactorily because they make insufficient use of spatial and temporal information. In this paper, we propose a novel video text detection method with network-flow-based tracking. The system first applies a newly proposed Fully Convolutional Neural Network (FCN) based scene text detection method to detect text in individual frames, and then tracks the resulting proposals across adjacent frames with a motion-based method. Next, the text association problem is formulated as a cost-flow network, and text trajectories are derived from the network with a min-cost flow algorithm. Finally, the trajectories are post-processed to improve precision. The method can detect multi-oriented scene text in videos and incorporates spatial and temporal information efficiently. Experimental results show that the method improves detection performance remarkably on benchmark datasets, e.g., by a 15.66% increase in ATA (Average Tracking Accuracy) on the ICDAR video scene text dataset.
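
The text association step described in the abstract corresponds to the standard min-cost flow formulation used in tracking-by-detection. Below is a minimal sketch of that formulation using networkx, not the authors' implementation: the cost values (det_cost, entry_cost), the scaling constant, the link_cost function, and the fixed trajectory count k are illustrative assumptions, while the paper's actual costs come from its FCN detector and motion-based tracker.

```python
# Minimal sketch (illustrative, not the paper's code) of the cost-flow network
# for associating per-frame text detections into trajectories.
import networkx as nx

def associate_detections(frames, link_cost, k, det_cost=-10.0, entry_cost=5.0, scale=100):
    """frames: list of lists of detection ids, one inner list per video frame.
    link_cost(a, b): cost of linking detection a to detection b in the next frame.
    k: number of text trajectories (units of flow) to extract.
    Returns the flow dict computed by networkx.min_cost_flow."""
    G = nx.DiGraph()
    G.add_node("S", demand=-k)  # source: every trajectory starts here
    G.add_node("T", demand=k)   # sink: every trajectory ends here

    for t, dets in enumerate(frames):
        for d in dets:
            u, v = ("in", t, d), ("out", t, d)
            # Split each detection into a unit-capacity edge; a negative
            # weight rewards covering detections with some trajectory.
            G.add_edge(u, v, capacity=1, weight=int(det_cost * scale))
            # Any detection may start or terminate a trajectory.
            G.add_edge("S", u, capacity=1, weight=int(entry_cost * scale))
            G.add_edge(v, "T", capacity=1, weight=int(entry_cost * scale))

    # Transition edges between detections in adjacent frames; cheaper edges
    # (e.g. high spatial overlap, similar appearance) are preferred.
    for t in range(len(frames) - 1):
        for a in frames[t]:
            for b in frames[t + 1]:
                G.add_edge(("out", t, a), ("in", t + 1, b),
                           capacity=1, weight=int(link_cost(a, b) * scale))

    return nx.min_cost_flow(G)  # integer weights keep network simplex exact
```

In practice the number of trajectories is not known in advance; the usual remedy in min-cost flow trackers is to solve for increasing k and keep the value that minimizes total cost. The resulting per-trajectory paths would then be post-processed, as the abstract describes, to improve precision.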

