Tracking Based Multi-Orientation Scene Text Detection: A Unified Framework With Dynamic Programming

Chun Yang; Xu-Cheng Yin; Wei-Yi Pei; Shu Tian; Ze-Yu Zuo; Chao Zhu; Junchi Yan


Abstract

Multi-orientation text detection in scene videos poses several major challenges, including skew distortion, low contrast, and arbitrary motion. Most conventional video text detection methods operate on individual frames and have limited performance. In this paper, we propose a novel tracking-based multi-orientation scene text detection method that uses multiple frames within a unified framework via dynamic programming. First, a multi-information fusion-based multi-orientation text detection method is applied to each frame to extensively locate possible character candidates and extract text regions with multiple channels and scales. Second, an optimal tracking trajectory is learned and linked globally over consecutive frames by dynamic programming to refine the detection results using all detection, recognition, and prediction information. The proposed system achieves state-of-the-art performance on several public data sets of multi-orientation scene text images and videos, including MSRA-TD500, USTB-SV1K, and ICDAR 2015 Scene Videos.
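
The second step of the abstract, linking one trajectory globally over consecutive frames by dynamic programming, can be illustrated with a minimal Python sketch. It is not the authors' implementation: the abstract does not give the cost function, so the sketch assumes a simple objective (per-frame detection score plus box overlap between consecutive frames) and leaves out the recognition and prediction cues the paper mentions. The names `iou`, `link_trajectory`, `link_weight`, and `demo_frames` are hypothetical.

```python
from typing import List, Tuple

Box = Tuple[float, float, float, float]   # (x1, y1, x2, y2), axis-aligned
Candidate = Tuple[Box, float]             # (box, detection score)


def iou(a: Box, b: Box) -> float:
    """Intersection over union of two axis-aligned boxes."""
    inter_w = max(0.0, min(a[2], b[2]) - max(a[0], b[0]))
    inter_h = max(0.0, min(a[3], b[3]) - max(a[1], b[1]))
    inter = inter_w * inter_h
    union = (a[2] - a[0]) * (a[3] - a[1]) + (b[2] - b[0]) * (b[3] - b[1]) - inter
    return inter / union if union > 0 else 0.0


def link_trajectory(frames: List[List[Candidate]], link_weight: float = 1.0) -> List[int]:
    """Pick one candidate index per frame so that the sum of detection scores
    plus link_weight * IoU between consecutive picks is maximal (Viterbi-style DP)."""
    if not frames or any(not f for f in frames):
        raise ValueError("every frame needs at least one candidate")

    best = [score for _, score in frames[0]]   # best total ending at each candidate
    back: List[List[int]] = []                 # back-pointers for frames 1..T-1

    for t in range(1, len(frames)):
        new_best, ptr = [], []
        for box, score in frames[t]:
            # Best predecessor in the previous frame for this candidate.
            totals = [best[j] + link_weight * iou(prev_box, box)
                      for j, (prev_box, _) in enumerate(frames[t - 1])]
            j_star = max(range(len(totals)), key=totals.__getitem__)
            new_best.append(totals[j_star] + score)
            ptr.append(j_star)
        best = new_best
        back.append(ptr)

    # Trace the optimal path back from the best final candidate.
    k = max(range(len(best)), key=best.__getitem__)
    path = [k]
    for ptr in reversed(back):
        k = ptr[k]
        path.append(k)
    path.reverse()
    return path


# Toy data (made up for illustration): three frames, two candidates each.
demo_frames = [
    [((0, 0, 10, 10), 0.9), ((50, 50, 60, 60), 0.8)],
    [((51, 51, 61, 61), 0.6), ((1, 1, 11, 11), 0.7)],
    [((2, 2, 12, 12), 0.7), ((100, 100, 110, 110), 0.95)],
]

print(link_trajectory(demo_frames))                   # [0, 1, 0]
print(link_trajectory(demo_frames, link_weight=0.0))  # [0, 1, 1]
```

With the overlap term switched off (`link_weight=0.0`), the last frame picks the isolated high-score box. With it on, the path stays on the box that continues the track, which is the kind of refinement a multi-frame objective allows and a per-frame detector does not.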


Bibliographic record

Source
IEEE Transactions on Image Processing | 2017, issue 7 | pp. 3235-3248 | 14 pages
Authors
Chun Yang; Xu-Cheng Yin; Wei-Yi Pei; Shu Tian; Ze-Yu Zuo; Chao Zhu; Junchi Yan
Author affiliations

Department of Computer Science and Technology, School of Computer and Communication Engineering, University of Science and Technology Beijing, Beijing, China;

Department of Computer Science and Technology, School of Computer and Communication Engineering, University of Science and Technology Beijing, Beijing, China;

Department of Computer Science and Technology, School of Computer and Communication Engineering, University of Science and Technology Beijing, Beijing, China;

Department of Computer Science and Technology, School of Computer and Communication Engineering, University of Science and Technology Beijing, Beijing, China;

weibo.com, Beijing, China;

Department of Computer Science and Technology, School of Computer and Communication Engineering, University of Science and Technology Beijing, Beijing, China;

Department of Computer Science and Technology, East China Normal University, Shanghai, China;

Original format: PDF
Language: English
Keywords
Videos; Feature extraction; Tracking; Text recognition; Robustness; Dynamic programming; Distortion;


Similar literature

1. Scene video text tracking based on hybrid deep text detection and layout constraint [J] . Wang Xihan, Feng Xiaoyi, Xia Zhaoqiang Neurocomputing . 2019, issue Octa21

2. Visual tracking based on a unified tracking-and-detection framework with spatial-temporal consistency filtering [J] . Fang Yang, Ka Seunghyun, Jo Geun-Sik Computers and Electrical Engineering . 2019

3. TT-XSS: A novel taint tracking based dynamic detection framework for DOM Cross-Site Scripting [J] . Ran Wang, Guangquan Xu, Xianjiao Zeng, Journal of Parallel and Distributed Computing . 2018, issue PTa1

4. Multi-strategy tracking based text detection in scene videos [C] . Ze-Yu Zuo, Shu Tian, Wei-yi Pei, International Conference on Document Analysis and Recognition . 2015

5. Unified detection and recognition for reading text in scene images [D] . Weinman, Jerod J. 2008

6. Road Scene Simulation Based on Vehicle Sensors: An Intelligent Framework Using Random Walk Detection and Scene Stage Reconstruction [O] . Yaochen Li, Zhichao Cui, Yuehu Liu, 2018

7. Parameter Adjustment for a Dynamic Programming Track-before-Detect-Based Target Detection Algorithm [O] . O Nichtern, S R Rotman 2008
