A New Technique for Multi-Oriented Scene Text Line Detection and Tracking in Video

Wu Liang; Shivakumara Palaiahnakote; Lu Tong; Tan Chew Lim

Abstract

Text detection and tracking in video is challenging due to variations in contrast, resolution and background, as well as different orientations and text movements. In addition, the presence of both caption and scene texts in video aggravates the problem because these two text types differ significantly in their characteristics. This paper proposes a new technique for detecting and tracking video texts of any orientation by using spatial and temporal information, respectively. The technique explores gradient directional symmetry at the component level for smoothing edge components before text detection. Spatial information is preserved by forming a Delaunay triangulation in a novel way at this level, which results in text candidates. Text characteristics are then proposed in a different way for eliminating false text candidates, which results in potential text candidates. Grouping is then proposed for combining potential text candidates regardless of orientation, based on the nearest neighbor criterion. To tackle the problems of multi-font and multi-sized texts, we propose multi-scale integration by a pyramid structure, which helps in extracting full text lines. The detected text lines are then tracked in video by matching the subgraphs of the triangulation. Experimental results for text detection and tracking on our video dataset, the benchmark video datasets, and the natural scene image benchmark datasets show that the proposed method is superior to the state-of-the-art methods in terms of recall, precision, and F-measure.
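
The abstract does not give the details of the grouping step, so the following is only a minimal sketch of the general idea of orientation-independent grouping by a nearest neighbor criterion, not the authors' algorithm. It assumes each potential text candidate is reduced to a 2-D centroid, and the function name `group_by_nearest_neighbor` and the threshold `max_dist` are hypothetical choices made for illustration.

```python
import math


def group_by_nearest_neighbor(points, max_dist):
    """Group 2-D points by linking each point to its nearest neighbor.

    Assumption (not from the paper): candidates are represented by
    centroids, and a link is made only if the nearest neighbor lies
    within max_dist. Only Euclidean distance is used, so the grouping
    does not depend on the direction of the text line.
    """
    parent = list(range(len(points)))  # union-find forest

    def find(i):
        # Find the group representative, compressing the path as we go.
        while parent[i] != i:
            parent[i] = parent[parent[i]]
            i = parent[i]
        return i

    for i, p in enumerate(points):
        best_j, best_d = None, float("inf")
        for j, q in enumerate(points):
            if i == j:
                continue
            d = math.dist(p, q)
            if d < best_d:
                best_j, best_d = j, d
        # Merge the point with its nearest neighbor if it is close enough.
        if best_j is not None and best_d <= max_dist:
            parent[find(i)] = find(best_j)

    groups = {}
    for i, p in enumerate(points):
        groups.setdefault(find(i), []).append(p)
    return sorted(groups.values())
```

For instance, calling `group_by_nearest_neighbor` on three centroids spaced along a diagonal plus two centroids stacked vertically elsewhere would, with a suitable `max_dist`, yield two groups, one per hypothetical text line, even though the lines have different orientations.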

Bibliographic details

Source
Multimedia, IEEE Transactions on | 2015, Issue 8 | pp. 1137-1152 | 16 pages
Authors
Wu Liang; Shivakumara Palaiahnakote; Lu Tong; Tan Chew Lim
Affiliation
National Key Lab for Novel Software Technology, Nanjing University, Nanjing, China
Original format
PDF
Language
English
Keywords
Delaunay triangulation; multi-oriented video text detection; multi-sized text detection; text detection; text tracking
