IEEE Transactions on Circuits and Systems for Video Technology
Gradient Vector Flow and Grouping-Based Method for Arbitrarily Oriented Scene Text Detection in Video Images


Abstract

Text detection in videos is challenging due to the low resolution and complex backgrounds of video frames. Moreover, the arbitrary orientation of scene text lines in video makes the problem even harder. This paper presents a new method that extracts text lines of any orientation based on gradient vector flow (GVF) and neighbor component grouping. The GVF of edge pixels in the Sobel edge map of the input frame is explored to identify the dominant edge pixels that represent text components. The method extracts the edge components corresponding to dominant pixels in the Sobel edge map, which we call text candidates (TC) of the text lines. We propose two grouping schemes. The first finds nearest neighbors based on geometrical properties of the TC to group broken segments and neighboring characters, which results in word patches. The end and junction points of the skeleton of each word patch are used to eliminate false positives, yielding candidate text components (CTC). The second scheme uses the direction and the size of the CTC to extract neighboring CTC and to restore missing CTC, which enables arbitrarily oriented text line detection in video frames. Experimental results on different datasets, including arbitrarily oriented text data, non-horizontal and horizontal text data, Hua's data and ICDAR-03 data (camera images), show that the proposed method outperforms existing methods in terms of recall, precision and F-measure.

