首页> 外文会议>IAPR International Workshop on Document Analysis Systems >A New Common Points Detection Method for Classification of 2D and 3D Texts in Video/Scene Images

【24h】

A New Common Points Detection Method for Classification of 2D and 3D Texts in Video/Scene Images

机译：视频/场景图像中2D和3D文本分类的新公共点检测方法

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

Achieving high quality recognition result for video and natural scene images that contain both standard 2D text as well as decorative 3D text is challenging. Methods developed for 2D text may fail for 3D text due to the presence of pixels representing shadow and depth in the 3D text. This work aims at classification of 2D and 3D texts in video or scene images such that one can choose an appropriate method in the classified text for achieving better results. The proposed method explores Generalized Gradient Vector Flow (GGVF) for finding dominant points for input 2D and 3D text line images based on opposite direction symmetry. For each dominant point, our approach finds distance between neighbor points and plots a histogram to choose points which contribute to the highest peak as candidates. Distance symmetry between a candidate point and its neighbor points is checked and if a candidate point is visited twice, a common point is created. Statistical features such as the mean and standard deviation of the common points and candidate points are extracted to feed to Neural Network (NN) for classification. Experimental results on dataset of 2D-3D text line images and the dataset collected from standard natural scene images show that the proposed method outperforms exiting methods. Furthermore, recognition experiments before and after classification show recognition performance improves significantly as a result of applying our method.

机译：对于既包含标准2D文本又包含装饰性3D文本的视频和自然场景图像，要获得高质量的识别结果，将是一项挑战。为2D文本开发的方法可能因3D文本中存在表示阴影和深度的像素而无法用于3D文本。这项工作旨在对视频或场景图像中的2D和3D文本进行分类，以便人们可以在分类的文本中选择一种合适的方法以获得更好的效果。所提出的方法探索了基于相反方向对称性的通用梯度向量流（GGVF），以找到输入2D和3D文本线图像的优势点。对于每个主要点，我们的方法都会找到相邻点之间的距离，并绘制直方图以选择对峰最高有贡献的点作为候选点。检查候选点及其相邻点之间的距离对称性，如果两次访问候选点，则会创建一个公共点。统计特征（例如公共点和候选点的均值和标准差）被提取并馈送到神经网络（NN）进行分类。在2D-3D文本行图像的数据集和从标准自然场景图像收集的数据集上的实验结果表明，该方法优于现有方法。此外，分类前后的识别实验表明，由于应用了我们的方法，识别性能得到了显着提高。

著录项

来源
《IAPR International Workshop on Document Analysis Systems 》|2020年|512-528|共17页
会议地点
作者
Lokesh Nandanwar; Palaiahnakote Shivakumara; Ahlad Kumar; Tong Lu; Umapada Pal; Daniel Lopresti;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类
关键词
Gradient Vector Flow; Edge points; Candidate points; 2D text; 3D text; Text recognition; Video/scene images;

机译：梯度矢量流;边缘点;候选分数; 2D文字; 3D文字;文字识别;视频/场景图像;

相似文献

外文文献
中文文献
专利

1. A new method for multi-oriented graphics-scene-3D text classification in video [J] . Xu Jiamin, Shivakumara Palaiahnakote, Lu Tong, Pattern Recognition: The Journal of the Pattern Recognition Society . 2016 ,第Null期

机译：视频多方向图形场景3D文本分类的新方法
2. Gradient Vector Flow and Grouping-Based Method for Arbitrarily Oriented Scene Text Detection in Video Images [J] . Shivakumara P., Phan T.Q., Lu S., IEEE Transactions on Circuits and Systems for Video Technology . 2013 ,第10期

机译：基于梯度矢量流和分组的视频图像场景文本任意检测方法
3. Multi-Script-Oriented Text Detection and Recognition in Video/Scene/Born Digital Images [J] . Raghunandan K. S., Shivakumara Palaiahnakote, Roy Sangheeta, IEEE Transactions on Circuits and Systems for Video Technology . 2019 ,第4期

机译：视频/场景/出生数字图像中面向多脚本的文本检测和识别
4. 2D and 3D Video Scene Text Classification [C] . Xu Jiamin, Shivakumara Palaiahnakote, Lu Tong, International Conference on Pattern Recognition . 2014

机译：2D和3D视频场景文本分类
5. 3D Object Detection, Instance Segmentation and Classification from 3D Range and 2D Color Images [D] . Shen, Xiaoke. 2021

机译：3D对象检测，实例分段和3D范围和2D彩色图像的分类
6. TMOD-13. A NOVEL 3D HIGH-RESOLUTION HISTOPATHOLOGICAL IMAGE RECONSTRUCTION METHOD VERSUS COMMON 2D AND 3D IMAGING METHODOLOGIES FOR APPLICATION IN CANCER SPHEROID RESEARCH: WHICH IS BETTER? [O] . James Samarasekara, Filomena Esteves, Alistair Curd, -1

机译：TMOD-13。新型3D高分辨率组织病理学图像重建方法与常用2D和3D成像方法在癌症球状体研究中的应用：哪种更好？
7. 2D and 3D Video Scene Text Classification [O] . Jiamin Xu, Palaiahnakote Shivakumara, Tong Lu, 2015

机译：2D和3D视频场景文本分类
8. Distance Metric between 3D Models and 2D Images for Recognition and Classification. [R] . Basri, R., Weinshall, D. 1992

机译：用于识别和分类的3D模型和2D图像之间的距离度量。

A New Common Points Detection Method for Classification of 2D and 3D Texts in Video/Scene Images

摘要

著录项

相似文献

相关主题

期刊订阅