Multi-Script-Oriented Text Detection and Recognition in Video/Scene/Born Digital Images

Raghunandan K. S.; Shivakumara Palaiahnakote; Roy Sangheeta; Kumar G. Hemantha; Pal Umapada; Lu Tong


Abstract

Achieving good text detection and recognition results for multi-script-oriented images is a challenging task. First, we explore bit plane slicing to exploit the most significant bit information for identifying text components. A new iterative nearest neighbor symmetry is then proposed, based on the shapes of convex and concave deficiencies of text components in bit planes, to identify candidate planes. Further, we introduce a new concept called mutual nearest neighbor pair components, based on gradient direction, to identify representative pairs of texts in each candidate bit plane. The representative pairs are used to restore words with the help of the edge image of the input image, which yields the text detection results (words). Second, we propose a new idea of fixing a window for the character components of arbitrarily oriented words, based on the angular relationship between sub-bands and a fused band. For each window, we extract features in the contourlet wavelet domain to detect characters with the help of an SVM classifier. Further, we propose to explore an HMM for recognizing characters and words of any orientation using the same feature vector. The proposed method is evaluated on standard databases such as ICDAR and YVT video data; ICDAR, SVT, and MSRA scene data; ICDAR born digital data; and multi-lingual data, to show its superiority to state-of-the-art methods.
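As a rough illustration of two building blocks named in the abstract, the following Python sketch shows generic bit plane slicing of an 8-bit grayscale image and a plain mutual nearest neighbor pairing of points. It is not the authors' implementation. The function names `bit_planes` and `mutual_nearest_pairs` are hypothetical, and the pairing here uses Euclidean distance between 2-D points (for example, component centroids) as a stand-in assumption, whereas the paper bases its pairing on gradient direction.

```python
import numpy as np


def bit_planes(gray: np.ndarray) -> np.ndarray:
    """Split an 8-bit grayscale image into its 8 binary bit planes.

    Returns an array of shape (8, H, W). Index 0 holds the least
    significant bit and index 7 the most significant bit.
    """
    if gray.dtype != np.uint8:
        raise ValueError("expected an 8-bit (uint8) grayscale image")
    # One shift amount per plane, broadcast against the image.
    shifts = np.arange(8, dtype=np.uint8).reshape(8, 1, 1)
    return (gray[np.newaxis, ...] >> shifts) & 1


def mutual_nearest_pairs(points: np.ndarray) -> list[tuple[int, int]]:
    """Return index pairs (i, j), i < j, that are each other's nearest neighbor.

    ASSUMPTION: Euclidean distance between points is used for illustration;
    the paper's criterion is based on gradient direction instead.
    """
    pts = np.asarray(points, dtype=float)
    if len(pts) < 2:
        return []
    # Pairwise distance matrix; a point is never its own neighbor.
    diff = pts[:, np.newaxis, :] - pts[np.newaxis, :, :]
    dist = np.sqrt((diff ** 2).sum(axis=-1))
    np.fill_diagonal(dist, np.inf)
    nearest = dist.argmin(axis=1)
    # Keep a pair only when the nearest-neighbor relation holds both ways.
    return [
        (i, int(j))
        for i, j in enumerate(nearest)
        if i < j and nearest[j] == i
    ]
```

In this sketch, `bit_planes(image)[7]` gives the most significant bit plane, which tends to carry the coarse structure of the image, and `mutual_nearest_pairs` discards any point whose nearest neighbor prefers a different partner.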


Bibliographic details

Source
IEEE Transactions on Circuits and Systems for Video Technology | 2019, Issue 4 | pp. 1145-1162 | 18 pages
Authors
Raghunandan K. S.; Shivakumara Palaiahnakote; Roy Sangheeta; Kumar G. Hemantha; Pal Umapada; Lu Tong
Author affiliations

Univ Mysore, Dept Studies Comp Sci, Mysore 57005, Karnataka, India;

Univ Malaya, Fac Comp Syst & Informat Technol, Kuala Lumpur 50603, Malaysia;

Univ Malaya, Fac Comp Syst & Informat Technol, Kuala Lumpur 50603, Malaysia;

Univ Mysore, Dept Studies Comp Sci, Mysore 57005, Karnataka, India;

Indian Stat Inst, Comp Vis & Pattern Recognit Unit, Kolkata 700108, India;

Nanjing Univ, Natl Key Lab Novel Software Technol, Nanjing 210023, Jiangsu, Peoples R China;

Original format: PDF
Language: English
Keywords
Bit plane slicing; convex and concave deficiencies; wavelet sub-bands; arbitrarily-oriented text detection and recognition; hidden Markov model; multi-lingual text detection and recognition;

Date added to database: 2022-08-18 04:27:37

Similar literature


1. Contour Restoration of Text Components for Recognition in Video/Scene Images [J]. Yirui Wu, Palaiahnakote Shivakumara, Tong Lu. IEEE Transactions on Image Processing. 2016, Issue 12

2. Curved text detection in blurred/non-blurred video/scene images [J]. Xue Minglong, Shivakumara Palaiahnakote, Zhang Chao. Multimedia Tools and Applications. 2019, Issue 18

3. Text detection in born-digital images using multiple layer images [C]. Zeng Chao, Jia Wenjing, He Xiangjian. IEEE International Conference on Acoustics, Speech and Signal Processing. 2013

4. Unified detection and recognition for reading text in scene images [D]. Weinman, Jerod J. 2008

5. Cursive-Text: A Comprehensive Dataset for End-to-End Urdu Text Recognition in Natural Scene Images [O]. Asghar Ali Chandio, Md. Asikuzzaman, Mark Pickering. 2020

6. Scene text localization and recognition in images and videos [O]. Neumann Lukáš. 2017

7. Automated System for Text Detection in Individual Video Images [R]. Du, Y., Chang, C., Thouin, P. D. 2003
