End-to-end scene text recognition using tree-structured models

Cunzhao Shi; Chunheng Wang; Baihua Xiao; Song Gao; Jinlong Hu


Abstract

Detecting and recognizing text in natural images is challenging and has received much attention from the computer vision community in recent years. In this paper, we propose a robust end-to-end scene text recognition method that uses tree-structured character models and normalized pictorial structure word models. For each character category, we build a part-based tree-structured model (TSM) that exploits both character-specific structure information and local appearance information. The TSM detects each part of a character and recognizes its unique structure, seamlessly combining character detection and recognition. Because TSMs can accurately detect characters against complex backgrounds, for text localization we apply the TSMs of all characters to the coarse text detection regions to eliminate false positives and to search for possibly missed characters. For word recognition, we propose a normalized pictorial structure (PS) framework to deal with the bias caused by words of different lengths. Experimental results on a range of challenging public datasets (ICDAR 2003, ICDAR 2011, SVT) demonstrate that the proposed method outperforms state-of-the-art methods for both text localization and word recognition.
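The abstract does not state how the PS score is normalized, so the following is only a toy illustration of the length bias it mentions, not the authors' formulation. It assumes a word score built purely from per-character detection scores (pairwise terms between neighboring characters are omitted), and the names `raw_score`, `normalized_score`, `best_word`, and all score values are made up for the example.

```python
from typing import Callable, Dict, List, Sequence


def raw_score(char_scores: Sequence[float]) -> float:
    """Sum of per-character scores; tends to grow with word length."""
    return sum(char_scores)


def normalized_score(char_scores: Sequence[float]) -> float:
    """Mean per-character score; comparable across word lengths."""
    if not char_scores:
        raise ValueError("a word needs at least one character score")
    return sum(char_scores) / len(char_scores)


def best_word(
    candidates: Dict[str, List[float]],
    scorer: Callable[[Sequence[float]], float],
) -> str:
    """Return the candidate word with the highest score under `scorer`."""
    return max(candidates, key=lambda word: scorer(candidates[word]))


# Hypothetical detection scores for two lexicon words on the same image
# region: a short word with strong matches and a longer word whose extra
# characters match poorly.
candidates = {
    "CAT": [0.9, 0.9, 0.9],
    "CATTLE": [0.9, 0.9, 0.5, 0.3, 0.4, 0.5],
}

print(best_word(candidates, raw_score))
print(best_word(candidates, normalized_score))
```

With the unnormalized sum, the longer word wins simply because it accumulates more terms; dividing by the number of characters removes that advantage. Averaging is one simple normalization chosen here for illustration, and the paper's actual scheme may differ.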


Bibliographic details

Source
Pattern Recognition: The Journal of the Pattern Recognition Society | 2014, Issue 9 | 14 pages
Authors
Cunzhao Shi; Chunheng Wang; Baihua Xiao; Song Gao; Jinlong Hu
Original format: PDF
Language: English
Chinese Library Classification: Computing technology, computer technology
Keywords
End-to-end; Scene text recognition; Part-based tree-structured models (TSMs); Normalized pictorial structure


