Towards Accurate Scene Text Recognition With Semantic Reasoning Networks



Abstract

Scene text images contain two levels of content: visual texture and semantic information. Although scene text recognition methods have made great progress over the past few years, research on mining semantic information to assist text recognition has attracted less attention, and only RNN-like structures have been explored to model semantic information implicitly. However, we observe that RNN-based methods have some obvious shortcomings, such as a time-dependent decoding manner and one-way serial transmission of semantic context, which greatly limit both the usefulness of semantic information and computational efficiency. To mitigate these limitations, we propose a novel end-to-end trainable framework named semantic reasoning network (SRN) for accurate scene text recognition, in which a global semantic reasoning module (GSRM) is introduced to capture global semantic context through multi-way parallel transmission. State-of-the-art results on 7 public benchmarks, including regular text, irregular text and non-Latin long text, verify the effectiveness and robustness of the proposed method. In addition, SRN has a significant speed advantage over RNN-based methods, demonstrating its value in practical use.


Bibliographic record

Source
IEEE/CVF Conference on Computer Vision and Pattern Recognition | 2020 | pp. 12110-12119 | 10 pages
Authors
Deli Yu; Xuan Li; Chengquan Zhang; Tao Liu; Junyu Han; Jingtuo Liu; Errui Ding;
Original format: PDF
Keywords
Semantics; Visualization; Text recognition; Cognition; Feature extraction; Decoding; Robustness;


Similar literature

1. Scene Semantics Recognition Based on Target Detection and Fuzzy Reasoning [J]. Weiliang Liu, Changliang Liu, Yongjun Lin. Research journal of applied science, engineering and technology. 2014, No. 5
2. An End-to-End Trainable Neural Network for Image-Based Sequence Recognition and Its Application to Scene Text Recognition [J]. Baoguang Shi, Xiang Bai, Cong Yao. IEEE Transactions on Pattern Analysis and Machine Intelligence. 2017, No. 11
3. Accurate Scene Text Recognition Based on Recurrent Neural Network [C]. Bolan Su, Shijian Lu. Asian conference on computer vision. 2015
4. Context modeling for semantic text matching and scene text detection [D]. Huang, Wenyi. 2016
5. An Algorithm Based on Text Position Correction and Encoder-Decoder Network for Text Recognition in the Scene Image of Visual Sensors [O]. Zhiwei Huang, Jinzhao Lin, Hongzhi Yang. 2020
6. Verisimilar Image Synthesis for Accurate Detection and Recognition of Texts in Scenes [O]. Fangneng Zhan, Shijian Lu, Chuhui Xue. 2018
