首页> 外文会议>IEEE/CVF Conference on Computer Vision and Pattern Recognition >Towards Accurate Scene Text Recognition With Semantic Reasoning Networks
【24h】

Towards Accurate Scene Text Recognition With Semantic Reasoning Networks

机译:利用语义推理网络实现准确的场景文本识别

获取原文

摘要

Scene text image contains two levels of contents: visual texture and semantic information. Although the previous scene text recognition methods have made great progress over the past few years, the research on mining semantic information to assist text recognition attracts less attention, only RNN-like structures are explored to implicitly model semantic information. However, we observe that RNN based methods have some obvious shortcomings, such as time-dependent decoding manner and one-way serial transmission of semantic context, which greatly limit the help of semantic information and the computation efficiency. To mitigate these limitations, we propose a novel end-to-end trainable framework named semantic reasoning network (SRN) for accurate scene text recognition, where a global semantic reasoning module (GSRM) is introduced to capture global semantic context through multi-way parallel transmission. The state-of-the-art results on 7 public benchmarks, including regular text, irregular text and non-Latin long text, verify the effectiveness and robustness of the proposed method. In addition, the speed of SRN has significant advantages over the RNN based methods, demonstrating its value in practical use.
机译:场景文本图像包含两个级别的内容:视觉纹理和语义信息。尽管过去的几年中场景文本识别方法取得了长足的进步,但是挖掘语义信息以辅助文本识别的研究却很少受到关注,仅探索类似RNN的结构来对语义信息进行隐式建模。但是,我们发现基于RNN的方法存在一些明显的缺点,如时间依赖的解码方式和语义上下文的单向串行传输,这极大地限制了语义信息的帮助和计算效率。为了缓解这些限制,我们提出了一种新颖的端到端可训练框架,称为语义推理网络(SRN),用于准确的场景文本识别,其中引入了全局语义推理模块(GSRM),以通过多路并行捕获全局语义上下文。传播。最新的7个公共基准测试结果(包括常规文本,不规则文本和非拉丁长文本)验证了该方法的有效性和鲁棒性。另外,与基于RNN的方法相比,SRN的速度具有明显的优势,证明了其在实际应用中的价值。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号