Conference: International Conference on Pattern Recognition

A Multi-head Self-relation Network for Scene Text Recognition

Abstract

Text embedded in scene images is everywhere in daily life. However, recognizing text in natural scene images remains challenging because the text appears in diverse shapes and distorted patterns. Recent recognition networks generally treat scene text recognition as a sequence prediction task. Although they achieve excellent performance, these networks treat the cells of the feature map as independent and update each cell's state without using information from related cells. In addition, the local receptive field of a traditional convolutional neural network (CNN) means that a single cell cannot cover the whole text region of an image. As a result, existing recognition networks cannot extract the global context information of a visual scene. To address these problems, we propose a Multi-head Self-relation Network (MSRN) for scene text recognition. The MSRN consists of several multi-head self-relation layers, which are designed to extract the global context information of a visual scene and to fuse the information of related cells. Experiments show that the proposed network achieves superior performance on several public benchmark datasets, including IC03, IC13, IC15, and SVT-Perspective.
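
The abstract does not give the equations of a multi-head self-relation layer, so the following is only a minimal sketch under a loud assumption: that the layer behaves like standard multi-head scaled dot-product self-attention applied to the flattened cells of a CNN feature map. The function name `multi_head_self_relation`, the projection matrices `w_q`, `w_k`, `w_v`, `w_o`, the residual connection, and all shapes are hypothetical and are not taken from the paper. The sketch illustrates how every cell can be updated from all other cells, which is the global-context idea the abstract describes.

```python
import numpy as np


def softmax(x, axis=-1):
    """Numerically stable softmax along the given axis."""
    shifted = x - x.max(axis=axis, keepdims=True)
    exp = np.exp(shifted)
    return exp / exp.sum(axis=axis, keepdims=True)


def multi_head_self_relation(feature_map, w_q, w_k, w_v, w_o, num_heads):
    """Update every feature-map cell from all cells (assumed formulation).

    feature_map: array of shape (H, W, C), e.g. a CNN output.
    w_q, w_k, w_v, w_o: (C, C) projection matrices (hypothetical parameters).
    Returns the updated map (H, W, C) and the relation weights
    of shape (num_heads, H*W, H*W).
    """
    h, w, c = feature_map.shape
    assert c % num_heads == 0, "channels must divide evenly across heads"
    d = c // num_heads
    n = h * w
    cells = feature_map.reshape(n, c)  # one row per feature-map cell

    def split_heads(x):
        # (N, C) -> (num_heads, N, d)
        return x.reshape(n, num_heads, d).transpose(1, 0, 2)

    q = split_heads(cells @ w_q)
    k = split_heads(cells @ w_k)
    v = split_heads(cells @ w_v)

    # Pairwise relation between every pair of cells, per head.
    scores = q @ k.transpose(0, 2, 1) / np.sqrt(d)  # (heads, N, N)
    relation = softmax(scores, axis=-1)

    # Each cell becomes a relation-weighted mix of all cells.
    fused = relation @ v  # (heads, N, d)
    merged = fused.transpose(1, 0, 2).reshape(n, c)

    # Residual connection: an assumption, common in attention layers.
    out = cells + merged @ w_o
    return out.reshape(h, w, c), relation


# Toy usage with random data (illustrative sizes only).
rng = np.random.default_rng(0)
fmap = rng.standard_normal((4, 16, 32))
weights = [rng.standard_normal((32, 32)) * 0.1 for _ in range(4)]
out, relation = multi_head_self_relation(fmap, *weights, num_heads=4)
```

In this sketch each row of `relation` sums to 1, so every output cell is a weighted combination over all 64 cells of the toy map rather than only the cells inside a local receptive field.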