首页> 外文期刊>Pattern recognition letters >SaHAN: Scale-aware hierarchical attention network for scene text recognition
【24h】

SaHAN: Scale-aware hierarchical attention network for scene text recognition

机译:Sahan:尺度意识的分层关注网络用于场景文本识别

获取原文
获取原文并翻译 | 示例
       

摘要

Scene text recognition has become a research hotspot owing to its abundant semantic information and various applications. Recent methods of scene text recognition usually focus on handling shape distortion, attention drift, or background noise, ignoring that text recognition encounters character scale-variation problem. To address this issue, in this paper, we propose a new scale-aware hierarchical attention network (SaHAN) for scene text recognition. Inspired by feature pyramid network, we exploit the inherent pyramidal structure of a deep convolutional network to retain multi-scale features for flexible receptive fields. Then, we construct a hierarchical attention decoder that performs the attention mechanism twice on multi-scale features to collect the most fine-grained information for prediction. The SaHAN is trained in a weak supervision way, requiring only images and corresponding text labels. Extensive experiments on seven benchmarks reveal that SaHAN achieves state-of-the-art performance.
机译:由于其丰富的语义信息和各种应用,现场文本识别已成为研究热点。最近的现场文本识别方法通常侧重于处理形状扭曲,注意漂移或背景噪声,忽略该文本识别遇到字符尺度变化问题。要解决此问题,请在本文中,我们提出了一种新的尺度感知分层关注网络(Sahan),用于场景文本识别。灵感来自特色金字塔网络,我们利用深度卷积网络的固有金字塔结构,以保持灵活接收领域的多尺度特征。然后,我们构建一个分层注意解码器,对多尺度特征进行两次执行注意机制,以收集最细粒度的预测信息。撒拉莎被弱势监督方式培训,只需要图像和相应的文本标签。七个基准的广泛实验揭示了撒哈希实现了最先进的表现。

著录项

  • 来源
    《Pattern recognition letters》 |2020年第8期|205-211|共7页
  • 作者单位

    School of Electronic and Information Engineering South China University of Technology Guangzhou 510000 China;

    School of Electronic and Information Engineering South China University of Technology Guangzhou 510000 China;

    School of Electronic and Information Engineering South China University of Technology Guangzhou 510000 China SCUT-Zhuhai Institute of Modern Industrial Innovation Zhuhai 519000 China;

    School of Electronic and Information Engineering South China University of Technology Guangzhou 510000 China;

    School of Electronic and Information Engineering South China University of Technology Guangzhou 510000 China SCUT-Zhuhai Institute of Modern Industrial Innovation Zhuhai 519000 China;

    School of Electronic and Information Engineering South China University of Technology Guangzhou 510000 China;

  • 收录信息 美国《科学引文索引》(SCI);美国《工程索引》(EI);
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类
  • 关键词

    Scene text recognition; Character scale-variation problem; Multi-scale features; Hierarchical attention decoder;

    机译:场景文本识别;字符规模变化问题;多尺度特征;分层注意解码器;
  • 入库时间 2022-08-18 21:28:45

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号