首页> 外文期刊>Neurocomputing >Adaptive embedding gate for attention-based scene text recognition
【24h】

Adaptive embedding gate for attention-based scene text recognition

机译:自适应嵌入门用于基于注意力的场景文本识别

获取原文
获取原文并翻译 | 示例
       

摘要

Scene text recognition has attracted particular research interest because it is a very challenging problem and has various applications. The most cutting-edge methods are attentional encoder-decoder frameworks that learn the alignment between the input image and output sequences. In particular, the decoder recurrently outputs predictions, using the prediction of the previous step as a guidance for every time step. In this study, we point out that the inappropriate use of previous predictions in existing attentional decoders restricts the recognition performance and brings instability. To handle this problem, we propose a novel module, namely adaptive embedding gate (AEG). The proposed AEG focuses on introducing high-order character language models to attentional decoders by controlling the information transmission between adjacent characters. AEG is a flexible module and can be easily integrated into the state-of-the-art attentional decoders for scene text recognition. We evaluate its effectiveness as well as robustness on a number of standard benchmarks, including the IIIT5K, SVT, SVT-P, CUTE80, and ICDAR datasets. Experimental results demonstrate that AEG can significantly boost recognition performance and bring better robustness. (C) 2019 The Author(s). Published by Elsevier B.V.
机译:场景文本识别吸引了特别的研究兴趣,因为它是一个非常具有挑战性的问题,并且具有多种应用。最前沿的方法是关注的编码器-解码器框架,该框架可学习输入图像和输出序列之间的对齐方式。特别地,解码器使用先前步骤的预测作为每个时间步骤的指导来循环输出预测。在这项研究中,我们指出在现有的注意力解码器中不适当地使用先前的预测会限制识别性能并带来不稳定。为了解决这个问题,我们提出了一种新颖的模块,即自适应嵌入门(AEG)。拟议的AEG致力于通过控制相邻字符之间的信息传输,将高阶字符语言模型引入注意解码器。 AEG是一个灵活的模块,可以轻松集成到最新的注意力解码器中,以进行场景文本识别。我们在包括IIIT5K,SVT,SVT-P,CUTE80和ICDAR数据集在内的许多标准基准上评估其有效性和鲁棒性。实验结果表明,AEG可以显着提高识别性能并带来更好的鲁棒性。 (C)2019作者。由Elsevier B.V.发布

著录项

  • 来源
    《Neurocomputing》 |2020年第14期|261-271|共11页
  • 作者

  • 作者单位

    South China Univ Technol Coll Elect & Informat Engn Guangzhou Peoples R China;

    South China Univ Technol Coll Elect & Informat Engn Guangzhou Peoples R China|SCUT Zhuhai Inst Modern Ind Innovat Zhuhai Peoples R China;

  • 收录信息 美国《科学引文索引》(SCI);美国《工程索引》(EI);
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类
  • 关键词

    Deep learning; Scene text recognition; Attention mechanism;

    机译:深度学习;场景文字识别;注意机制;

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号