首页> 外文期刊>Computer Vision, IET >Deep neural network with attention model for scene text recognition
【24h】

Deep neural network with attention model for scene text recognition

机译:具有注意力模型的深度神经网络用于场景文本识别

获取原文
获取原文并翻译 | 示例
           

摘要

The authors present a deep neural network (DNN) with attention model for scene text recognition. The proposed model does not require any segmentation of the input text image. The framework is inspired by the attention model presented recently for speech recognition and image captioning. In the proposed framework, feature extraction, feature attention and sequence recognition are integrated in a jointly trainable network. Compared with previous approaches, the following contributions are mainly made. (i) The attention model is applied into DNN to recognise scene text, and it can effectively solve the sequence recognition problem caused by variable length labels. (ii) Rigorous experiments are performed across a number of challenging benchmarks, including IIIT5K, SVT, ICDAR2003 and ICDAR2013 datasets. Results in experiments show that the proposed model is comparable or better than the state-of-the-art methods. (iii) This model only contains 6.5 million parameters. Compared with other DNN models for scene text recognition, this model has the least number of parameters so far.
机译:作者提出了一种具有注意力模型的深度神经网络(DNN),用于场景文本识别。提出的模型不需要对输入文本图像进行任何分割。该框架的灵感来自最近提出的用于语音识别和图像字幕的注意力模型。在提出的框架中,特征提取,特征注意和序列识别被集成在可联合训练的网络中。与以前的方法相比,主要做出以下贡献。 (i)将注意力模型应用到DNN中识别场景文本,可以有效解决变长标签引起的序列识别问题。 (ii)在包括IIIT5K,SVT,ICDAR2003和ICDAR2013数据集在内的许多具有挑战性的基准上进行了严格的实验。实验结果表明,所提出的模型可比或优于最新方法。 (iii)该模型仅包含650万个参数。与用于场景文本识别的其他DNN模型相比,该模型到目前为止具有最少数量的参数。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号