首页> 外文会议>IAPR International Conference on Document Analysis and Recognition >Are Multidimensional Recurrent Layers Really Necessary for Handwritten Text Recognition?
【24h】

Are Multidimensional Recurrent Layers Really Necessary for Handwritten Text Recognition?

机译:是手写文本识别所必需的多维反复层吗?

获取原文
获取外文期刊封面目录资料

摘要

Current state-of-the-art approaches to offline Handwritten Text Recognition extensively rely on Multidimensional Long Short-Term Memory networks. However, these architectures come with quite an expensive computational cost, and we observe that they extract features visually similar to those of convolutional layers, which are computationally cheaper. This suggests that the two-dimensional long-term dependencies, which are potentially modeled by multidimensional recurrent layers, may not be essential to achieve a good recognition accuracy, at least in the lower layers of the architecture. In this work, an alternative model is explored that relies only on convolutional and one-dimensional recurrent layers that achieves better or equivalent results than those of the current state-of-the-art architecture, and runs significantly faster. In addition, we observe that using random distortions during training as synthetic data augmentation dramatically improves the accuracy of our model. Thus, are multidimensional recurrent layers really necessary for Handwritten Text Recognition? Probably not.
机译:目前最先进的方法,可以广泛地依赖多维长短期内存网络的离线手写文本识别。然而,这些架构具有相当昂贵的计算成本,并且我们观察到它们在视觉上提取与卷积层相似的特征,这是计算方式便宜的。这表明,由多维反复层潜在地建模的二维长期依赖性可能不是实现良好的识别精度,至少在架构的下层中必不可少。在这项工作中,另一种模式是仅探索卷积和一维周期性层依赖其获得更好的或等同的结果比当前状态的最先进的体系结构,和运行显著更快。此外,我们观察到,在培训期间使用随机扭曲作为合成数据的增强显着提高了模型的准确性。因此,是手写文本识别所必需的多维反复层?可能不是。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号