Are Multidimensional Recurrent Layers Really Necessary for Handwritten Text Recognition?

机译：是手写文本识别所必需的多维反复层吗？

获取原文

获取外文期刊封面目录资料

页面导航

摘要
著录项
引文网络
相似文献
相关主题

摘要

Current state-of-the-art approaches to offline Handwritten Text Recognition extensively rely on Multidimensional Long Short-Term Memory networks. However, these architectures come with quite an expensive computational cost, and we observe that they extract features visually similar to those of convolutional layers, which are computationally cheaper. This suggests that the two-dimensional long-term dependencies, which are potentially modeled by multidimensional recurrent layers, may not be essential to achieve a good recognition accuracy, at least in the lower layers of the architecture. In this work, an alternative model is explored that relies only on convolutional and one-dimensional recurrent layers that achieves better or equivalent results than those of the current state-of-the-art architecture, and runs significantly faster. In addition, we observe that using random distortions during training as synthetic data augmentation dramatically improves the accuracy of our model. Thus, are multidimensional recurrent layers really necessary for Handwritten Text Recognition? Probably not.

机译：目前最先进的方法，可以广泛地依赖多维长短期内存网络的离线手写文本识别。然而，这些架构具有相当昂贵的计算成本，并且我们观察到它们在视觉上提取与卷积层相似的特征，这是计算方式便宜的。这表明，由多维反复层潜在地建模的二维长期依赖性可能不是实现良好的识别精度，至少在架构的下层中必不可少。在这项工作中，另一种模式是仅探索卷积和一维周期性层依赖其获得更好的或等同的结果比当前状态的最先进的体系结构，和运行显著更快。此外，我们观察到，在培训期间使用随机扭曲作为合成数据的增强显着提高了模型的准确性。因此，是手写文本识别所必需的多维反复层？可能不是。

著录项

来源
《IAPR International Conference on Document Analysis and Recognition》|2017年|732p|共6页
会议地点
作者
Joan Puigcerver;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类 TP391.41-53;
关键词
Training; Feature extraction; Computer architecture; Error analysis; Text recognition; Computational modeling; Hidden Markov models;

机译：培训;特征提取;计算机架构;错误分析;文本识别;计算建模;隐藏的马尔可夫模型;

相似文献

外文文献
中文文献
专利

1. In-air handwritten Chinese text recognition with temporal convolutional recurrent network [J] . Gan Ji, Wang Weiqiang, Lu Ke Pattern Recognition: The Journal of the Pattern Recognition Society . 2020,第期

机译：与时间卷积经常性网络的空中手写的中国文本识别
2. Classification of Handwritten Names of Cities and Handwritten Text Recognition using Various Deep Learning Models [J] . Daniyar Nurseitov, Kairat Bostanbekov, Maksat Kanatov, Advances in Science, Technology and Engineering Systems . 2020,第5期

机译：使用各种深度学习模型对城市手写名称和手写文本识别的分类
3. Text-Line and Character Segmentation for Off-line Recognition of Handwritten Japanese Text [J] . Kha Cong Nguyen, Nakagawa Masaki 電子情報通信学会技術研究報告. パターン認識·メディア理解. Pattern Recognition and Media Understanding . 2015,第517期

机译：文本行和字符分割，用于手写日语文本的离线识别
4. Are Multidimensional Recurrent Layers Really Necessary for Handwritten Text Recognition? [C] . Joan Puigcerver IAPR International Conference on Document Analysis and Recognition . 2017

机译：手写文本识别真的需要多维递归层吗？
5. Neural network based off-line handwritten text recognition system [D] . Han, Changan 2011

机译：基于神经网络的离线手写文本识别系统
6. Entity recognition from clinical texts via recurrent neural network [O] . Zengjian Liu, Ming Yang, Xiaolong Wang, 2017

机译：通过递归神经网络从临床文本中识别实体
7. Learning Spatial-Semantic Context with Fully Convolutional Recurrent Network for Online Handwritten Chinese Text Recognition [O] . Xie, Z, Sun, Z, Jin, L, 2017

机译：利用全卷积递归网络学习空间语义上下文进行在线手写中文文本识别

Are Multidimensional Recurrent Layers Really Necessary for Handwritten Text Recognition?

摘要

著录项

引文网络

相似文献

相关主题

期刊订阅