首页> 外文会议>International Conference on Document Analysis and Recognition >Deep Network with Pixel-Level Rectification and Robust Training for Handwriting Recognition
【24h】

Deep Network with Pixel-Level Rectification and Robust Training for Handwriting Recognition

机译:具有像素级校正功能的深度网络和用于手写识别的强大培训

获取原文

摘要

Offline handwriting recognition is a well-known challenging task in the optical character recognition (OCR) field due to the difficulty caused by various unconstraint handwriting styles. In order to learn invariant feature representations for handwriting, we propose a novel method to incorporate pixel-level rectification into a CNN and RNN based model. We also propose an adjacent output mixup method for RNN layer's training to improve the generalization ability of the model, i.e., the previous output of an RNN layer is added to the current output with random weights. We additionally adopt a series of techniques including pre-training, data augmentation and language model, and further analyze their contributions to the improvement of the model performance. The proposed method performs well on three public benchmarks, including the IAM, Rimes and IFN/ENIT datasets.
机译:由于各种不受约束的笔迹样式引起的困难,离线笔迹识别是光学字符识别(OCR)领域中众所周知的挑战性任务。为了学习手写的不变特征表示,我们提出了一种将像素级校正合并到基于CNN和RNN的模型中的新颖方法。我们还提出了一种用于RNN层训练的相邻输出混合方法,以提高模型的泛化能力,即将RNN层的先前输出添加到具有随机权重的当前输出中。我们还采用了包括预训练,数据增强和语言模型在内的一系列技术,并进一步分析了它们对改善模型性能的贡献。所提出的方法在三个公共基准(包括IAM,Rimes和IFN / ENIT数据集)上表现良好。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号