首页> 外文会议>IAPR International Conference on Document Analysis and Recognition >A Compact CNN-DBLSTM Based Character Model For Online Handwritten Chinese Text Recognition
【24h】

A Compact CNN-DBLSTM Based Character Model For Online Handwritten Chinese Text Recognition

机译:基于紧凑的CNN-DBLSTM字符模型,用于在线手写中文文本识别

获取原文

摘要

Recently, character model based on integrated convolutional neural network (CNN) and deep bidirectional long short-term memory (DBLSTM) has been demonstrated to be effective for online handwritten Chinese text recognition (HCTR). However, the reported CNN-DBLSTM topologies are too complex to be practically useful. In this paper, we propose a compact CNN-DBLSTM which has small footprint and low computation cost yet be able to accommodate multiple receptive fields for CNN-based feature extraction. By using the training set of a popular benchmark database, namely CASIA-OLHWDB, we trained a compact CNN-DBLSTM by a connectionist temporal classification (CTC) criterion with a multi-step training strategy. Combined this character model with a character trigram language model, our online HCTR system with a WFST-based decoder has achieved state-of-the-art performance on both CASIA and ICDAR-2013 Chinese handwriting recognition competition test sets.
机译:最近,基于集成卷积神经网络(CNN)和深双向短期内记忆(DBLSTM)的字符模型已经证明了在线手写中文文本识别(HCTR)是有效的。但是,报告的CNN-DBLSTM拓扑太复杂,无法实际上是有用的。在本文中,我们提出了一种紧凑的CNN-DBLSTM,其具有小的占地面积和低计算成本,但能够适应基于CNN的特征提取的多个接收领域。通过使用流行的基准数据库的培训集,即Casia-OLHWDB,我们通过具有多步训练策略的连接员时间分类(CTC)标准培训了一个紧凑的CNN-DBLSTM。将此字符模型与角色三重语言模型组合,我们的在线HCTR系统具有基于WFST的解码器,在Casia和Icdar-2013中文手写识别竞争测试集中实现了最先进的性能。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号