首页> 外文会议>IAPR International Conference on Document Analysis and Recognition >A Hybrid Model for End to End Online Handwriting Recognition
【24h】

A Hybrid Model for End to End Online Handwriting Recognition

机译:结束结束在线手写识别的混合模型

获取原文
获取外文期刊封面目录资料

摘要

Automatic recognition of online handwritten words in a generic mode has significant application potentials. However, this recognition job is challenging for unconstrained handwriting data. The challenge is more serious for Indic scripts like Devanagari or Bangla due to the inherent cursiveness of their characters, large sizes of respective alphabets, existence of several groups of shape similar characters etc. On the other hand, with the recent development of powerful machine learning tools, major research initiatives in this area of pattern recognition studies have been observed. Feature extraction and classification are two major modules of such a recognizer. Deep architectures of convolutional neural network (CNN) models have been found to be efficient in extraction of useful features from raw signal. On the other hand, a recurrent neural network (RNN) along with connectionist temporal classification (CTC) has been shown to be able to label unsegmented sequence data. In the present article, we propose a hybrid layered architecture consisting of three networks CNN, RNN and CTC for recognition of online handwriting without use of any specific lexicon. In this study, we have also observed that feeding hand-crafted features to the CNN at the first level of the proposed model provides better performance than feeding the raw signal to the CNN. We have simulated the proposed model on two large databases of Devanagari and Bangla online unconstrained handwritten words. The recognition accuracies provided by the proposed model are encouraging.
机译:在通用模式下自动识别在线手写单词具有重要的应用势。但是,此识别作业对无限制的手写数据有挑战性。由于其角色的固有诅咒,各个字母的固有的统治,大小的尺寸,几组形状类似的角色等,挑战更为严重。另一方面,随着强大的机器学习的最新发展工具,已经观察到该模式识别研究领域的主要研究举措。特征提取和分类是这种识别器的两个主要模块。已经发现卷积神经网络(CNN)模型的深层架构在原始信号中提取有用特征的高效。另一方面,已经示出了经常性神经网络(RNN)以及连接员时间分类(CTC)能够标记未分段的序列数据。在本文中,我们提出了一种混合分层架构,包括三个网络CNN,RNN和CTC,用于识别在线手写而不使用任何特定的Lexicon。在这项研究中,我们还观察到,在所提出的模型的第一级将手工制作的特征送到CNN中,提供比将原始信号馈送到CNN的更好的性能。我们已经在Devanagari和Bangla在线不受约束的手写单词中模拟了两个大型数据库模型。拟议模型提供的识别准确性令人鼓舞。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号