首页> 外文期刊>Neural computing & applications >Evaluation of cursive and non-cursive scripts using recurrent neural networks
【24h】

Evaluation of cursive and non-cursive scripts using recurrent neural networks

机译:使用递归神经网络评估草书和非草书

获取原文
获取原文并翻译 | 示例
       

摘要

Character recognition has been widely used since its inception in applications involved processing of scanned or camera-captured documents. There exist multiple scripts in which the languages are written. The scripts could broadly be divided into cursive and non-cursive scripts. The recurrent neural networks have been proved to obtain state-of-the-art results for optical character recognition. We present a thorough investigation of the performance of recurrent neural network (RNN) for cursive and non-cursive scripts. We employ bidirectional long short-term memory (BLSTM) networks, which is a variant of the standard RNN. The output layer of the architecture used to carry out our investigation is a special layer called connectionist temporal classification (CTC) which does the sequence alignment. The CTC layer takes as an input the activations of LSTM and aligns the target labels with the inputs. The results were obtained at the character level for both cursive Urdu and non-cursive English scripts are significant and suggest that the BLSTM technique is potentially more useful than the existing OCR algorithms.
机译:自从字符识别技术问世以来,它就已广泛应用于涉及处理扫描或照相机捕获的文档的应用中。存在用于编写语言的多个脚本。这些脚本大致可分为草书和非草书。事实证明,递归神经网络可以获得光学字符识别的最新结果。我们对草书和非草书的递归神经网络(RNN)的性能进行了彻底的研究。我们采用双向长期短期记忆(BLSTM)网络,这是标准RNN的一种变体。用于进行研究的体系结构的输出层是称为连接主义时间分类(CTC)的特殊层,该层进行序列比对。 CTC层将LSTM的激活作为输入,并将目标标签与输入对齐。在草书乌尔都语和非草书英语脚本的字符级别获得的结果都很重要,这表明BLSTM技术可能比现有的OCR算法更有用。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号