首页> 外文会议>International Conference on Frontiers in Handwriting Recognition >An attention based method for offline handwritten Urdu text recognition
【24h】

An attention based method for offline handwritten Urdu text recognition

机译:基于注意力的离线乌尔都语手写文本识别方法

获取原文

摘要

Compared to derivatives from Latin script, recognition of derivatives from Arabic handwritten script is a complex task due to the presence of two-dimensional structure, context-dependent shape of characters, high number of ligatures, overlap of characters, and placement of diacritics. While significant attempts exist for Latin and Arabic scripts, very few attempts have been made for offline, handwritten, Urdu script. In this paper, we introduce a large, annotated dataset of handwritten Urdu sentences. We also present a methodology for the recognition of offline handwritten Urdu text lines. A deep learning based encoder /decoder framework with attention mechanism is used to handle two-dimensional text structure. While existing approaches report only character level accuracy, the proposed model improves on BLSTM-based state-of-the-art by a factor of 2 in terms of character level accuracy and by a factor of 37 in terms of word level accuracy. Incorporation of attention before a recurrent decoding framework helps the model in looking at appropriate locations before classifying the next character and therefore results in a higher word level accuracy.
机译:与拉丁文字的派生词相比,阿拉伯文字手写体的派生词识别是一个复杂的任务,这是因为它具有二维结构,与字符有关的上下文形状,大量的连字,字符重叠和变音符号放置。尽管对拉丁文和阿拉伯文脚本进行了大量尝试,但对于脱机手写乌尔都语脚本却进行了很少的尝试。在本文中,我们介绍了一个大型的带注释的手写乌尔都语句子数据集。我们还提出了一种用于识别脱机手写乌尔都语文本行的方法。基于深度学习的具有注意力机制的编码器/解码器框架用于处理二维文本结构。尽管现有方法仅报告字符级精度,但是在基于BLSTM的最新技术方面,提出的模型在字符级精度方面提高了2倍,在单词级精度方面提高了37倍。在循环解码框架之前合并注意力有助于模型在对下一个字符进行分类之前先查找适当的位置,从而提高单词级别的准确性。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号