首页> 外文会议>International Conference on Document Analysis and Recognition >Woodblock-Printing Mongolian Words Recognition by Bi-LSTM with Attention Mechanism
【24h】

Woodblock-Printing Mongolian Words Recognition by Bi-LSTM with Attention Mechanism

机译:具有注意机制的Bi-LSTM木刻印刷蒙古语单词识别

获取原文

摘要

Woodblock-printing Mongolian documents are seriously degraded due to aging. Therefore, it is difficult to segment woodblock-printing Mongolian words are into individual glyphs. In this paper, a holistic recognition approach based on sequence to sequence model has been proposed for the woodblock-printing Mongolian words. The input of the proposed model is the sequence of frames of a wood-block printing Mongolian word. In order to generating the corresponding sequence of frames, each word image should be normalized into the same sizes in advance. And then, each word image is segmented into several fragments with equal size along writing direction. The output of the proposed model is a sequence of letters. To be specific, the proposed model contains three parts: an encoder, a decoder and an attention network. The encoder consists of a deep neural network and a bi-directional Long Short-Term Memory (Bi-LSTM). The decoder consists of a Long Short-Term Memory (LSTM) with a softmax layer. The encoder and decoder are connected by an attention network, which can map multiple frames to one letter. Experimental results demonstrate that the proposed approach outperforms the segmentation based method.
机译:木刻印刷的蒙古文件由于老化而严重退化。因此,很难将木版印刷的蒙古语单词分割成单个字形。本文针对木刻版蒙古语单词,提出了一种基于顺序的整体识别方法。所提出模型的输入是一个木刻印刷蒙古语单词的框架序列。为了生成相应的帧序列,每个单词图像应预先规格化为相同的大小。然后,将每个单词图像沿书写方向分成大小相等的几个片段。所提出的模型的输出是一个字母序列。具体而言,提出的模型包含三个部分:编码器,解码器和注意网络。编码器由一个深度神经网络和一个双向长期短期记忆(Bi-LSTM)组成。解码器由一个带有softmax层的长短期存储器(LSTM)组成。编码器和解码器通过注意网络连接,该注意网络可以将多个帧映射为一个字母。实验结果表明,该方法优于基于分割的方法。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号