首页> 外文会议>International Conference on Document Analysis and Recognition >A Handwritten Chinese Text Recognizer Applying Multi-level Multimodal Fusion Network
【24h】

A Handwritten Chinese Text Recognizer Applying Multi-level Multimodal Fusion Network

机译:应用多级多模式融合网络的手写中文文本识别器

获取原文

摘要

Handwritten Chinese text recognition (HCTR) has received extensive attention from the community of pattern recognition in the past decades. Most existing deep learning methods consist of two stages, i.e., training a text recognition network on the base of visual information, followed by incorporating language constrains with various language models. Therefore, the inherent linguistic semantic information is often neglected when designing the recognition network. To tackle this problem, in this work, we propose a novel multi-level multimodal fusion network and properly embed it into an attention-based LSTM so that both the visual information and the linguistic semantic information can be fully leveraged when predicting sequential outputs from the feature vectors. Experimental results on the ICDAR-2013 competition dataset demonstrate a comparable result with the state-of-the-art approaches.
机译:在过去的几十年中,手写中文文本识别(HCTR)受到了模式识别社区的广泛关注。现有的大多数深度学习方法包括两个阶段,即在视觉信息的基础上训练文本识别网络,然后将语言约束与各种语言模型结合在一起。因此,在设计识别网络时,往往会忽略固有的语言语义信息。为了解决这个问题,在这项工作中,我们提出了一种新颖的多级多模态融合网络,并将其正确地嵌入到基于注意力的LSTM中,以便在预测来自语言的顺序输出时可以充分利用视觉信息和语言语义信息。特征向量。 ICDAR-2013竞争数据集上的实验结果证明了与最新技术方法可比的结果。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号