首页> 外文会议>International Conference on Document Analysis and Recognition >A Handwritten Chinese Text Recognizer Applying Multi-level Multimodal Fusion Network

【24h】

A Handwritten Chinese Text Recognizer Applying Multi-level Multimodal Fusion Network

机译：应用多级多模式融合网络的手写中文文本识别器

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

Handwritten Chinese text recognition (HCTR) has received extensive attention from the community of pattern recognition in the past decades. Most existing deep learning methods consist of two stages, i.e., training a text recognition network on the base of visual information, followed by incorporating language constrains with various language models. Therefore, the inherent linguistic semantic information is often neglected when designing the recognition network. To tackle this problem, in this work, we propose a novel multi-level multimodal fusion network and properly embed it into an attention-based LSTM so that both the visual information and the linguistic semantic information can be fully leveraged when predicting sequential outputs from the feature vectors. Experimental results on the ICDAR-2013 competition dataset demonstrate a comparable result with the state-of-the-art approaches.

机译：在过去的几十年中，手写中文文本识别（HCTR）受到了模式识别社区的广泛关注。现有的大多数深度学习方法包括两个阶段，即在视觉信息的基础上训练文本识别网络，然后将语言约束与各种语言模型结合在一起。因此，在设计识别网络时，往往会忽略固有的语言语义信息。为了解决这个问题，在这项工作中，我们提出了一种新颖的多级多模态融合网络，并将其正确地嵌入到基于注意力的LSTM中，以便在预测来自语言的顺序输出时可以充分利用视觉信息和语言语义信息。特征向量。 ICDAR-2013竞争数据集上的实验结果证明了与最新技术方法可比的结果。

著录项

来源
《International Conference on Document Analysis and Recognition 》|2019年|1464-1469|共6页
会议地点
作者
Yuhuan Xiu; Qingqing Wang; Hongjian Zhan; Man Lan; Yue Lu;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类
关键词
Visualization; Semantics; Linguistics; Convolution; Text recognition; Feature extraction; Hidden Markov models;

机译：可视化;语义学;语言学;卷积;文本识别;特征提取;隐马尔可夫模型;

相似文献

外文文献
中文文献
专利

1. MULTIMODAL BIOMETRIC FUSION ONLINE HANDWRITTEN SIGNATURE VERIFICATION USING NEURAL NETWORK AND SUPPORT VECTOR MACHINE [J] . ORIEB ABUALGHANAM, LAYLA ALBDOUR, OMAR ADWAN International Journal of Innovative Computing Information and Control . 2021 ,第5期

机译：多模式生物识别在线手写签名验证使用神经网络和支持向量机
2. In-air handwritten Chinese text recognition with temporal convolutional recurrent network [J] . Gan Ji, Wang Weiqiang, Lu Ke Pattern Recognition: The Journal of the Pattern Recognition Society . 2020 ,第期

机译：与时间卷积经常性网络的空中手写的中国文本识别
3. A comprehensive study of hybrid neural network hidden Markov model for offline handwritten Chinese text recognition [J] . Wang Zi-Rui, Du Jun, Wang Wen-Chao, International Journal on Document Analysis and Recognition . 2018 ,第4期

机译：用于离线手写汉字识别的混合神经网络隐马尔可夫模型的综合研究
4. A Handwritten Chinese Text Recognizer Applying Multi-level Multimodal Fusion Network [C] . Yuhuan Xiu, Qingqing Wang, Hongjian Zhan, International Conference on Document Analysis and Recognition . 2019

机译：应用多级多模融合网络的手写的中国文本识别器
5. Applying text mining to multi-level indexing and searching for enhancing probabilistic information retrieval. [D] . Wen, Miao. 2010

机译：将文本挖掘应用于多级索引和搜索，以增强概率信息检索。
6. Applying deep neural networks to unstructured text notes in electronic medical records for phenotyping youth depression [O] . Joseph Geraci, Pamela Wilansky, Vincenzo de Luca, -1

机译：将深度神经网络应用于电子医疗记录中的非结构化文本注释以对青年抑郁症进行表型分析
7. Multimodal Approach of Speech Emotion Recognition Using Multi-Level Multi-Head Fusion Attention-Based Recurrent Neural Network [O] . Ngoc-Huynh Ho, Hyung-Jeong Yang, Soo-Hyung Kim, 2020

机译：基于多级多头融合的复发性神经网络的多式联运方法

A Handwritten Chinese Text Recognizer Applying Multi-level Multimodal Fusion Network

摘要

著录项

相似文献

相关主题

期刊订阅