首页> 外文会议>International Conference on Fuzzy Systems and Data Mining >A Novel Chinese Character Recognition Method Based on Multi-Modal Fusion
【24h】

A Novel Chinese Character Recognition Method Based on Multi-Modal Fusion

机译:一种基于多模态融合的新型汉字识别方法

获取原文

摘要

In recent years, deep learning models are significantly improving capacity of computer vision. Especially in the field of Chinese character recognition, due to the complex structure of the characters, traditional rule-based or machine learning methods have been no longer competitive in terms of the measures such as recognition precision. Inspired by the actual reading process of human beings, we find that the processing of character recognition involves multiple modal information, while single-modal information of images is not only needed. Thus, we study the multi-modal fusion, and propose a novel multi-network model for Chinese character recognition in this paper. The proposed model consists of CNNs, LSTMs and full connection networks, fusing multi-modal information for classifying input images. After experiments, we found that multi-modal fusion can improve the accuracy of character recognition.
机译:近年来,深入学习模型显着提高了计算机视觉的能力。特别是在汉字识别领域,由于人物的复杂结构,在识别精度等措施方面,传统的基于规则或机器学习方法已经不再竞争。灵感来自人类的实际阅读过程,发现字符识别的处理涉及多个模态信息,而不仅需要图像的单模信息。因此,我们研究了多模态融合,并提出了本文中汉字识别的新型多网络模型。所提出的模型包括CNN,LSTM和完整连接网络,融合用于对输入图像进行分类的多模态信息。实验后,我们发现多模态融合可以提高字符识别的准确性。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号