首页> 中文期刊>中文信息学报 >MHW蒙古文脱机手写数据库及其应用

MHW蒙古文脱机手写数据库及其应用

     

摘要

A public well-recognized Mongolian offline handwritten database is the basis for the research and develop-ment of Mongolian handwriting recognition system.Based on the research on Mongolian coding,word formation and grammar,a large-vocabulary Mongolian offline handwritten database(MHW)is constructed,which contains 100000 pieces of Mongolian words,i.e.20 samples for each of 5000 words.The test set I contains 5000 samples and test set II contains 14085 samples.An automatic error detection algorithm is applied,which is based on the vari-able length of each Mongolian word.The performance of MHW is validated on three propular handwriting recogni-tion models,among which the Recurrent Neural Network based model shows best performance of 2.20% on test set I and 5.55% on test set II with constrained dictionary.%建立公开、权威的蒙古文手写数据库是研究和开发蒙古文手写识别系统的基础.该文在蒙古文编码、构词和语法的研究基础上,公开了一个蒙古文大词汇量脱机手写数据库M HW,其中训练集由5000个单词构成,每个词采集了20个样本,共包含10万样本,测试集Ⅰ包含5000样本,测试集Ⅱ包含14085样本.该文利用蒙古文文字长度可变特征研究了自动错误检测算法,提高了字库的可靠性.在三种常用手写识别模型上评估了字库的性能,其中基于循环神经网络的模型表现出最佳性能,在字典受限条件下测试集Ⅰ的词错误率达到2.20%,测试集Ⅱ达到了5.55%.

著录项

相似文献

  • 中文文献
  • 外文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号