首页> 外文会议>International Conference on Wavelet Analysis and Pattern Recognition >A Japanese OCR post-processing approach based on dictionary matching
【24h】

A Japanese OCR post-processing approach based on dictionary matching

机译:基于字典匹配的日语OCR后处理方法

获取原文

摘要

This paper describes a post-processing approach for Japanese character recognition based on dictionary. By the analysis of experimental data in the processing of OCR, we find that some segmentation and recognition results do not conform to the rules of lexical and just generate the character based on the shape. If the fonts of pending recognized characters are similar with the others, it will easily lead to going wrong in the processing of OCR. For these errors we put forward an idea based on the Limited Length Segmentation Matching and the Bayesian Statistical Classifier. Through the above method, most of the font recognized mistakes can be solved. By the experimental results, it can be proved that this method is an effective way to improve the recognized rate of Japanese character.
机译:本文介绍了一种基于字典的日语字符识别的后处理方法。通过对OCR处理中的实验数据进行分析,我们发现某些分割和识别结果不符合词汇规则,只是根据形状生成字符。如果待识别的待识别字符的字体与其他字符相似,则很容易导致OCR处理出错。针对这些错误,我们提出了一种基于有限长度分割匹配和贝叶斯统计分类器的思想。通过以上方法,可以解决大多数字体识别错误。实验结果表明,该方法是提高日语字符识别率的有效方法。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号