首页> 外文期刊>Pattern Recognition: The Journal of the Pattern Recognition Society >ON-LINE CHINESE CHARACTER RECOGNITION WITH EFFECTIVE CANDIDATE RADICAL AND CANDIDATE CHARACTER SELECTIONS
【24h】

ON-LINE CHINESE CHARACTER RECOGNITION WITH EFFECTIVE CANDIDATE RADICAL AND CANDIDATE CHARACTER SELECTIONS

机译:有效汉字汉字和汉字汉字选择的中文汉字识别

获取原文
获取原文并翻译 | 示例
       

摘要

A useful property of Chinese characters is that most of them possess a radical with a meaning. Our system is motivated by this characteristic found in Chinese characters which leads us to radical-based candidate selection approaches, performed before similarity measurement. The searching ranges of the radicals and the characters can be basically decided by the number of strokes in the input script. From the recognized possible radicals, the candidate characters are limited to those having such radicals and those having no radical. Since some of the candidate radicals may have very high matching costs, we can reduce the recognition time by discarding those unlikely candidate radicals. Furthermore, to speed up the inspection of radicals, we developed a radical extraction algorithm to narrow down the searching scope of the reference templates. The radical extraction algorithm was further improved by eliminating false extracted radicals. With these mechanisms, the number of candidate radicals screened out is 286 out of 726 radical templates and the number of candidate characters to be detailed matched is 123 out of 5401 Chinese characters. Through these efforts, the recognition rate improves to be 96.35% for the first rank and 98.96% for the first 10 result candidates with the speed of 0.427 seconds on average per character on a PC using Intel 386-33 CPU. Copyright (C) 1996 Pattern Recognition Society. [References: 12]
机译:汉字的一个有用特性是,大多数汉字都带有带有含义的部首。我们的系统受到汉字中这一特征的启发,从而使我们找到了基于相似度度量之前基于部首的候选词选择方法。部首和字符的搜索范围基本上可以由输入脚本中的笔画数决定。根据公认的可能的部首,候选字符限于具有此类部首的那些和不具有部首的那些。由于某些候选基部可能具有很高的匹配成本,因此我们可以通过丢弃那些不太可能的候选基部来减少识别时间。此外,为了加快部首检查,我们开发了部首提取算法来缩小参考模板的搜索范围。通过消除错误提取的部首,进一步改进了部首提取算法。通过这些机制,筛选出的候选部首个数是726个部首模板中的286个,要详细匹配的候选字符数是5401个汉字中的123个。通过这些努力,使用Intel 386-33 CPU的PC上,每个字符的识别率提高到96.35%,对于前10个结果候选者的识别率提高到98.96%,平均每字符速度为0.427秒。版权所有(C)1996模式识别学会。 [参考:12]

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号