首页> 外文期刊>IEEE Transactions on Pattern Analysis and Machine Intelligence >Off-line handwritten Chinese character recognition as a compound Bayes decision problem
【24h】

Off-line handwritten Chinese character recognition as a compound Bayes decision problem

机译:离线手写汉字识别是复合贝叶斯决策问题

获取原文
获取原文并翻译 | 示例
           

摘要

A handwritten Chinese character off-line recognizer based on contextual vector quantization (CVQ) of every pixel of an unknown character image has been constructed. Each template character is represented by a codebook. When an unknown image is matched against a template character, each pixel of the image is quantized according to the associated codebook by considering not just the feature vector observed at each pixel, but those observed at its neighbors and their quantization as well. Structural information such as stroke counts observed at each pixel are captured to form a cellular feature vector. Supporting a vocabulary of 4616 simplified Chinese characters and alphanumeric and punctuation symbols, the writer-independent recognizer has an average recognition rate of 77.2 percent. Three statistical language models for postprocessing have been studied for their effectiveness in upgrading the recognition rate of the system. Among them, the CVQ-based language model is the most effective one upgrading the recognition rate by 10.4 percent on the average.
机译:基于未知字符图像的每个像素的上下文向量量化(CVQ),构建了手写汉字离线识别器。每个模板字符由一个密码本表示。当未知图像与模板字符匹配时,不仅考虑在每个像素处观察到的特征向量,而且还考虑在其相邻像素处观察到的特征向量及其量化,根据关联的代码本对图像的每个像素进行量化。捕获结构信息,例如在每个像素处观察到的笔划计数,以形成细胞特征向量。独立于作者的识别器支持4616个简体中文字符,字母数字和标点符号的词汇,平均识别率为77.2%。研究了三种用于后处理的统计语言模型在提高系统识别率方面的有效性。其中,基于CVQ的语言模型是最有效的模型,其识别率平均提高了10.4%。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号