首页> 外文会议>International Workshop on Document Analysis Systems >Top-Down Likelihood Word Image Generation Model for Holistic Word Recognition
【24h】

Top-Down Likelihood Word Image Generation Model for Holistic Word Recognition

机译:自上而下的似然词整体词识别的词图像生成模型

获取原文

摘要

This paper describes a new top-down word image generation model for word recognition. this model an generate a word image with a likelihood based on linguistic knowledge, segmentation and character image. In the recognition process, first, the model generates the word image which approximates an input image best for each of a dictionary of possible words. next, the model calculates the distance value between the input image and each generated word image. Thus, the proposed method is a type of holistic word recognition method. The effectiveness of the proposed method was evaluated in an experiment using type-written museum archive card images. The difference between a non-holistic method and the proposed method is shown by the evaluation. The small errors accumulate in non-holistic methods during the process carried out, because the non-holistic methods can't cover the whole word image but only part images extracted by segmentation, and the non-holistic method can't eliminate the black pixels intruding in the recognition window from neighboring characters. In the proposed method, we can expect that no such errors will accumulate. Results show that a recognition rate of 99.8% was obtained, compared with only 89.4% for a recently published comparator algorithm.
机译:本文介绍了一种用于Word识别的新型自上而下的单词图像生成模型。该模型基于语言知识,分段和字符图像生成具有似然的单词图像。在识别过程中,首先,模型生成对可能字典中的每条字典的最佳输入图像的单词图像。接下来,模型计算输入图像和每个生成的字图像之间的距离值。因此,所提出的方法是一种全面词识别方法。使用类型博物馆档案卡片图像在实验中评估了所提出的方法的有效性。评估显示了非全整体方法和所提出的方法之间的差异。在进行过程中,小错误在非全整体方法中累积,因为非整体方法无法覆盖整个单词图像,而是仅通过分段提取的部分图像,而非全面方法不能消除黑色像素从邻居字符中侵入识别窗口。在提出的方法中,我们可以期望不会积累这样的错误。结果表明,获得了99.8%的识别率,而最近公布的比较器算法仅为89.4%。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号