Journal: Pattern Recognition: The Journal of the Pattern Recognition Society

IMAGE-BASED KEYWORD RECOGNITION IN ORIENTAL LANGUAGE DOCUMENT IMAGES


Abstract

An algorithm is presented for keyword recognition in Oriental language document images. The objective is to recognize keywords composed of more than one consecutive character in document images where there are no explicit visually defined word boundaries. The technique exploits the redundancy expressed by the difference between the number of possible character strings of a fixed length and the number of legal words of that length. Sequences of character images are matched simultaneously to a dictionary of keywords and illegal strings that are visually similar to the keywords. A keyword is located if its image is more likely to occur than any of the illegal strings that are visually similar to it. No intermediate character recognition step is used. The application of contextual information directly to the interpretation of features extracted from the image overcomes noise that could make isolated character recognition impossible and the location of words with conventional post-processing algorithms difficult. Experimental results demonstrate the ability of the proposed algorithm to correctly recognize words in the presence of noise that could not be overcome by conventional character recognition or post-processing algorithms. (C) 1997 Pattern Recognition Society. Published by Elsevier Science Ltd. [References: 11]
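
The decision rule described in the abstract (accept a keyword only when it is more likely than every visually similar illegal string) can be illustrated with a minimal sketch. This is not the paper's implementation: the abstract does not specify the image features or the likelihood model, so the per-position score tables, the floor value `MIN_LOG_PROB`, the function names, and the example strings below are all assumptions made for illustration. The scores stand in for image-to-prototype match likelihoods, and no hard per-character decision is taken before whole strings are compared.

```python
import math

# Hypothetical floor for shapes with no recorded score at a position (assumption).
MIN_LOG_PROB = math.log(1e-6)


def string_log_likelihood(candidate, window):
    """Sum the per-position log scores of `candidate` over a window of positions."""
    return sum(pos.get(ch, MIN_LOG_PROB) for ch, pos in zip(candidate, window))


def locate_keyword(keyword, decoys, position_scores):
    """Return start offsets where `keyword` strictly outscores every decoy.

    `decoys` are illegal strings that look similar to the keyword and are
    assumed to have the same length as the keyword.
    """
    n = len(keyword)
    hits = []
    for start in range(len(position_scores) - n + 1):
        window = position_scores[start:start + n]
        keyword_score = string_log_likelihood(keyword, window)
        best_decoy = max(
            (string_log_likelihood(d, window) for d in decoys),
            default=-math.inf,
        )
        if keyword_score > best_decoy:
            hits.append(start)
    return hits


# Made-up scores for four consecutive character images (illustrative only).
scores = [
    {"大": math.log(0.9)},
    {"日": math.log(0.5), "目": math.log(0.4)},
    {"本": math.log(0.6), "木": math.log(0.3)},
    {"人": math.log(0.9)},
]

hits = locate_keyword("日本", ["目本", "日木", "目木"], scores)
print(hits)
```

In this toy input the keyword is accepted only at offset 1, where its combined score exceeds that of each look-alike string. At the other offsets the keyword and the decoys tie at the floor score, so nothing is reported. Comparing whole strings rather than committing to one character per position is what lets context compensate for an ambiguous individual image.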
