...
首页> 外文期刊>Image and Vision Computing >Combination of HMMs for the representation of printed characters in noisy document images
【24h】

Combination of HMMs for the representation of printed characters in noisy document images

机译:HMM的组合,用于在嘈杂的文档图像中表示打印字符

获取原文
获取原文并翻译 | 示例
           

摘要

Many methods of printed character recognition have been proposed to-date, but although performance figures are usually stated for a particular set of fonts or size of text, it is rarely clear under what conditions of noise the measurements were taken. Baird has suggested a model of Document Imaging Defects, which enables authors to compare results against an emerging standard where one figure can be quoted to quantify the level of noise present in the document image. In this paper, a novel method is proposed for the recognition of printed characters, and its extension to the segmentation and recognition of noisy printed words is outlined. The method is based on the representation of the shape of a character by two Hidden Markov Models. Recognition is achieved by scoring these models against the test pattern and combining the results. The method has been evaluated using Baird's noise model, producing a peak performance of 99.5% on the test set in the presence of near-minimal noise. The method generalizes to recognize characters with noise levels greater than those included in the training set, and an investigation of the top-k performance suggests that much of the effect of noise on the recognition performance on images of natural language text could be overcome using a word recognizer employing shallow contextual knowledge.
机译:迄今为止,已经提出了许多打印字符识别的方法,但是尽管通常针对一组特定的字体或文本大小来说明性能数字,但很少能在噪声条件下进行测量。 Baird提出了“文档影像缺陷”模型,该模型使作者可以将结果与新兴标准进行比较,该标准可以引用一个数字来量化文档图像中存在的噪声水平。本文提出了一种新的识别印刷字符的方法,并概述了其扩展到噪声印刷词的分割和识别。该方法基于两个隐马尔可夫模型对字符形状的表示。通过根据测试模式对这些模型评分并组合结果来实现识别。该方法已使用贝尔德(Baird)噪声模型进行了评估,在存在接近最小噪声的情况下,测试集的峰值性能为99.5%。该方法可以识别噪声水平高于训练集中的噪声的字符,对top-k性能的研究表明,噪声可以通过使用自然语言来克服,对自然语言文本图像上的识别性能的影响很大。使用浅层上下文知识的单词识别器。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号