首页> 外国专利> Degraded gray-scale document recognition using pseudo two-dimensional hidden markov models and N-best hypotheses

Degraded gray-scale document recognition using pseudo two-dimensional hidden markov models and N-best hypotheses

机译:使用伪二维隐藏马尔可夫模型和N-最佳假设进行灰度文档识别

摘要

Methods are disclosed for recognizing connected and degraded text embedded in a gray-scale image. Gray-scale pseudo two-dimensional hidden Markov models (HMMs) are used to represent images containing text elements such as characters or words. Observation vectors for the image are produced from pixel maps obtained by gray-scale optical scanning. Three components are employed to characterize a pixel: a convolved, quantized gray-level component, a pixel relative position component, and a pixel major stroke direction component. These components are organized as an observation vector, which is continuous in nature, invariant in different font sizes, and flexible for use in various quantization processes. In this manner, information loss or distortion due to binarization processes is eliminated; moreover, in cases where documents are binary in nature (e.g., faxed documents), the image may be compressed by subsampling into multi(gray)-level without losing information, thereby enabling recognition of the compressed images in a much shorter time. Furthermore, documents may be scanned and processed in gray-level with much lower resolution than in binary without sacrificing the performance. This can significantly increase the processing speed.
机译:公开了用于识别嵌入在灰度图像中的连接的和降级的文本的方法。灰度伪二维隐藏马尔可夫模型(HMM)用于表示包含文本元素(例如字符或单词)的图像。图像的观察向量是从通过灰度光学扫描获得的像素图生成的。采用三个分量来表征像素:卷积,量化的灰度分量,像素相对位置分量和像素主笔划方向分量。这些组件被组织为观察向量,其本质上是连续的,不同字体大小不变且可灵活用于各种量化过程。以这种方式,消除了由于二值化处理引起的信息丢失或失真;此外,在文档本质上是二进制的情况下(例如,传真文档),可以通过在不丢失信息的情况下通过二次采样将图像压缩为多(灰色)级别,从而能够在更短的时间内识别压缩的图像。此外,可以在不牺牲性能的情况下以比二进制文件低得多的分辨率以灰度级扫描和处理文档。这样可以大大提高处理速度。

著录项

  • 公开/公告号EP0694862A2

    专利类型

  • 公开/公告日1996-01-31

    原文格式PDF

  • 申请/专利权人 AT&T CORP.;

    申请/专利号EP19950401397

  • 发明设计人 KUO SHYH-SHIAW;YEN CHINCHING;

    申请日1995-06-15

  • 分类号G06K9/68;

  • 国家 EP

  • 入库时间 2022-08-22 03:47:18

相似文献

  • 专利
  • 外文文献
  • 中文文献
获取专利

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号