RESOLUTION ADJUSTMENT OF AN IMAGE THAT INCLUDES TEXT UNDERGOING AN OCR PROCESS

Abstract

An optical character recognition (OCR) process characterizes text lines in a textual image by their base-line, mean-line and x-height. The base-line for at least one text line in the image is determined by finding a parametric curve that maximizes a first fitness function, which depends on the values of the pixels through which the parametric curve passes and of the pixels below the curve. The base-line corresponds to the parametric curve for which the first fitness function is maximized. The first fitness function is designed so that it increases with increasing lightness (brightness) of the pixels immediately below the parametric curve while also increasing with decreasing lightness of the pixels through which the curve passes. The mean-line is determined by incrementally shifting the base-line upward by predetermined amounts (e.g., a single pixel) until a second fitness function for the shifted base-line is maximized. The second fitness function is essentially the inverse of the first fitness function. Specifically, the second fitness function increases with increasing lightness of the pixels immediately above the shifted base-line while also increasing with decreasing lightness of the pixels through which the shifted base-line passes. The x-height is equal to the sum of the predetermined amounts by which the base-line is shifted upward in order to maximize the second fitness function. In some cases, different groups of text lines in the textual image may be characterized differently from one another. For example, each group may be characterized by a most probable x-height for that group.
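
The procedure described in the abstract can be illustrated with a minimal Python sketch. It is not the patented implementation. Several simplifications are assumptions made here for illustration: the parametric curve is restricted to horizontal lines, each fitness function is a plain sum of lightness differences between a pixel on the curve and its vertical neighbor, the mean-line search takes the best score over all upward shifts, and "most probable x-height" is read as the most frequent value in a group. All function names (`fitness`, `find_base_line`, `find_x_height`, `most_probable_x_height`) are hypothetical.

```python
from collections import Counter
from typing import List

Image = List[List[int]]  # rows top to bottom; 0 = dark, 255 = light


def fitness(img: Image, curve: List[int], neighbor: int) -> int:
    """Sum over columns of (neighbor lightness - on-curve lightness).

    curve[x] is the row the curve passes through in column x.
    neighbor=+1 compares with the pixel just below (first fitness function);
    neighbor=-1 compares with the pixel just above (second fitness function).
    Columns whose pixel or neighbor falls outside the image are skipped.
    """
    height = len(img)
    total = 0
    for x, y in enumerate(curve):
        ny = y + neighbor
        if 0 <= y < height and 0 <= ny < height:
            total += img[ny][x] - img[y][x]
    return total


def find_base_line(img: Image) -> int:
    """Row of the horizontal line that maximizes the first fitness function."""
    width = len(img[0])
    return max(range(len(img) - 1),
               key=lambda y: fitness(img, [y] * width, +1))


def find_x_height(img: Image, base_row: int, step: int = 1) -> int:
    """Total upward shift of the base-line that maximizes the second fitness."""
    width = len(img[0])
    best_shift, best_score = 0, None
    shift = step
    while base_row - shift >= 1:  # keep one row above the shifted line
        score = fitness(img, [base_row - shift] * width, -1)
        if best_score is None or score > best_score:
            best_shift, best_score = shift, score
        shift += step
    return best_shift


def most_probable_x_height(x_heights: List[int]) -> int:
    """Most frequent x-height in a group of text lines (assumed: the mode)."""
    return Counter(x_heights).most_common(1)[0][0]


# Synthetic 6x8 text line: dark body in rows 2-5, an ascender in column 0,
# a descender in column 5, light background elsewhere.
W, D = 255, 0
demo: Image = [
    [D, W, W, W, W, W],
    [D, W, W, W, W, W],
    [D, D, D, D, D, D],
    [D, D, D, D, D, D],
    [D, D, D, D, D, D],
    [D, D, D, D, D, D],
    [W, W, W, W, W, D],
    [W, W, W, W, W, D],
]

base_row = find_base_line(demo)
x_height = find_x_height(demo, base_row)
mean_row = base_row - x_height
print(base_row, mean_row, x_height)  # prints: 5 2 3
```

On the synthetic image, the base-line lands on the last dark row of the text body (row 5), where the pixels just below are light. The mean-line lands on the first dark row (row 2), where the pixels just above are light, and the x-height is the 3-pixel shift between them. The ascender and descender columns lower the score slightly but do not move either line.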

Bibliographic data

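  • Publication number: EP2545498B1
  • Patent type:
  • Publication date: 2021-04-21
  • Original format: PDF
  • Applicant/patentee:
  • Application number: EP20110753864
  • Filing date: 2011-03-07
  • Classification: G06K9/20; G06K19; G06K9; G06K9/32
  • Country: EP
  • Date indexed: 2024-06-14 21:27:13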
