首页> 外国专利> Offline text recognition without intraword character segmentation based on two-dimensional low frequency discrete Fourier transforms

Offline text recognition without intraword character segmentation based on two-dimensional low frequency discrete Fourier transforms

机译:基于二维低频离散傅里叶变换的无单词内字符分割的离线文本识别

摘要

Image analysis and recognition includes reading text, by digitally scanning a surface, locating the printed material in that digital image, and then recognizing words, phrases, or numbers based on their two dimensional, low frequency Fourier harmonics. One objective is to specifically apply this method of recognition to the postal industry, to include all shipping and labeling applications. Once the image of a word is digitized and isolated, a two-dimensional Fourier transform is computed of the digital image. The process is accomplished in the same manner regardless of the type of surface the printed text comes from, just as long as each word, phrase, or set of numbers to be recognized is isolated, stored in a digital form, and then Fourier Transformed. The sine and cosine coefficients from the Fourier Transform are then filtered to include only the low frequency, terms (i.e. DC term and first 5 harmonics in both vertical and horizontal axis). The sine and cosine terms (coefficients) then define 121 unique vectors which represent a 121 orthogonal vector space. The vector space is normalized to unity and each image of the word, phrase, or set of numbers defines a unique point along this 121 orthogonal vector hypersphere. A library of words, phrases, and/or numbers must be produced using many different font styles. The library when developed, will consist of sine and cosine coefficient values which represent each word, phrase, or number to be recognized. This library is uniquely fashioned by averaging the sine and cosine terms of similar font styles into what is called font groups.
机译:图像分析和识别包括通过数字扫描表面,在该数字图像中定位印刷材料来读取文本,然后基于其二维低频傅立叶谐波识别单词,短语或数字。一个目标是将这种识别方法专门应用于邮政行业,以包括所有运输和标签应用程序。一旦单词的图像被数字化和隔离,就对数字图像进行二维傅立叶变换。只要要隔离每个单词,词组或要识别的数字集,以数字形式存储,然后进行傅立叶变换,就可以用相同的方式完成此过程,而不管打印文本的表面类型如何。然后对来自傅立叶变换的正弦和余弦系数进行滤波以仅包括低频项(即直流项和垂直和水平轴上的前5个谐波)。然后,正弦和余弦项(系数)定义了121个唯一的矢量,它们代表121个正交矢量空间。向量空间被归一化为单位,单词,词组或数字集的每个图像沿此121个正交向量超球定义了一个唯一点。必须使用许多不同的字体样式来生成单词,短语和/或数字库。该库在开发时将由代表要识别的每个单词,短语或数字的正弦和余弦系数值组成。通过将相似字体样式的正弦和余弦项平均到所谓的字体组中,可以独特地构建该库。

著录项

相似文献

  • 专利
  • 外文文献
  • 中文文献
获取专利

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号