A novel approach for skew estimation of document images in OCR system

Abstract

Optical character recognition (OCR) is an area that has long received special attention. OCR systems are typically built on a divide-and-conquer strategy: rather than recognizing a document in one go, they process it in several stages. Of the many stages in a typical OCR system, the preprocessing stage is considered indispensable. An input image or other input information needs to be normalized and converted into a format acceptable to the OCR system. OCR systems typically assume that documents were printed with a single text direction and that the acquisition process did not introduce a relevant skew. In practice this assumption often fails, and a printed document may be skewed at some angle to the horizontal axis. In this paper, we propose a new technique for skew estimation of document images. In the proposed scheme, multiscale properties of an image are used together with principal component analysis to estimate the orientation of the principal axis of clustered data.
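
The abstract's core idea, reading the skew angle off the principal axis of a cluster of foreground pixels, can be illustrated with a minimal sketch. This is not the authors' method: the abstract does not specify the multiscale step or how pixels are clustered, so both are omitted here and PCA is applied directly to all foreground pixels of one cluster (for example, a single text line). The function name `estimate_skew_pca` and the convention that nonzero pixels are ink are assumptions made for illustration.

```python
import numpy as np


def estimate_skew_pca(binary_image):
    """Estimate the skew angle (degrees) of one cluster of foreground pixels.

    Hypothetical helper, not taken from the paper. Assumes `binary_image` is a
    2-D array in which nonzero entries are ink. Positive angles mean the
    cluster rises from left to right (counterclockwise from the horizontal).
    """
    rows, cols = np.nonzero(binary_image)
    if rows.size < 2:
        raise ValueError("need at least two foreground pixels")

    # Use x = column and y = -row so that y points upward, as in ordinary
    # Cartesian coordinates (image rows grow downward).
    coords = np.stack([cols, -rows]).astype(float)
    coords -= coords.mean(axis=1, keepdims=True)

    # PCA: the eigenvector of the 2x2 covariance matrix with the largest
    # eigenvalue is the direction of greatest spread, i.e. the principal axis.
    covariance = np.cov(coords)
    eigenvalues, eigenvectors = np.linalg.eigh(covariance)
    vx, vy = eigenvectors[:, np.argmax(eigenvalues)]

    # An axis has no sign, so fold the angle into the range (-90, 90].
    angle = np.degrees(np.arctan2(vy, vx))
    if angle > 90:
        angle -= 180
    elif angle <= -90:
        angle += 180
    return angle
```

Applied to a whole multi-line page, the principal axis would mostly reflect the shape of the text block rather than the direction of the text lines, which is presumably why the abstract speaks of clustered data. A practical use of this sketch would call it per cluster and combine the resulting angles.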