首页> 外文会议>IEEE International Conference on Image Processing >Using pyramid of histogram of oriented gradients on natural scene text recognition
【24h】

Using pyramid of histogram of oriented gradients on natural scene text recognition

机译:在自然场景文本识别中使用定向梯度直方图金字塔

获取原文

摘要

Because of the unconstrained environment of scene text, traditional Optical Character Recognition (OCR) engines fail to achieve satisfactory results. In this paper, we propose a new technique which employs first order Histogram of Oriented Gradient (HOG) through a spatial pyramid. The spatial pyramid can encode the relative spatial layout of the character parts while HOG can only include the local image shape without spatial relation. A feature descriptor combining these two can extracts more useful information from the image for text recognition. Chi-square kernel based Support Vector Machine is employed for classification based on the proposed feature descriptors. The method is tested on three public datasets, namely ICDAR2003 robust reading dataset, Street View Text (SVT) dataset and IIIT 5K-word dataset. The results on these dataset are comparable with the state-of-the-art methods.
机译:由于场景文本的环境不受限制,因此传统的光学字符识别(OCR)引擎无法获得令人满意的结果。在本文中,我们提出了一种新技术,该技术通过空间金字塔采用定向梯度直方图(HOG)。空间金字塔可以编码字符部分的相对空间布局,而HOG仅可以包括局部图像形状而没有空间关系。结合这两者的特征描述符可以从图像中提取更多有用的信息以进行文本识别。基于卡方核的支持向量机用于基于提出的特征描述符的分类。该方法在三个公共数据集上进行了测试,即ICDAR2003健壮阅读数据集,街景文本(SVT)数据集和IIIT 5K字数据集。这些数据集上的结果与最新方法相当。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号