Using pyramid of histogram of oriented gradients on natural scene text recognition


Abstract

Because scene text appears in unconstrained environments, traditional Optical Character Recognition (OCR) engines fail to achieve satisfactory results on it. In this paper, we propose a new technique that applies first-order Histogram of Oriented Gradients (HOG) through a spatial pyramid. The spatial pyramid encodes the relative spatial layout of the character parts, whereas HOG alone captures only local image shape without spatial relations. A feature descriptor combining the two can extract more useful information from the image for text recognition. A chi-square kernel based Support Vector Machine is used to classify the proposed feature descriptors. The method is tested on three public datasets: the ICDAR2003 robust reading dataset, the Street View Text (SVT) dataset and the IIIT 5K-word dataset. The results on these datasets are comparable with those of state-of-the-art methods.
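The idea in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the function names `phog` and `chi2_kernel`, the number of pyramid levels, the 9 unsigned orientation bins, the L1 normalisation and the exponential form of the chi-square kernel are all assumptions chosen for illustration, since the abstract does not specify them.

```python
import numpy as np

def orientation_histogram(mag, ang, n_bins):
    """Magnitude-weighted histogram of unsigned orientations in [0, pi)."""
    bins = np.minimum((ang / np.pi * n_bins).astype(int), n_bins - 1)
    return np.bincount(bins.ravel(), weights=mag.ravel(), minlength=n_bins)

def phog(image, levels=3, n_bins=9):
    """Concatenate orientation histograms over a spatial pyramid.

    Level l splits the image into 2**l x 2**l cells, so coarse levels
    describe global shape and fine levels keep the spatial layout.
    (levels and n_bins are illustrative defaults, not the paper's values.)
    """
    image = np.asarray(image, dtype=float)
    gy, gx = np.gradient(image)              # first-order gradients
    mag = np.hypot(gx, gy)
    ang = np.mod(np.arctan2(gy, gx), np.pi)  # unsigned orientation
    h, w = image.shape
    feats = []
    for level in range(levels):
        cells = 2 ** level
        ys = np.linspace(0, h, cells + 1).astype(int)
        xs = np.linspace(0, w, cells + 1).astype(int)
        for i in range(cells):
            for j in range(cells):
                m = mag[ys[i]:ys[i + 1], xs[j]:xs[j + 1]]
                a = ang[ys[i]:ys[i + 1], xs[j]:xs[j + 1]]
                feats.append(orientation_histogram(m, a, n_bins))
    v = np.concatenate(feats)
    total = v.sum()
    return v / total if total > 0 else v     # L1-normalise (assumed)

def chi2_kernel(X, Y, gamma=1.0, eps=1e-12):
    """Exponential chi-square kernel between rows of X and rows of Y."""
    X = np.asarray(X, dtype=float)
    Y = np.asarray(Y, dtype=float)
    diff = X[:, None, :] - Y[None, :, :]
    summ = X[:, None, :] + Y[None, :, :]
    dist = np.sum(diff ** 2 / (summ + eps), axis=2)
    return np.exp(-gamma * dist)
```

With these defaults the descriptor has (1 + 4 + 16) x 9 = 189 dimensions. The Gram matrix returned by `chi2_kernel` could then be handed to any SVM that accepts a precomputed kernel, for example scikit-learn's `SVC(kernel="precomputed")`.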


Bibliographic record

Source
IEEE International Conference on Image Processing, 2014, pp. 2629-2633 (5 pages)
Authors
Zhi Rong Tan; Shangxuan Tian; Chew Lim Tan
Original format: PDF
Keywords
feature extraction; image classification; support vector machines; text detection; HOG; ICDAR2003 robust reading dataset; IIIT 5K-word dataset; SVT dataset; Street View Text; chi-square kernel; classification; feature descriptor; histogram of oriented gradients; local image shape; natural scene text recognition; spatial pyramid; support vector machine; character recognition; histograms; kernel; shape; testing; text recognition


Similar literature


1. HSPOG: An optimized target recognition method based on histogram of spatial pyramid oriented gradients [J]. Shaojun Guo, Feng Liu, Xiaohu Yuan. Tsinghua Science and Technology. 2021, No. 4
2. HSPOG: An Optimized Target Recognition Method Based on Histogram of Spatial Pyramid Oriented Gradients [J]. Shaojun Guo, Feng Liu, Xiaohu Yuan. 清华大学学报（英文版） (Journal of Tsinghua University, English Edition). 2021, No. 4
3. Driving Posture Recognition by Joint Application of Motion History Image and Pyramid Histogram of Oriented Gradients [J]. Chao Yan, Frans Coenen, Bailing Zhang. International Journal of Vehicular Technology. 2014
4. Using pyramid of histogram of oriented gradients on natural scene text recognition [C]. Zhi Rong Tan, Shangxuan Tian, Chew Lim Tan. IEEE International Conference on Image Processing. 2014
5. Unified detection and recognition for reading text in scene images [D]. Weinman, Jerod J. 2008
6. Histogram of Oriented Gradient-Based Fusion of Features for Human Action Recognition in Action Video Sequences [O]. Chirag I. Patel, Dileep Labana, Sharnil Pandya. 2020
7. Using Pyramid of Histogram of Oriented Gradients on Natural Scene Text Recognition [O]. Zhi Rong Tan, Shangxuan Tian, Chew Lim Tan. 2015
8. Implementation of a Cascaded Histogram of Oriented Gradient (HOG)-Based Pedestrian Detector [R]. Reale, C., Gurram, P., Hu, S. 2013
