首页> 外文OA文献 >Optical character recognition in images of natural scenes
【2h】

Optical character recognition in images of natural scenes

机译:自然场景图像中的光学字符识别

摘要

This masters thesis presents and describes modern methods of optical character recognition in natural scenes. Methods with high classification results and are robust to illumination and geometric transformations were selected for the thesis. Our work is based on the implementation of three different methods for obtaining features. The basic HOG method, which also underlies the other two methods is one of the most popular feature extraction methods in object detection and character recognition. HOG method was primarily used in connection with human detection, but was adapted for character recognition also. PHOG method, which is based on HOG, converts the basic HOG algorithm into a pyramid scheme and also includes bilinear interpolation. Due to the pyramid structure of PHOG, the method is slower than the HOG algorithm, but more precise, since the feature vectors are larger. The third feature extraction method, which we have implemented is Co-HOG algorithm, which inherits all the good qualities of HOG method, such as invariance to illumination and geometric changes. Co-HOG is differs from HOG and PHOG, by its feature representation, where it also captures the spatial relationship of neighbouring pixels in order to describe the character more accurately. Among other things Coo-HOG is also a computationally faster than HOG and PHOG.ududDue to various factors in natural scene text images, the traditional character recognition systems produces inaccurate results, because it assumes that the characters do not differ in fonts and colors and presumes a monotonous background of images, whereas in obtaining features from natural scene images, the algorithms should be robust and invariant to character sizes, background noise, different fonts, local illumination changes and visual effects that draw attention, such as color blending. The above described methods do not require preprocessing and segmentation as traditional systems do, since the extract features with methods that describe the appearance of the object and the shape with gradient intensity and edge directions.ududFeature extraction methods were evaluated on a variety of databases such as ICDAR, Chars74K, CVL OCR DB. We have also generated a synthetic database of character images, that simulates characters in natural scenes, by including large variety of different fonts and noises in images. Synthetic image database was generated with the aim of increasing the training set and the improvement of classification accuracy.
机译:该硕士论文介绍并描述了自然场景中光学字符识别的现代方法。本文选择了分类结果高,对光照鲁棒性强,几何变换可靠的方法。我们的工作基于三种获取特征的方法的实现。基本HOG方法(也是其他两种方法的基础)是对象检测和字符识别中最受欢迎的特征提取方法之一。 HOG方法主要用于人类检测,但也适用于字符识别。基于HOG的PHOG方法将基本的HOG算法转换为金字塔方案,还包括双线性插值。由于PHOG的金字塔结构,该方法比HOG算法慢,但由于特征向量较大,因此更精确。我们实现的第三种特征提取方法是Co-HOG算法,它继承了HOG方法的所有优良特性,例如对光照的不变性和几何变化。 Co-HOG与HOG和PHOG的不同之处在于它的特征表示,它还捕获相邻像素的空间关系,以便更准确地描述字符。在其他方面,Coo-HOG的计算速度也比HOG和PHOG快。 ud ud由于自然场景文本图像中的各种因素,传统的字符识别系统产生的结果不准确,因为它假定字符的字体和字体没有差异。颜色并假定图像的背景是单调的,而在从自然场景图像中获取特征时,算法应具有鲁棒性,并且不影响字符大小,背景噪声,不同字体,局部照明变化以及引起人们注意的视觉效果(例如颜色融合)。上面描述的方法不需要像传统系统那样进行预处理和分割,因为提取特征使用的方法描述对象的外观以及具有梯度强度和边缘方向的形状。 ud ud特征提取方法在各种ICDAR,Chars74K,CVL OCR DB等数据库。我们还生成了一个字符图像合成数据库,通过在图像中包含各种不同的字体和噪点来模拟自然场景中的字符。合成图像数据库的产生是为了增加训练集和提高分类精度。

著录项

  • 作者

    Petek Rok;

  • 作者单位
  • 年度 2016
  • 总页数
  • 原文格式 PDF
  • 正文语种
  • 中图分类

相似文献

  • 外文文献
  • 中文文献
  • 专利

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号