首页> 外文OA文献 >Optical character recognition in images of natural scenes
【2h】

Optical character recognition in images of natural scenes

机译:自然场景图像中的光学字符识别

摘要

The diploma thesis presents a description and implementation of some modern techniques and methods for optical character recognition in images of natural scenes. When choosing methods, we focused on speed and accuracy. As a basis we have chosen the method of directional segment features with nonlinear mesh since it was developed for the mobile platform and thus meets the criteria of speed. Also, the method comparable to other methods reaches very good results. The proposed method was further upgraded with some other popular features extraction methods and classifiers.udOptical recognition of text in images of natural scenes is very problematic, because in them the text appears in a variety of sizes, colors, fonts and orientations. Also pictures of natural scenes typically have lower quality and contain complex background, which greatly complicates the process of recognition. Similar to the classic optical character recognition the systems for optical character recognition in the images of natural scenes typically consist of four steps: preprocessing, segmentation, feature extraction and classification. Preprocessing phase is designed to improve image quality, in the segmentation stage only the pixels that belong to each character are chosen in the picture. Both steps are due to the aforementioned problems of natural scenes extremely important. During the feature extraction phase the characteristics of a segmented character are calculated, which are user for further classification of the character corresponding class.udAll implemented methods were tested on image databases ICDAR, CVL OCR DB, and a hybrid collection that we have generated from the two mentioned databases.udImproved method presented in the thesis has achieved good results and is, in conjunction with the relevant text detection in images of natural scenes, suitable for migration and the use on the mobile platform.
机译:文凭论文介绍了一些用于自然场景图像中光学字符识别的现代技术和方法的描述和实现。在选择方法时,我们专注于速度和准确性。作为基础,我们选择了具有非线性网格的定向线段特征的方法,因为它是为移动平台开发的,因此符合速度标准。同样,与其他方法相比,该方法也能达到很好的效果。提议的方法进一步用其他一些流行的特征提取方法和分类器进行了升级。而且,自然场景的图片通常质量较低且包含复杂的背景,这极大地使识别过程复杂化。与经典的光学字符识别类似,用于自然场景图像中的光学字符识别的系统通常包括四个步骤:预处理,分割,特征提取和分类。预处理阶段旨在提高图像质量,在分割阶段,仅在图片中选择属于每个字符的像素。这两个步骤都是由于上述自然场景问题极为重要。在特征提取阶段,将计算分段字符的特征,以供用户进一步对字符对应类进行分类。本文所提出的改进方法取得了良好的效果,并且结合自然场景图像中的相关文本检测,适合迁移和在移动平台上使用。

著录项

  • 作者

    Petek Rok;

  • 作者单位
  • 年度 2012
  • 总页数
  • 原文格式 PDF
  • 正文语种 {"code":"sl","name":"Slovene","id":39}
  • 中图分类

相似文献

  • 外文文献
  • 中文文献
  • 专利

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号