...
首页> 外文期刊>Journal of Signal and Information Processing >Pre-Processing Images of Public Signage for OCR Conversion
【24h】

Pre-Processing Images of Public Signage for OCR Conversion

机译:预处理用于OCR转换的公共标牌图像

获取原文
           

摘要

In this paper, we propose a novel method to enhance the OCR (Optical Character Recognition) readability of public signboards captured by smart-phone cameras—both outdoors and indoors, and subject to various lighting conditions. A distinct feature of our technique is the detection of these signs in the HSV (Hue, Saturation and Value) color space, done in order to filter out the signboard from the background, and correctly interpret the textual details of each signboard. This is then binarized using a thresholding technique that is optimized for text printed on contrasting backgrounds, and passed through the Tesseract engine to detect individual characters. We test out our technique on a dataset of over 200 images taken in and around the campus of our college, and are successful in attaining better OCR results in comparison to traditional methods. Further, we suggest the utilization of a method to automatically assign ROIs (Regions Of Interest) to detected signboards, for better recognition of textual information.
机译:在本文中,我们提出了一种新颖的方法来增强智能手机相机在室外和室内以及在各种光照条件下捕获的公共招牌的OCR(光学字符识别)可读性。我们技术的一个独特功能是在HSV(色相,饱和度和值)色彩空间中检测这些标志,以从背景中过滤掉招牌,并正确解释每个招牌的文字细节。然后使用阈值技术对图像进行二值化,该技术针对在对比背景上打印的文本进行了优化,并通过Tesseract引擎检测单个字符。我们在大学校园内外拍摄的200幅图像的数据集上测试了我们的技术,与传统方法相比,它成功地获得了更好的OCR结果。此外,我们建议使用一种方法来自动将ROI(感兴趣区域)分配给检测到的招牌,以更好地识别文本信息。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号