A shadow removal method for tesseract text recognition

Abstract

For shadowed text images, the character recognition performance of Tesseract drops significantly. In this paper, we propose a new method for preprocessing shadowed text images before they are passed to Tesseract's optical character recognition (OCR) engine. First, a local adaptive threshold algorithm transforms the grayscale image into a binary image to capture the contours of the text. Next, to remove the salt-and-pepper noise in the shadow areas, we propose a double-filtering algorithm in which a projection method removes the noise between text regions and a median filter removes the noise within characters. Finally, the processed binary image is fed into Tesseract's OCR engine. Experimental results show that the proposed method achieves better character recognition performance.
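The three preprocessing stages named in the abstract can be illustrated with a minimal, dependency-free sketch. This is not the authors' implementation: the abstract gives no parameters, so the function names, the window `radius`, the `offset`, and the `min_ink` cutoff are all assumptions, and the projection step is reduced here to a simple row-wise (horizontal) projection. Images are plain lists of lists, with 1 meaning ink in the binary image.

```python
def _window(img, y, x, radius):
    """Pixel values in the square neighbourhood of (y, x), clipped at the borders."""
    h, w = len(img), len(img[0])
    return [img[j][i]
            for j in range(max(0, y - radius), min(h, y + radius + 1))
            for i in range(max(0, x - radius), min(w, x + radius + 1))]


def adaptive_threshold(gray, radius=1, offset=10):
    """Mark a pixel as ink (1) if it is darker than its local mean minus `offset`.

    `radius` and `offset` are illustrative values, not taken from the paper.
    """
    h, w = len(gray), len(gray[0])
    binary = [[0] * w for _ in range(h)]
    for y in range(h):
        for x in range(w):
            win = _window(gray, y, x, radius)
            if gray[y][x] < sum(win) / len(win) - offset:
                binary[y][x] = 1
    return binary


def projection_filter(binary, min_ink=2):
    """Clear rows whose horizontal projection (ink-pixel count) is below `min_ink`.

    Assumption: sparse rows are gaps between text lines, so any ink there is noise.
    """
    return [row[:] if sum(row) >= min_ink else [0] * len(row) for row in binary]


def median_filter(binary, radius=1):
    """Median filter; on 0/1 data the median is a majority vote over the window.

    Ties in even-sized border windows resolve to background (0).
    """
    h, w = len(binary), len(binary[0])
    out = [[0] * w for _ in range(h)]
    for y in range(h):
        for x in range(w):
            win = _window(binary, y, x, radius)
            out[y][x] = int(2 * sum(win) > len(win))
    return out


def preprocess(gray):
    """Threshold, then filter between text lines, then filter within characters."""
    binary = adaptive_threshold(gray)
    binary = projection_filter(binary)
    return median_filter(binary)
```

In practice the same stages would more likely be built on a library such as OpenCV (`cv2.adaptiveThreshold`, `cv2.medianBlur`), with the cleaned image handed to Tesseract, for example through `pytesseract.image_to_string`. Note that a 3x3 median also erases strokes one pixel wide, so the window size has to be chosen relative to the stroke width.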

Bibliographic details

Source
International Congress on Image and Signal Processing, BioMedical Engineering and Informatics, 2017, pp. 1-5 (5 pages)
Authors
Huimin Lu; Baofeng Guo; Juntao Liu; Xijun Yan
Original format
PDF
Keywords
Optical character recognition software; Text recognition; Engines; Noise reduction; Microsoft Windows; Character recognition; Gray-scale

Similar literature

1. A shadow detection and removal method for fruit recognition in natural environments [J]. Precision Agriculture. 2020, No. 4
2. Text Extraction and Recognition from the Normal Images using MSER Feature Extraction and Text Segmentation Methods [J]. Nitin Sharma, Nidhi. Indian Journal of Science and Technology. 2017, No. 17
3. Object Oriented Shadow Detection and an Enhanced Method for Shadow Removal [J]. Divya S Kumar, Neenu Wilson. International Journal on Computer Science and Engineering. 2016, No. 6
4. A shadow removal method for tesseract text recognition [C]. Huimin Lu, Baofeng Guo, Juntao Liu, International Congress on Image and Signal Processing, BioMedical Engineering and Informatics. 2017
5. Shadow removal for action recognition in a smart condo environment. [D]. Eskandari, Samaneh. 2014
6. A Study of Active Learning Methods for Named Entity Recognition in Clinical Text [O]. Yukun Chen, Thomas A. Lasko, Qiaozhu Mei, -1
7. Retraction Note to: Multiview Gait Recognition Based on Silhouettes Generated after Shadow Detection and Removal Using Photometric Properties Method [O]. Rohit Katiyar, K. V. Arya, Vinay Kumar Pathak. 2012
8. Statistical Removal of Shadow for Applications to Gait Recognition [R]. Hockersmith, B. 2008
