首页> 外文会议>IAPR International Conference on Document Analysis and Recognition >Grayscale-Projection Based Optimal Character Segmentation for Camera-Captured Faint Text Recognition
【24h】

Grayscale-Projection Based Optimal Character Segmentation for Camera-Captured Faint Text Recognition

机译:基于灰度投影的相机捕获模糊文本识别的最佳字符分割

获取原文

摘要

The faint text document images possess shallow characters inherently and the camera-captured form introduces more degradations such as low-resolution, non-uniform illumination and out-of-focus blur, which make the text binarization very difficult. In this paper, we propose a grayscale-projection based optimal character segmentation method for camera-captured faint text recognition. Instead of extracting the character candidates, we use the gradient projection to extract a series of segmentation candidates which contain inter-character gaps and intra-character gaps as well. In order to select the optimal segmentation path from all possible situations, we construct a segmentation tree and set a evaluation score for each path. The score integrates the information of single point projection, overall distribution and recognition probability. Finally the optimal segmentation path is obtained by selecting the path with the highest score. We collect a faint text recognition dataset and evaluate our method on it. Experimental results show that our method outperforms the binary-projection method and the convolutional recurrent neural network approach in terms of text segmentation and recognition accuracy.
机译:淡淡的文本文档图像固有地具有较浅的字符,而相机捕获的格式引入了更多的降级,例如低分辨率,照明不均匀和离焦模糊,这使得文本二值化非常困难。在本文中,我们提出了一种基于灰度投影的最优字符分割方法,用于相机捕获的模糊文本识别。代替提取候选字符,我们使用梯度投影提取一系列包含候选字符间隙和字符内间隙的分割候选字符。为了从所有可能的情况中选择最佳分割路径,我们构造了一个分割树并为每个路径设置了一个评估分数。分数综合了单点投影,总体分布和识别概率的信息。最后,通过选择得分最高的路径来获得最佳分割路径。我们收集了一个模糊的文本识别数据集,并在此数据集上评估了我们的方法。实验结果表明,我们的方法在文本分割和识别准确性方面优于二元投影方法和卷积递归神经网络方法。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号