首页> 外文会议>International Conference on Document Analysis and Recognition >Image Binarization for End-to-End Text Understanding in Natural Images
【24h】

Image Binarization for End-to-End Text Understanding in Natural Images

机译:在自然图像中实现端到端文本的图像二值化

获取原文

摘要

While modern off-the-shelf OCR engines show particularly high accuracy on scanned text, text detection and recognition in natural images still remains a challenging problem. Here, we demonstrate that OCR engines can still perform well on this harder task as long as appropriate image binarization is applied to input photographs. For such binarization, we systematically evaluate the performance of 12 binarization methods as well as of a new binarization algorithm that we propose here. Our evaluation includes different metrics and uses established natural image text recognition benchmarks (ICDAR 2003 and ICDAR 2011). Our main finding is thus the fact that image binarization methods combined with additional filtering of generated connected components and off-the-shelf OCR engines can achieve state-of-the-art performance for end-to-end text understanding in natural images.
机译:虽然现代现成的OCR发动机在扫描文本上表现出特别高的准确性,但在自然图像中的文本检测和识别仍然是一个具有挑战性的问题。在这里,我们证明,只要应用于输入照片的适当的图像二值化,OCR发动机仍然可以很好地执行良好的任务。对于此类二值化,我们系统地评估了12个二值化方法的性能以及我们在此提出的新二值化算法。我们的评估包括不同的指标,并使用已建立的自然图像文本识别基准(ICDAR 2003和ICDAR 2011)。因此,我们的主要发现是,图像二值化方法与所产生的连接部件和现成的OFR发动机的额外滤波相结合,可以实现自然图像中的最终文本理解的最先进的性能。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号