Image Binarization for End-to-End Text Understanding in Natural Images

机译：在自然图像中实现端到端文本的图像二值化

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

While modern off-the-shelf OCR engines show particularly high accuracy on scanned text, text detection and recognition in natural images still remains a challenging problem. Here, we demonstrate that OCR engines can still perform well on this harder task as long as appropriate image binarization is applied to input photographs. For such binarization, we systematically evaluate the performance of 12 binarization methods as well as of a new binarization algorithm that we propose here. Our evaluation includes different metrics and uses established natural image text recognition benchmarks (ICDAR 2003 and ICDAR 2011). Our main finding is thus the fact that image binarization methods combined with additional filtering of generated connected components and off-the-shelf OCR engines can achieve state-of-the-art performance for end-to-end text understanding in natural images.

机译：虽然现代现成的OCR发动机在扫描文本上表现出特别高的准确性，但在自然图像中的文本检测和识别仍然是一个具有挑战性的问题。在这里，我们证明，只要应用于输入照片的适当的图像二值化，OCR发动机仍然可以很好地执行良好的任务。对于此类二值化，我们系统地评估了12个二值化方法的性能以及我们在此提出的新二值化算法。我们的评估包括不同的指标，并使用已建立的自然图像文本识别基准（ICDAR 2003和ICDAR 2011）。因此，我们的主要发现是，图像二值化方法与所产生的连接部件和现成的OFR发动机的额外滤波相结合，可以实现自然图像中的最终文本理解的最先进的性能。

著录项

来源
《International Conference on Document Analysis and Recognition》|2013年||共5页
会议地点
作者
Milyaev Sergey; Barinova Olga; Novikova Tatiana; Kohli Pushmeet;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类 TP391.41;
关键词
natural scene binarization; text localization;

机译：自然场景二值化;文本本地化;

相似文献

外文文献
中文文献
专利

1. Multi-Oriented Text Detection in Natural Scene Images Based on the Intersection of MSER With the Locally Binarized Image [J] . Anurag Agrahari, Rajib Ghosh Procedia Computer Science . 2020,第5期

机译：基于局部二值化图像的MSER的自然场景图像中的多面文本检测
2. Cursive-Text: A Comprehensive Dataset for End-to-End Urdu Text Recognition in Natural Scene Images [J] . Asghar Ali Chandio, Md. Asikuzzaman, Mark Pickering, Data in Brief . 2020,第3期

机译：Cursive-Text：自然场景图像中的端到端核心文本识别的全面数据集
3. A novel method for binarization of scene text images and its application in text identification [J] . Ghoshal Ranjit, Roy Anandarup, Banerjee Ayan, Pattern Analysis and Applications . 2019,第4期

机译：一种场景文本图像二值化的新方法及其在文本识别中的应用
4. Image Binarization for End-to-End Text Understanding in Natural Images [C] . Milyaev Sergey, Barinova Olga, Novikova Tatiana, International Conference on Document Analysis and Recognition . 2013

机译：图像二值化，用于自然图像中的端到端文本理解
5. Effective and efficient binarization of degraded document images. [D] . Parker, Jon Ivan. 2016

机译：对退化的文档图像进行有效和高效的二值化。
6. Cursive-Text: A Comprehensive Dataset for End-to-End Urdu Text Recognition in Natural Scene Images [O] . Asghar Ali Chandio, Md. Asikuzzaman, Mark Pickering, 2020

机译：草书文本：用于自然场景图像中端到端乌尔都语文本识别的综合数据集
7. Edge based Binarization for Video Text Images [O] . Zhiwei Zhou, Linlin Li, Chew Lim Tan 2011

机译：视频文本图像的基于边缘的二值化
8. Optimal binarization of gray-scaled digital images via fuzzy reasoning [R] . 2007

机译：基于模糊推理的灰度数字图像最优二值化

Image Binarization for End-to-End Text Understanding in Natural Images

摘要

著录项

相似文献

相关主题

期刊订阅