Beyond OCRs for Document Blur Estimation

机译：超越OCR的文档模糊估计

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

The current document blur/quality estimation algorithms rely on the OCR accuracy to measure their success. A sharp document image, however, at times may yield lower OCR accuracy owing to factors independent of blur or quality of capture. The necessity to rely on OCR is mainly due to the difficulty in quantifying the quality otherwise. In this work, we overcome this limitation by proposing a novel dataset for document blur estimation, for which we physically quantify the blur using a capture set-up which computationally varies the focal distance of the camera. We also present a selective search mechanism to improve upon the recently successful patch-based learning approaches (using codebooks or convolutional neural networks). We present a thorough analysis of the improved blur estimation pipeline using correlation with OCR accuracy as well as the actual amount of blur. Our experiments demonstrate that our method outperforms the current state-of-the-art by a significant margin.

机译：当前的文档模糊/质量估计算法依靠OCR准确性来衡量其成功。但是，由于与模糊或捕获质量无关的因素，清晰的文档图像有时可能会产生较低的OCR精度。依赖OCR的必要性主要是由于否则难以量化质量。在这项工作中，我们通过提出一种用于文档模糊估计的新颖数据集来克服此限制，为此，我们使用捕获设置物理上量化模糊，该捕获设置在计算上改变了相机的焦距。我们还提出了一种选择性搜索机制，以改进最近成功的基于补丁的学习方法（使用码本或卷积神经网络）。我们使用与OCR精度的相关性以及实际的模糊量，对改进的模糊估计流水线进行了全面的分析。我们的实验表明，我们的方法大大优于当前的最新技术。

著录项

来源
《IAPR International Conference on Document Analysis and Recognition》|2017年|1101-1107|共7页
会议地点
作者
Pranjal Kumar Rai; Sajal Maheshwari; Ishit Mehta; Parikshit Sakurikar; Vineet Gandhi;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类
关键词
Optical character recognition software; Cameras; Lenses; Estimation; Pipelines; Image quality; Correlation;

机译：光学字符识别软件;相机;镜头;估计;管道;图像质量;相关性;

相似文献

外文文献
中文文献
专利

1. Adaptive fuzzy model for blur estimation on document images [J] . Kieu Van Cuong, Cloppet Florence, Vincent Nicole Pattern recognition letters . 2017,第Jana15期

机译：用于文档图像模糊估计的自适应模糊模型
2. Blur kernel estimation of noisy-blurred image via dynamic structure prior [J] . Chen Xueling, Zhu Yu, Liu Wei, Neurocomputing . 2020,第Auga25期

机译：通过现有动态结构模糊噪声模糊图像的内核估计
3. Research on Cross-Correlative Blur Length Estimation Algorithm in Motion Blur Image [J] . Li Dongming, Su Zhengbo, Su Wei, Journal of Advanced Computatioanl Intelligence and Intelligent Informatics . 2016,第1a114期

机译：运动模糊图像中互相关模糊长度估计算法的研究
4. Beyond OCRs for Document Blur Estimation [C] . Pranjal Kumar Rai, Sajal Maheshwari, Ishit Mehta, IAPR International Conference on Document Analysis and Recognition . 2017

机译：超越OCRS文件模糊估计
5. A hybrid two-dimensional HMM and MLP OCR system for processing multi-font and low-quality English documents. [D] . Fu, Nenghong. 2004

机译：混合的二维HMM和MLP OCR系统，用于处理多字体和低质量的英语文档。
6. Towards Mobile OCR: How To Take a Good Picture of a Document Without Sight [O] . Michael Cutter, Roberto Manduchi -1

机译：迈向移动OCR：如何在无视的情况下对文档进行良好的拍摄
7. A novel approach for skew estimation of document images in OCR system [O] . Sarfraz M., Zidouri A., Shahab S.A. 2005

机译：一种新的OCR系统中文档图像偏斜估计方法

Beyond OCRs for Document Blur Estimation

摘要

著录项

相似文献

相关主题

期刊订阅