首页> 外文会议>IAPR International Conference on Document Analysis and Recognition >Real-Time Document Localization in Natural Images by Recursive Application of a CNN
【24h】

Real-Time Document Localization in Natural Images by Recursive Application of a CNN

机译:通过CNN的递归应用对自然图像进行实时文档定位

获取原文

摘要

Seamless integration of information from digital and paper documents is crucial for efficient knowledge management. One convenient way to achieve this is to digitize a document from a natural image. This requires precise localization of the document in the image. Several methods have been proposed to solve this problem but they rely on traditional image processing techniques which are not robust to extreme viewpoint and background variations. Deep Convolutional Neural Networks (CNNs), on the other hand, have shown to be extremely robust to variations in background and viewpoint in object detection and classification tasks. Inspired by their robustness and generality, we propose a novel CNN based method to accurately localize documents in real-time. We model localization problem as a key point detection problem. The four corners of the documents are jointly predicted by a Deep Convolutional Neural Network. We then refine our prediction using a novel recursive application of a CNN. Performance of the system is evaluated on ICDAR 2015 SmartDoc Competition 1 dataset. The results are comparable to state of the art on simple backgrounds and improve the state of the art to 94% from the previous 86% on the complex background. Code, dataset, and models are available at: https://github.com/KhurramJaved96/Recursive-CNNs.
机译:来自数字和纸质文档的信息的无缝集成对于有效的知识管理至关重要。实现此目的的一种简便方法是从自然图像中数字化文档。这需要在图像中精确定位文档。已经提出了几种方法来解决该问题,但是它们依赖于传统的图像处理技术,该技术对于极端的视点和背景变化不稳健。另一方面,深度卷积神经网络(CNN)已显示出对物体检测和分类任务中背景和视点变化的强大支持。受它们的健壮性和通用性启发,我们提出了一种基于CNN的新颖方法来实时准确地定位文档。我们将定位问题建模为关键点检测问题。深度卷积神经网络共同预测了文档的四个角。然后,我们使用CNN的新型递归应用完善我们的预测。系统的性能在ICDAR 2015 SmartDoc Competition 1数据集中进行了评估。结果可与简单背景上的最新技术相媲美,并将当前水平从复杂背景上的86%提高到94%。代码,数据集和模型可在以下网址获得:https://github.com/KhurramJaved96/Recursive-CNNs。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号