首页> 外文会议>IAPR International Conference on Document Analysis and Recognition >Deep Networks for Degraded Document Image Binarization through Pyramid Reconstruction
【24h】

Deep Networks for Degraded Document Image Binarization through Pyramid Reconstruction

机译:通过金字塔重建实现Digraded文档图像二值的深度网络

获取原文

摘要

Binarization of document images is an important processing step for document images analysis and recognition. However, this problem is quite challenging in some cases because of the quality degradation of document images, such as varying illumination, complicated backgrounds, image noises due to ink spots, water stains or document creases. In this paper, we propose a framework based on deep convolutional neural-network (DCNN) for adaptive binarization of degraded document images. The basic idea of our method is to decompose a degraded document image into a spatial pyramid structure by using DCNN, with each layer at different scale. Then the foreground image is sequentially reconstructed from these layers in a coarse-to-fine manner by using deconvolutional network. Such kind of decomposition is quite beneficial, since multi-resolution supervision information can be directly introduced into network learning. We also define several loss functions about label consistency and foregrounds smoothing to further regularize the training of the network. Experimental results demonstrate the effectiveness of the proposed method.
机译:文档图像的二值化是文档图像分析和识别的重要处理步骤。然而,在某些情况下,由于文档图像的质量劣化,例如不同的照明,复杂的背景,由于墨水斑,水渍或文档折痕,图像噪声的质量劣化,这一问题非常具有挑战性。在本文中,我们提出了一种基于深度卷积神经网络(DCNN)的框架,用于降级文档图像的自适应二值。我们的方法的基本思想是通过使用DCNN将劣化的文档图像分解为空间金字塔结构,每个层以不同的刻度。然后,通过使用去卷积网络以粗到精细的方式顺序地重建前景图像。这种分解是非常有益的,因为可以直接引入多分辨率监督信息到网络学习中。我们还定义了有关标签一致性和前景平滑的几个损失功能,以进一步规范网络培训。实验结果表明了该方法的有效性。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号