Deep Networks for Degraded Document Image Binarization through Pyramid Reconstruction

机译：通过金字塔重建实现Digraded文档图像二值的深度网络

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

Binarization of document images is an important processing step for document images analysis and recognition. However, this problem is quite challenging in some cases because of the quality degradation of document images, such as varying illumination, complicated backgrounds, image noises due to ink spots, water stains or document creases. In this paper, we propose a framework based on deep convolutional neural-network (DCNN) for adaptive binarization of degraded document images. The basic idea of our method is to decompose a degraded document image into a spatial pyramid structure by using DCNN, with each layer at different scale. Then the foreground image is sequentially reconstructed from these layers in a coarse-to-fine manner by using deconvolutional network. Such kind of decomposition is quite beneficial, since multi-resolution supervision information can be directly introduced into network learning. We also define several loss functions about label consistency and foregrounds smoothing to further regularize the training of the network. Experimental results demonstrate the effectiveness of the proposed method.

机译：文档图像的二值化是文档图像分析和识别的重要处理步骤。然而，在某些情况下，由于文档图像的质量劣化，例如不同的照明，复杂的背景，由于墨水斑，水渍或文档折痕，图像噪声的质量劣化，这一问题非常具有挑战性。在本文中，我们提出了一种基于深度卷积神经网络（DCNN）的框架，用于降级文档图像的自适应二值。我们的方法的基本思想是通过使用DCNN将劣化的文档图像分解为空间金字塔结构，每个层以不同的刻度。然后，通过使用去卷积网络以粗到精细的方式顺序地重建前景图像。这种分解是非常有益的，因为可以直接引入多分辨率监督信息到网络学习中。我们还定义了有关标签一致性和前景平滑的几个损失功能，以进一步规范网络培训。实验结果表明了该方法的有效性。

著录项

来源
《IAPR International Conference on Document Analysis and Recognition》|2017年|732p|共6页
会议地点
作者
Gaofeng Meng; Kun Yuan; Ying Wu; Shiming Xiang; Chunhong Pan;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类 TP391.41-53;
关键词
Image reconstruction; Training; Feature extraction; Image resolution; Histograms; Testing; Ink;

机译：图像重建;训练;特征提取;图像分辨率;直方图;测试;墨水;

相似文献

外文文献
中文文献
专利

1. Binarization of degraded document images based on hierarchical deep supervised network [J] . Quang Nhat Vo, Soo Hyung Kim, Hyung Jeong Yang, Pattern Recognition: The Journal of the Pattern Recognition Society . 2018,第期

机译：基于分层深度监督网络的降级文档图像二值化
2. Nonlinear diffusion equation with selective source for binarization of degraded document images [J] . Zhongjie Du, Chuanjiang He Applied Mathematical Modelling . 2021,第Nova期

机译：非线性扩散方程，具有可降解文档图像二值化的选择性源
3. Binarization of degraded document images with global-local U-Nets [J] . Huang Xiao, Li Lin, Liu Rong, Optik: Zeitschrift fur Licht- und Elektronenoptik: = Journal for Light-and Electronoptic . 2020,第期

机译：具有全球局部U-Net的降级文档图像二值化
4. Deep Networks for Degraded Document Image Binarization through Pyramid Reconstruction [C] . Gaofeng Meng, Kun Yuan, Ying Wu, IAPR International Conference on Document Analysis and Recognition . 2017

机译：通过金字塔重构进行深度网络降级的文档图像二值化
5. Effective and efficient binarization of degraded document images. [D] . Parker, Jon Ivan. 2016

机译：对退化的文档图像进行有效和高效的二值化。
6. Reference-Driven Compressed Sensing MR Image Reconstruction Using Deep Convolutional Neural Networks without Pre-Training [O] . Di Zhao, Feng Zhao, Yongjin Gan 2020

机译：使用深度卷积神经网络的无预训练的参考驱动压缩感测MR图像重建
7. Binarization of Degraded Document Images Using Convolutional Neural Networks and Wavelet-Based Multichannel Images [O] . Younes Akbari, Somaya Al-Maadeed, Kalthoum Adam 2020

机译：使用卷积神经网络和基于小波的多通道图像的降级文档图像的二值化

Deep Networks for Degraded Document Image Binarization through Pyramid Reconstruction

摘要

著录项

相似文献

相关主题

期刊订阅