首页> 外文会议>IAPR International Conference on Document Analysis and Recognition >Using Convolutional Encoder-Decoder for Document Image Binarization
【24h】

Using Convolutional Encoder-Decoder for Document Image Binarization

机译:使用卷积编码器解码器进行文档图像二值化

获取原文

摘要

Document image binarization is one of the critical initial steps for document analysis and understanding. Previous work mostly focused on exploiting hand-crafted features to build statistical models for distinguishing text from background. However, these approaches only achieved limited success because: (a) the effectiveness of hand-crafted features is limited by the researcher's domain knowledge and understanding on the documents, and (b) a universal model cannot always capture the complexity of different document degradations. In order to address these challenges, we propose a convolutional encoder-decoder model with deep learning for document image binarization in this paper. In the proposed method, mid-level document image representations are learnt by a stack of convolutional layers, which compose the encoder in this architecture. Then the binarization image is obtained by mapping low resolution representations to the original size through the decoder, which is composed by a series of transposed convolutional layers. We compare the proposed binarization method with other binarization algorithms both qualitatively and quantitatively on the public dataset. The experimental results show that the proposed method has comparable performance to the other hand-crafted binarization approaches and has more generalization capabilities with limited in-domain training data.
机译:文档图像二值化是文档分析和理解的关键初始步骤之一。以前的工作主要集中在利用手工制作的功能来构建统计模型,以区分从背景中的文本。然而,这些方法只取得了有限的成功,因为:(a)手工制作功能的有效性受到研究人员的域知识和对文件的理解的限制,(b)通用模型不能总是捕捉不同文档降级的复杂性。为了解决这些挑战,我们提出了一种卷积编码器 - 解码器模型,对本文进行了深入学习的文档图像二值化。在所提出的方法中,中级文档图像表示由一堆卷积层学习,该卷积层,该层组成在此架构中的编码器。然后通过将低分辨率表示通过解码器将低分辨率表示映射到原始尺寸来获得二值化图像,该解码器由一系列转档卷积层组成。我们将建议的二值化方法与其他二值化算法进行比较,在公共数据集上定性和定量。实验结果表明,该方法对其他手工制作的二值化方法具有相当的性能,具有更多泛化能力,具有有限的域培训数据。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号