首页> 外文会议>IEEE International Conference on Image Processing >Improving Document Binarization Via Adversarial Noise-Texture Augmentation
【24h】

Improving Document Binarization Via Adversarial Noise-Texture Augmentation

机译:通过对抗性噪声纹理增强来改善文档二值化

获取原文

摘要

Binarization of degraded document images is an elementary step in most problems involving document image analysis. The paper re-visits the binarization problem by introducing an adversarial learning approach. We construct a Texture Augmentation Network that transfers the texture element of a degraded reference document image to a clean binary image. In this way, the network creates multiple versions of the same textual content with various noisy textures, thus enlarging the available document binarization datasets. Finally, the newly generated images are passed through a Binarization network to get back the clean version. By jointly training the two networks we can increase the adversarial robustness of our system. The most significant contribution of our framework is that it does not require any paired data unlike other Deep Learning-based methods [1], [2], [3]. Such a novel approach has never been implemented earlier thus making it the very first of its kind in Document Image Analysis community. Experimental results suggest that the proposed method1 achieves superior performance over widely used DIBCO datasets.
机译:降级的文档图像的二值化是涉及文档图像分析的大多数问题中的基本步骤。本文通过引入对抗性学习方法来重新审视二值化问题。我们构建了一个纹理增强网络,该网络将降级的参考文档图像的纹理元素传输到干净的二进制图像。以这种方式,网络创建具有各种噪声纹理的相同文本内容的多个版本,从而扩大了可用的文档二值化数据集。最后,新生成的图像将通过Binarization网络传递,以获取干净版本。通过联合训练两个网络,我们可以提高系统的对抗性。与其他基于深度学习的方法[1],[2],[3]不同,我们框架最重要的贡献在于它不需要任何配对数据。这种新颖的方法以前从未实现过,因此使其成为“文档图像分析”社区中的此类方法中的第一个。实验结果表明,提出的方法 1 与广泛使用的DIBCO数据集相比,具有卓越的性能。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号