首页> 外文会议>IAPR International Conference on Document Analysis and Recognition >Document Image Binarization with Fully Convolutional Neural Networks
【24h】

Document Image Binarization with Fully Convolutional Neural Networks

机译:全卷积神经网络的文档图像二值化

获取原文
获取外文期刊封面目录资料

摘要

Binarization of degraded historical manuscript images is an important pre-processing step for many document processing tasks. We formulate binarization as a pixel classification learning task and apply a novel Fully Convolutional Network (FCN) architecture that operates at multiple image scales, including full resolution. The FCN is trained to optimize a continuous version of the Pseudo F-measure metric and an ensemble of FCNs outperform the competition winners on 4 of 7 DIBCO competitions. This same binarization technique can also be applied to different domains such as Palm Leaf Manuscripts with good performance. We analyze the performance of the proposed model w.r.t. the architectural hyperparameters, size and diversity of training data, and the input features chosen.
机译:降级的历史手稿图像的二值化是许多文档处理任务的重要预处理步骤。我们将二值化公式化为像素分类学习任务,并应用了一种新颖的全卷积网络(FCN)架构,该架构可在多种图像比例(包括全分辨率)下运行。 FCN经过培训,可以优化Pseudo F-measure指标的连续版本,并且FCN的整体表现优于7个DIBCO比赛中的4个比赛获胜者。这种相同的二值化技术也可以应用于具有良好性能的不同领域,例如“棕榈叶手稿”。我们分析建议模型w.r.t.的性能架构的超参数,训练数据的大小和多样性以及选择的输入功能。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号