首页> 外文会议>IAPR International Conference on Document Analysis and Recognition >Cutting the Error by Half: Investigation of Very Deep CNN and Advanced Training Strategies for Document Image Classification
【24h】

Cutting the Error by Half: Investigation of Very Deep CNN and Advanced Training Strategies for Document Image Classification

机译:将错误减少一半:非常深的CNN和文档图像分类的高级培训策略的研究

获取原文

摘要

We present an exhaustive investigation of recent Deep Learning architectures, algorithms, and strategies for the task of document image classification to finally reduce the error by more than half. Existing approaches, such as the DeepDoc-Classifier, apply standard Convolutional Network architectures with transfer learning from the object recognition domain. The contribution of the paper is threefold: First, it investigates recently introduced very deep neural network architectures (GoogLeNet, VGG, ResNet) using transfer learning (from real images). Second, it proposes transfer learning from a huge set of document images, i.e. 400; 000 documents. Third, it analyzes the impact of the amount of training data (document images) and other parameters to the classification abilities. We use two datasets, the Tobacco-3482 and the large-scale RVL-CDIP dataset. We achieve an accuracy of 91:13% for the Tobacco-3482 dataset while earlier approaches reach only 77:6%. Thus, a relative error reduction of more than 60% is achieved. For the large dataset RVL-CDIP, an accuracy of 90:97% is achieved, corresponding to a relative error reduction of 11:5%.
机译:我们将对最近的深度学习架构,算法和策略进行详尽的调查,以完成文档图像分类任务,以最终将错误减少一半以上。诸如DeepDoc-Classifier之类的现有方法将标准的卷积网络体系结构与从对象识别域进行的转移学习一起应用。论文的贡献有三点:首先,它使用转移学习(来自真实图像)研究了最近介绍的非常深的神经网络体系结构(GoogLeNet,VGG,ResNet)。其次,它建议从大量文档图像(即400张文档)中进行转移学习。 000个文档。第三,分析训练数据量(文档图像)和其他参数对分类能力的影响。我们使用两个数据集,即Tobacco-3482和大规模RVL-CDIP数据集。对于Tobacco-3482数据集,我们达到91:13 \%的准确性,而更早的方法仅达到77:6 \%。因此,实现了超过60%的相对误差减少。对于大型数据集RVL-CDIP,可实现90:97 \%的精度,对应于11:5 \%的相对误差减少。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号