首页> 外文会议>IAPR International Conference on Document Analysis and Recognition >Real-Time Document Image Classification Using Deep CNN and Extreme Learning Machines
【24h】

Real-Time Document Image Classification Using Deep CNN and Extreme Learning Machines

机译:使用深度CNN和极限学习机进行实时文档图像分类

获取原文
获取外文期刊封面目录资料

摘要

This paper presents an approach for real-time training and testing for document image classification. In production environments, it is crucial to perform accurate and (time-)efficient training. Existing deep learning approaches for classifying documents do not meet these requirements, as they require much time for training and fine-tuning the deep architectures. Motivated from Computer Vision, we propose a two-stage approach. The first stage trains a deep network that works as feature extractor and in the second stage, Extreme Learning Machines (ELMs) are used for classification. The proposed approach outperforms all previously reported structural and deep learning based methods with a final accuracy of 83.24% on Tobacco-3482 dataset, leading to a relative error reduction of 25% when compared to a previous Convolutional Neural Network (CNN) based approach (DeepDocClassifier). More importantly, the training time of the ELM is only 1.176 seconds and the overall prediction time for 2,482 images is 3.066 seconds. As such, this novel approach makes deep learning-based document classification suitable for large-scale real-time applications.
机译:本文提出了一种用于文档图像分类的实时培训和测试方法。在生产环境中,执行准确且(时间)高效的培训至关重要。现有的用于文档分类的深度学习方法不满足这些要求,因为它们需要大量时间来训练和微调深度架构。从计算机视觉的动机出发,我们提出了一种两阶段的方法。第一阶段训练用作特征提取器的深度网络,第二阶段使用极限学习机(ELM)进行分类。拟议的方法优于所有先前报告的基于结构和深度学习的方法,在Tobacco-3482数据集上的最终精度为83.24%,与以前的基于卷积神经网络(CNN)的方法相比,相对误差减少了25% (DeepDocClassifier)。更重要的是,ELM的训练时间仅为1.176秒,而2,482张图像的总预测时间为3.066秒。因此,这种新颖的方法使基于深度学习的文档分类适用于大规模实时应用。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号