Real-Time Document Image Classification Using Deep CNN and Extreme Learning Machines


Abstract

This paper presents an approach to real-time training and testing for document image classification. In production environments, training must be both accurate and time-efficient. Existing deep learning approaches for classifying documents do not meet these requirements, because training and fine-tuning the deep architectures takes a long time. Motivated by work in computer vision, we propose a two-stage approach: in the first stage, a deep network is trained to serve as a feature extractor; in the second stage, Extreme Learning Machines (ELMs) are used for classification. The proposed approach outperforms all previously reported structural and deep learning based methods, with a final accuracy of 83.24% on the Tobacco-3482 dataset, a relative error reduction of 25% compared to a previous Convolutional Neural Network (CNN) based approach (DeepDocClassifier). More importantly, the training time of the ELM is only 1.176 seconds, and the overall prediction time for 2,482 images is 3.066 seconds. As such, this novel approach makes deep learning-based document classification suitable for large-scale real-time applications.
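The second stage described in the abstract can be illustrated with a minimal ELM sketch. This is not the authors' implementation: the class name `ELMClassifier`, the hidden-layer size, the tanh activation, the ridge term `reg`, and the synthetic Gaussian features standing in for CNN features are all assumptions made for illustration. The sketch shows why ELM training is fast: the hidden layer is random and fixed, so only the output weights are fitted, with a single closed-form linear solve.

```python
import numpy as np


class ELMClassifier:
    """Single-hidden-layer ELM: random fixed hidden layer, closed-form output layer."""

    def __init__(self, n_hidden=128, reg=1e-2, seed=0):
        self.n_hidden = n_hidden  # number of random hidden units (assumed value)
        self.reg = reg            # ridge regularisation strength (assumed value)
        self.seed = seed

    def _hidden(self, X):
        # Random projection followed by a fixed nonlinearity.
        return np.tanh(X @ self.W + self.b)

    def fit(self, X, y):
        rng = np.random.default_rng(self.seed)
        n_features = X.shape[1]
        self.classes_ = np.unique(y)
        # Hidden weights are drawn once and never trained.
        self.W = rng.normal(size=(n_features, self.n_hidden)) / np.sqrt(n_features)
        self.b = rng.normal(size=self.n_hidden)
        H = self._hidden(X)
        # One-hot targets, one column per class.
        T = (y[:, None] == self.classes_[None, :]).astype(float)
        # Ridge-regularised least squares: (H^T H + reg * I) beta = H^T T.
        A = H.T @ H + self.reg * np.eye(self.n_hidden)
        self.beta = np.linalg.solve(A, H.T @ T)
        return self

    def predict(self, X):
        scores = self._hidden(X) @ self.beta
        return self.classes_[np.argmax(scores, axis=1)]


# Synthetic stand-in for CNN features: three well-separated Gaussian clusters.
rng = np.random.default_rng(42)
n_classes, n_features = 3, 16
centers = rng.normal(scale=3.0, size=(n_classes, n_features))


def make_split(n):
    y = rng.integers(0, n_classes, size=n)
    X = centers[y] + rng.normal(size=(n, n_features))
    return X, y


X_train, y_train = make_split(300)
X_test, y_test = make_split(200)

clf = ELMClassifier().fit(X_train, y_train)
accuracy = float(np.mean(clf.predict(X_test) == y_test))
print(f"test accuracy: {accuracy:.3f}")
```

In a real pipeline, `X_train` and `X_test` would hold feature vectors taken from the trained deep network rather than synthetic clusters; the ELM code itself would stay the same.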


Bibliographic details

Source
IAPR International Conference on Document Analysis and Recognition, 2017, pp. 1318-1323 (6 pages)
Authors
Andreas Kölsch; Muhammad Zeshan Afzal; Markus Ebbecke; Marcus Liwicki
Original format
PDF
Keywords
Feature extraction; Training; Real-time systems; Layout; Training data; Convolutional neural networks

