Two Stream Deep Network for Document Image Classification

机译：两流深度网络用于文档图像分类

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

This paper presents a novel two-stream approach for document image classification. The proposed approach leverages textual and visual modalities to classify document images into ten categories, including letter, memo, news article, etc. In order to alleviate dependency of textual stream on performance of underlying OCR (which is the case with general content based document image classifiers), we utilize a filter based feature-ranking algorithm. This algorithm ranks the features of each class based on their ability to discriminate document images and selects a set of top 'K' features that are retained for further processing. In parallel, the visual stream uses deep CNN models to extract structural features of document images.Finally, textual and visual streams are concatenated together using an average ensembling method. Experimental results reveal that the proposed approach outperforms the state-of-the-art system with a significant margin of 4.5% on publicly available Tobacco-3482 dataset.

机译：本文提出了一种新颖的两流方法进行文档图像分类。所提出的方法利用文本和视觉方式将文档图像分为十类，包括信件，备忘录，新闻文章等。为了减轻文本流对底层OCR性能的依赖性（在基于常规内容的文档图像中就是这种情况）分类器），我们利用基于过滤器的特征排名算法。该算法根据其区分文档图像的能力对每个类别的特征进行排序，并选择保留的一组顶级“ K”特征以进行进一步处理。并行地，视觉流使用深层CNN模型提取文档图像的结构特征。最后，文本和视觉流使用平均集合方法连接在一起。实验结果表明，所提出的方法优于最新系统，在公开提供的Tobacco-3482数据集上有4.5％的显着优势。

著录项

来源
《International Conference on Document Analysis and Recognition》|2019年|1410-1416|共7页
会议地点
作者
Muhammad Nabeel Asim; Muhammad Usman Ghani Khan; Muhammad Imran Malik; Khizar Razzaque; Andreas Dengel; Sheraz Ahmed;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类
关键词
Feature extraction; Streaming media; Optical character recognition software; Visualization; Convolutional neural networks; Task analysis; Convolution;

机译：特征提取流媒体光学字符识别软件可视化卷积神经网络任务分析卷积;

相似文献

外文文献
中文文献
专利

1. Two-stream feature aggregation deep neural network for scene classification of remote sensing images [J] . Xu Kejie, Huang Hong, Deng Peifang, Information Sciences: An International Journal . 2020,第1期

机译：两流特征聚合深神经网络遥感图像的场景分类
2. Deep Feature Fusion via Two-Stream Convolutional Neural Network for Hyperspectral Image Classification [J] . Li Xian, Ding Mingli, Pizurica Aleksandra IEEE Transactions on Geoscience and Remote Sensing . 2020,第4期

机译：用于高光谱图像分类的两流卷积神经网络深度特征融合
3. DELTA: A deep dual-stream network for multi-label image classification [J] . Yu Wan-Jin, Chen Zhen-Duo, Luo Xin, Pattern Recognition: The Journal of the Pattern Recognition Society . 2019,第期

机译：三角洲：用于多标签图像分类的深层双流网络
4. Two Stream Deep Network for Document Image Classification [C] . Muhammad Nabeel Asim, Muhammad Usman Ghani Khan, Muhammad Imran Malik, International Conference on Document Analysis and Recognition . 2019

机译：用于文档图像分类的两个流深网络
5. Intelligent watermarking of long streams of document images =TATOUAGE INTELLIGENT DE QUANTITéS MASSIVES DE DOCUMENTS NUMERISéS [D] . Vellasques, Eduardo. 2013

机译：长文档图像流的智能水印=大量标准化文档文档
6. Deep learning approach to classification of lung cytological images: Two-step training using actual and synthesized images by progressive growing of generative adversarial networks [O] . Atsushi Teramoto, Tetsuya Tsukamoto, Ayumi Yamada, 2020

机译：深层学习方法肺细胞学图像分类：使用实际和合成图像的两步训练通过逐步生长的生长生长育种网络
7. Document Image Classification with Intra-Domain Transfer Learning and Stacked Generalization of Deep Convolutional Neural Networks [O] . Arindam Das, Saikat Roy, Ujjwal Bhattacharya, 2018

机译：文档图像分类与域内传输学习和深卷积神经网络的堆叠概括

Two Stream Deep Network for Document Image Classification

摘要

著录项

相似文献

相关主题

期刊订阅