Multimodal Document Image Classification


Abstract

State-of-the-art methods for document image classification rely on visual features extracted by deep convolutional neural networks (CNNs). These methods do not utilize rich semantic information present in the text of the document, which can be extracted using Optical Character Recognition (OCR). We first study the performance of state-of-the-art text classification approaches when applied to noisy text obtained from OCR. We then show that fusing this textual information with visual CNN methods produces state-of-the-art results on the RVL-CDIP classification dataset.
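The fusion idea in the abstract can be illustrated with a minimal late-fusion sketch. It assumes each modality (a visual CNN and a text classifier run on OCR output) already produces one score per class, and it combines them by a weighted average of softmax probabilities. This is one common fusion scheme chosen for illustration, not necessarily the one the authors use; the function names and the `text_weight` parameter are hypothetical.

```python
import math


def softmax(logits):
    """Convert raw class scores into probabilities (numerically stable)."""
    m = max(logits)
    exps = [math.exp(x - m) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]


def late_fusion(visual_logits, text_logits, text_weight=0.5):
    """Weighted average of per-class probabilities from two modalities.

    visual_logits: class scores from an image model (assumed given).
    text_logits:   class scores from a text model on OCR output (assumed given).
    text_weight:   hypothetical mixing weight in [0, 1] for the text branch.
    """
    if len(visual_logits) != len(text_logits):
        raise ValueError("both modalities must score the same set of classes")
    if not 0.0 <= text_weight <= 1.0:
        raise ValueError("text_weight must lie in [0, 1]")
    p_visual = softmax(visual_logits)
    p_text = softmax(text_logits)
    return [
        (1.0 - text_weight) * pv + text_weight * pt
        for pv, pt in zip(p_visual, p_text)
    ]


def predict(visual_logits, text_logits, text_weight=0.5):
    """Return the index of the class with the highest fused probability."""
    fused = late_fusion(visual_logits, text_logits, text_weight)
    return max(range(len(fused)), key=fused.__getitem__)
```

With `text_weight=0.0` the prediction falls back to the visual model alone, which makes it easy to compare a fused classifier against a vision-only baseline on the same inputs.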

Bibliographic details

Source: International Conference on Document Analysis and Recognition, 2019, pp. 71-77 (7 pages)
Authors: Rajiv Jain; Curtis Wigington
Original format: PDF
Keywords: Hafnium; 6G mobile communication; Text analysis
Date indexed: 2022-08-26 14:34:37

Similar literature


1. Multimodal page classification in administrative document image streams [J]. Marçal Rusiñol, Volkmar Frinken, Dimosthenis Karatzas. International Journal on Document Analysis and Recognition, 2014, No. 4

2. Multimodality imaging in takotsubo syndrome: a joint consensus document of the European Association of Cardiovascular Imaging (EACVI) and the Japanese Society of Echocardiography (JSE) [J]. Rodolfo Citro, Hiroyuki Okura, Jelena R Ghadri. Journal of echocardiography, 2020, No. 4

3. Correction to: Multimodality imaging in takotsubo syndrome: a joint consensus document of the European Association of Cardiovascular Imaging (EACVI) and the Japanese Society of Echocardiography (JSE) [J]. Rodolfo Citro, Hiroyuki Okura, Jelena R Ghadri. Journal of echocardiography, 2020, No. 4

4. Multimodal Classification of Document Embedded Images [C]. Matheus Viana, Quoc-Bao Nguyen, John Smith. IAPR International Workshop on Graphics Recognition, 2018

5. Simultaneous image classification and annotation via fusing multimodal heterogeneous image features [D]. Wacker, Taylor. 2014

6. A two-step registration-classification approach to automated segmentation of multimodal images for high-throughput greenhouse plant phenotyping [O]. Michael Henke, Astrid Junker, Kerstin Neumann. 2020

7. Complex Document Classification and Localization Application on Identity Document Images [O]. Awal, Ahmad-Montaser; Ghanmi, Nabil; Sicre, Ronan. 2017

8. Multimodal Task-Driven Dictionary Learning for Image Classification [R]. Bahrampour, S. 2015
