Visual Saliency and Terminology Extraction for Document Classification


Abstract

Document digitization has become a crucial economic issue in our society, which makes it necessary to organize the resulting huge volume of documents. This paper proposes a new method that classifies documents automatically by combining a saliency-based segmentation process with terminology extraction and annotation. The saliency-based segmentation extracts salient regions, and thereby logos, while the terminology approach annotates these regions and automatically classifies the document. The approach requires no human expertise and uses Google Images as a knowledge database. Results obtained on a real database of 1766 documents show the relevance of the approach.

Bibliographic details

Source
International Workshop on Graphics Recognition | 2014 | pp. 96-108 | 13 pages
Authors
Duthil Benjamin; Coustaty Mickael; Courboulay Vincent; Jean-Marc Ogier
Original format PDF

Similar literature

1. A visual attention-based keyword extraction for document classification [J]. Wu Xing, Du Zhikang, Guo Yike. Multimedia Tools and Applications. 2018, No. 19
2. Saliency Cuts: Salient Region Extraction based on Local Adaptive Thresholding for Image Information Recognition of the Visually Impaired [J]. Mukhiddinov Mukhriddin, Jeong Rag-Gyo, Cho Jinsoo. The International Arab Journal of Information Technology. 2020, No. 5
3. Using Ontology To Improve Precision Of Terminology Extraction From Documents [J]. Wen Zhang, Taketoshi Yoshida, Xijin Tang. Expert Systems with Applications. 2009, No. 5
4. Document page similarity based on layout visual saliency: Application to query by example and document classification [C]. Veronique EGLIN, Stephane BRES. International Conference on Document Analysis and Recognition. 2003
5. Multi-Word Terminology Extraction and Its Role in Document Embedding [D]. Kulkarni, Jayanth Prakash. 2021
6. Objects Classification by Learning-Based Visual Saliency Model and Convolutional Neural Network [O]. Na Li, Xinbo Zhao, Yongjia Yang. 2016
7. Fusion of Multiple Visual Cues for Visual Saliency Extraction from Wearable Camera Settings with Strong Motion [O]. Boujut, Hugo; Benois-Pineau, Jenny; Mégret, Rémi. 2012
