Selecting Fine-Tuned Features for Layout Analysis of Historical Documents

机译：选择微调的特征以进行历史文档的布局分析

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

In this paper, we investigate fine-tuned features learned by deep neural networks in the context of layout analysis. Pre-training and fine-tuning are techniques used in deep neural networks to learn representations (features) of input. However, it is not clear if the fine-tuned features are all useful for a following classification task. We investigate this problem using feature selection. Firstly, features are learned by a deep neural network, where stacked autoencoders are used for pre-training and then the whole network is fine-tuned. Then, a feature selection method is used to select relevant features for classification. We observe that despite fine-tuning, a significant number of the features are still redundant or irrelevant for layout classification. Furthermore, features from the top layer of the stacked autoencoders are generally more relevant for classification than those from lower layers.

机译：在本文中，我们将研究深度神经网络在布局分析背景下学习到的微调特征。预训练和微调是在深度神经网络中用于学习输入表示（功能）的技术。但是，尚不清楚微调后的功能是否对后续分类任务有用。我们使用功能选择调查此问题。首先，通过深度神经网络学习特征，其中使用堆叠式自动编码器进行预训练，然后对整个网络进行微调。然后，使用特征选择方法来选择用于分类的相关特征。我们观察到，尽管进行了微调，但许多功能对于布局分类仍然是多余的或不相关的。此外，来自堆叠式自动编码器顶层的特征通常比来自较低层的特征更适合分类。

著录项

来源
《IAPR International Conference on Document Analysis and Recognition》|2017年|281-286|共6页
会议地点
作者
Hao Wei; Mathias Seuret; Marcus Liwicki; Rolf Ingold; Pei Fu;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类
关键词
Feature extraction; Neural networks; Layout; Training; Decoding; Task analysis; Support vector machines;

机译：特征提取;神经网络;布局;训练;解码;任务分析;支持向量机;

相似文献

外文文献
中文文献
专利

1. Historical document layout analysis using anisotropic diffusion and geometric features [J] . Galal M.BinMakhashen, Sabri A.Mahmoud International journal on digital libraries . 2020,第3期

机译：使用各向异性扩散和几何特征的历史文献布局分析
2. Comparative Study of Layout Analysis of Tabulated Historical Documents [J] . Liang Xusheng, Cheddad Abbas, Hall Johan Big Data Research . 2021,第1期

机译：表现历史文献布局分析的比较研究
3. Analysis of Water Retention Changes in Selected Lake-Wetland Catchments of West Polesie Based on Historical Documents [J] . Katarzyna Mi?siak-Wójcik Limnological Review . 2018,第2期

机译：基于历史文献的西波利西湖湿地集水区持水量变化分析
4. Selecting Fine-Tuned Features for Layout Analysis of Historical Documents [C] . Hao Wei, Mathias Seuret, Marcus Liwicki, IAPR International Conference on Document Analysis and Recognition . 2017

机译：选择历史文档布局分析的微调功能
5. Analysis of selected geometry and measurement learning expectations in some Asian countries and United States States as specified in national and state curriculum documents. [D] . Chen, Jung-Chih. 2005

机译：根据国家和州课程文件中指定的内容，分析某些亚洲国家和美国对选定的几何形状和测量学习的期望。
6. Sugar industry sponsorship of germ-free rodent studies linking sucrose to hyperlipidemia and cancer: An historical analysis of internal documents [O] . Cristin E. Kearns, Dorie Apollonio, Stanton A. Glantz 2017

机译：制糖业赞助的无糖啮齿动物研究将蔗糖与高脂血症和癌症联系起来：内部文献的历史分析
7. Multi-task Layout Analysis for Historical Handwritten Documents Using Fully Convolutional Networks [O] . Yue Xu, Fei Yin, Zhaoxiang Zhang, 2018

机译：使用完全卷积网络的历史手写文档的多任务布局分析

Selecting Fine-Tuned Features for Layout Analysis of Historical Documents

摘要

著录项

相似文献

相关主题

期刊订阅