首页> 外文会议>IAPR International Conference on Document Analysis and Recognition >Selecting Fine-Tuned Features for Layout Analysis of Historical Documents
【24h】

Selecting Fine-Tuned Features for Layout Analysis of Historical Documents

机译:选择微调的特征以进行历史文档的布局分析

获取原文

摘要

In this paper, we investigate fine-tuned features learned by deep neural networks in the context of layout analysis. Pre-training and fine-tuning are techniques used in deep neural networks to learn representations (features) of input. However, it is not clear if the fine-tuned features are all useful for a following classification task. We investigate this problem using feature selection. Firstly, features are learned by a deep neural network, where stacked autoencoders are used for pre-training and then the whole network is fine-tuned. Then, a feature selection method is used to select relevant features for classification. We observe that despite fine-tuning, a significant number of the features are still redundant or irrelevant for layout classification. Furthermore, features from the top layer of the stacked autoencoders are generally more relevant for classification than those from lower layers.
机译:在本文中,我们将研究深度神经网络在布局分析背景下学习到的微调特征。预训练和微调是在深度神经网络中用于学习输入表示(功能)的技术。但是,尚不清楚微调后的功能是否对后续分类任务有用。我们使用功能选择调查此问题。首先,通过深度神经网络学习特征,其中使用堆叠式自动编码器进行预训练,然后对整个网络进行微调。然后,使用特征选择方法来选择用于分类的相关特征。我们观察到,尽管进行了微调,但许多功能对于布局分类仍然是多余的或不相关的。此外,来自堆叠式自动编码器顶层的特征通常比来自较低层的特征更适合分类。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号