首页> 外文会议>IAPR International Conference on Document Analysis and Recognition >Unsupervised Feature Learning for Writer Identification and Writer Retrieval
【24h】

Unsupervised Feature Learning for Writer Identification and Writer Retrieval

机译:用于作者识别和作者检索的无监督特征学习

获取原文

摘要

Deep Convolutional Neural Networks (CNN) have shown great success in supervised classification tasks such as character classification or dating. Deep learning methods typically need a lot of annotated training data, which is not available in many scenarios. In these cases, traditional methods are often better than or equivalent to deep learning methods. In this paper, we propose a simple, yet effective, way to learn CNN activation features in an unsupervised manner. Therefore, we train a deep residual network using surrogate classes. The surrogate classes are created by clustering the training dataset, where each cluster index represents one surrogate class. The activations from the penultimate CNN layer serve as features for subsequent classification tasks. We evaluate the feature representations on two publicly available datasets. The focus lies on the ICDAR17 competition dataset on historical document writer identification (Historical-WI). We show that the activation features we trained without supervision are superior to descriptors of state-of-the-art writer identification methods. Additionally, we achieve comparable results in the case of handwriting classification using the ICFHR16 competition dataset on historical Latin script types (CLaMM16).
机译:深度卷积神经网络(CNN)在有监督的分类任务(例如字符分类或约会)中显示出巨大的成功。深度学习方法通​​常需要大量带注释的训练数据,这在许多情况下是不可用的。在这些情况下,传统方法通常优于或等同于深度学习方法。在本文中,我们提出了一种简单而有效的方式,以无监督的方式学习CNN激活功能。因此,我们使用代理类训练深度残差网络。代理类是通过对训练数据集进行聚类创建的,其中每个聚类索引代表一个代理类。来自倒数第二个CNN层的激活充当后续分类任务的功能。我们在两个公开可用的数据集上评估特征表示。重点是关于历史文档作者识别(Historical-WI)的ICDAR17竞赛数据集。我们表明,我们在没有监督的情况下训练的激活功能要优于最先进的作者识别方法的描述符。此外,在使用ICFHR16竞争数据集(基于历史拉丁文字类型(CLaMM16))进行笔迹分类的情况下,我们获得了可比的结果。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号