首页> 外文会议>IEEE International Conference on Acoustics, Speech and Signal Processing >Hybrid multi-layer deep CNN/aggregator feature for image classification
【24h】

Hybrid multi-layer deep CNN/aggregator feature for image classification

机译:多层多层深CNN /聚合器功能用于图像分类

获取原文

摘要

Deep Convolutional Neural Networks (DCNN) have established a remarkable performance benchmark in the field of image classification, displacing classical approaches based on hand-tailored aggregations of local descriptors. Yet DCNNs impose high computational burdens both at training and at testing time, and training them requires collecting and annotating large amounts of training data. Supervised adaptation methods have been proposed in the literature that partially re-learn a transferred DCNN structure from a new target dataset. Yet these require expensive bounding-box annotations and are still computationally expensive to learn. In this paper, we address these shortcomings of DCNN adaptation schemes by proposing a hybrid approach that combines conventional, unsupervised aggregators such as Bag-of-Words (BoW), with the DCNN pipeline by treating the output of intermediate layers as densely extracted local descriptors. We test a variant of our approach that uses only intermediate DCNN layers on the standard PASCAL VOC 2007 dataset and show performance significantly higher than the standard BoW model and comparable to Fisher vector aggregation but with a feature that is 150 times smaller. A second variant of our approach that includes the fully connected DCNN layers significantly outperforms Fisher vector schemes and performs comparably to DCNN approaches adapted to Pascal VOC 2007, yet at only a small fraction of the training and testing cost.
机译:深度卷积神经网络(DCNN)在图像分类领域建立了卓越的性能基准,取代了基于手工定制的局部描述符聚合的经典方法。然而,DCNN在训练和测试时都施加了很高的计算负担,对它们进行训练需要收集和注释大量的训练数据。在文献中提出了有监督的适应方法,其从新的目标数据集中部分地重新学习了转移的DCNN结构。然而,这些需要昂贵的边界框注释,并且学习仍然在计算上昂贵。在本文中,我们通过提出一种混合方法来解决DCNN自适应方案的这些缺点,该方法将传统的无监督聚合器(如词袋(BoW))与DCNN管道相结合,方法是将中间层的输出视为密集提取的本地描述符。我们测试了一种方法的变体,该变体仅在标准PASCAL VOC 2007数据集上使用中间DCNN层,并显示出明显高于标准BoW模型的性能,并且可与Fisher向量聚合相媲美,但功能却小了150倍。我们的方法的第二种变体包括完全连接的DCNN层,其性能明显优于Fisher向量方案,并且与适用于Pascal VOC 2007的DCNN方法相比具有相当的性能,但培训和测试成本却很小。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号