首页> 外文期刊>Neural processing letters >A Comprehensive Study on VLAD
【24h】

A Comprehensive Study on VLAD

机译:瓦德综合研究

获取原文
获取原文并翻译 | 示例
获取外文期刊封面目录资料

摘要

Recently, the vector of locally aggregated descriptor (VLAD) has shown its great effectiveness in diverse computer vision tasks including image retrieval, Scene classification, and action recognition. Its great success stems from its powerful representation ability and computational efficiency. However, it remains unclear about its theoretical foundation and how it is connected to basic while important algorithms, e.g., the bag-of-words model and match kernels, and how its performance is affected by parameter configurations, e.g., normalization and pooling, which are also widely used in state-of-the-art algorithms based on local features. In this paper, with an aim to achieve the full capacity of VLAD, we conduct a comprehensive and in-depth study from both theoretical analysis and experimental practice perspectives. As a theoretical contribution, we provide a new formulation of VLAD via match kernels, which serves to connect VLAD with existing important encoding methods based on local features. As a contribution to the practical use of VLAD, we comprehensively investigate the roles and effects of the two widely-used operations in local feature encoding: normalization and pooling. To the best of our knowledge, our work provides the first comprehensive study on VLAD, which will not only enable a full understanding of it but also provide an important guidance for state-of-the-art algorithms based on local features. We have conducted extensive experiments on three benchmark datasets: Scene-15, Caltech 101 and PPMI for both image classification and action recognition.
机译:最近,局部聚合描述符(VLAD)的向量在包括图像检索,场景分类和动作识别的各种计算机视觉任务中显示了它的效率。它的成功源于强大的代表能力和计算效率。然而,它仍然尚不清楚其理论基础以及它如何连接到基本的,而是重要算法,例如,单词袋式模型和匹配内核,以及其性能如何受参数配置的影响,例如,归一化和汇集还基于本地特征广泛应用于最先进的算法。在本文中,旨在实现VLAD的全部能力,我们从理论分析和实验实践角度进行全面而深入的研究。作为一种理论贡献,我们通过匹配内核提供了一种新的VLAD,它用于将VLAD与基于本地特征的现有重要编码方法连接。作为对VLAD的实际应用的贡献,我们全面调查了两个广泛使用的操作在本地特征编码中的角色和影响:归一化和汇集。据我们所知,我们的工作为VLAD提供了第一个全面的研究,这不仅能够充分了解它,而且还为基于本地特征提供了最先进的算法的重要指导。我们对三个基准数据集进行了广泛的实验:用于图像分类和动作识别的场景-15,CALTECH 101和PPMI。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号