【24h】

When VLAD Met Hilbert

机译:当VLAD遇见希尔伯特

获取原文

摘要

In many challenging visual recognition tasks where training data is limited, Vectors of Locally Aggregated Descriptors (VLAD) have emerged as powerful image/video representations that compete with or outperform state-of the-art approaches. In this paper, we address two fundamental limitations of VLAD: its requirement for the local descriptors to have vector form and its restriction to linear classifiers due to its high-dimensionality. To this end, we introduce a kernelized version of VLAD. This not only lets us inherently exploit more sophisticated classification schemes, but also enables us to efficiently aggregate nonvector descriptors (e.g., manifold-valued data) in the VLAD framework. Furthermore, we propose an approximate formulation that allows us to accelerate the coding process while still benefiting from the properties of kernel VLAD. Our experiments demonstrate the effectiveness of our approach at handling manifold-valued data, such as covariance descriptors, on several classification tasks. Our results also evidence the benefits of our nonlinear VLAD descriptors against the linear ones in Euclidean space using several standard benchmark datasets.
机译:在训练数据有限的许多具有挑战性的视觉识别任务中,局部聚合描述符向量(VLAD)已作为强大的图像/视频表示形式出现,它们可以与现有方法竞争或胜过其先进方法。在本文中,我们解决了VLAD的两个基本限制:它要求局部描述符具有矢量形式,并且由于其高维性而限制于线性分类器。为此,我们介绍了VLAD的内核版本。这不仅使我们能够固有地利用更复杂的分类方案,而且使我们能够在VLAD框架中有效地聚合非矢量描述符(例如,多值数据)。此外,我们提出了一种近似公式,可以使我们加快编码过程,同时仍然受益于内核VLAD的特性。我们的实验证明了我们的方法在一些分类任务上处理流形值数据(例如协方差描述符)的有效性。我们的结果还证明了使用几个标准基准数据集,非线性VLAD描述子相对于欧几里得空间中的线性VLAD描述子的好处。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号