首页> 外文会议>European conference on computer vision >DeepKSPD: Learning Kernel-Matrix-Based SPD Representation For Fine-Grained Image Recognition
【24h】

DeepKSPD: Learning Kernel-Matrix-Based SPD Representation For Fine-Grained Image Recognition

机译:DeepKSPD:学习基于核矩阵的SPD表示以进行细粒度的图像识别

获取原文

摘要

As a second-order pooled representation, covariance matrix has attracted much attention in visual recognition, and some pioneering works have recently integrated it into deep learning. A recent study shows that kernel matrix works considerably better than covariance matrix for this kind of representation, by modeling the higher-order, nonlinear relationship among pooled visual descriptors. Nevertheless, in that study neither the descriptors nor the kernel matrix is deeply learned. Worse, they are considered separately, hindering the pursuit of an optimal representation. To improve this situation, this work designs a deep network that jointly learns local descriptors and kernel-matrix-based pooled representation in an end-to-end manner. The derivatives for the mapping from a local descriptor set to this representation are derived to carry out backpropagation. More importantly, we introduce the Daleckii-Krein formula from Operator theory to give a concise and unified result on differentiating general functions defined on symmetric positive-definite (SPD) matrix, which shows its better numerical stability in conducting backpropagation compared with the existing method when handling the Riemannian geometry of SPD matrix. Experiments on fine-grained image benchmark datasets not only show the superiority of kernel-matrix-based SPD representation with deep local descriptors, but also verify the advantage of the proposed deep network in pursuing better SPD representations. Also, ablation study is provided to explain why and from where these improvements are attained.
机译:作为二阶合并表示,协方差矩阵已在视觉识别中引起了广泛关注,并且一些开创性工作最近已将其集成到深度学习中。最近的研究表明,通过对合并的视觉描述子之间的高阶非线性关系进行建模,核矩阵对于这种表示形式的工作要比协方差矩阵好得多。然而,在那项研究中,描述符和核矩阵都没有被深刻地学习过。更糟糕的是,它们被分开考虑,从而阻碍了对最佳表示的追求。为了改善这种情况,这项工作设计了一个深度网络,该网络以端到端的方式共同学习局部描述符和基于内核矩阵的池化表示。导出从本地描述符集到此表示形式的映射的导数,以执行反向传播。更重要的是,我们从算子理论中引入了Daleckii-Krein公式,以求简明统一的结果来区分对称正定(SPD)矩阵上定义的一般函数,与现有方法相比,它在进行反向传播时具有更好的数值稳定性。处理SPD矩阵的黎曼几何。对细粒度图像基准数据集进行的实验不仅显示了基于核矩阵的SPD表示与深度局部描述符的优越性,而且还验证了所提出的深度网络在追求更好的SPD表示形式方面的优势。另外,提供消融研究以解释为什么以及从何处获得这些改进。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号