...
首页> 外文期刊>IEEE Transactions on Pattern Analysis and Machine Intelligence >Compositional Model Based Fisher Vector Coding for Image Classification
【24h】

Compositional Model Based Fisher Vector Coding for Image Classification

机译:基于成分模型的Fisher矢量编码图像分类

获取原文
获取原文并翻译 | 示例
   

获取外文期刊封面封底 >>

       

摘要

Deriving from the gradient vector of a generative model of local features, Fisher vector coding (FVC) has been identified as an effective coding method for image classification. Most, if not all, FVC implementations employ the Gaussian mixture model (GMM) as the generative model for local features. However, the representative power of a GMM can be limited because it essentially assumes that local features can be characterized by a fixed number of feature prototypes, and the number of prototypes is usually small in FVC. To alleviate this limitation, in this work, we break the convention which assumes that a local feature is drawn from one of a few Gaussian distributions. Instead, we adopt a compositional mechanism which assumes that a local feature is drawn from a Gaussian distribution whose mean vector is composed as a linear combination of multiple key components, and the combination weight is a latent random variable. In doing so we greatly enhance the representative power of the generative model underlying FVC. To implement our idea, we design two particular generative models following this compositional approach. In our first model, the mean vector is sampled from the subspace spanned by a set of bases and the combination weight is drawn from a Laplace distribution. In our second model, we further assume that a local feature is composed of a discriminative part and a residual part. As a result, a local feature is generated by the linear combination of discriminative part bases and residual part bases. The decomposition of the discriminative and residual parts is achieved via the guidance of a pre-trained supervised coding method. By calculating the gradient vector of the proposed models, we derive two new Fisher vector coding strategies. The first is termed Sparse Coding-based Fisher Vector Coding (SCFVC) and can be used as the substitute of traditional GMM based FVC. The second is termed Hybrid Sparse Coding-based Fisher vector coding (HSCFVC) since it combines the merits of both pre-trained supervised coding methods and FVC. Using pre-trained Convolutional Neural Network (CNN) activations as local features, we experimentally demonstrate that the proposed methods are superior to traditional GMM based FVC and achieve state-of-the-art performance in various image classification tasks.
机译:源自局部特征生成模型的梯度向量,Fisher向量编码(FVC)已被确定为图像分类的有效编码方法。大多数(如果不是全部)FVC实现采用高斯混合模型(GMM)作为局部特征的生成模型。但是,由于GMM基本上假定局部特征可以由固定数量的特征原型来表征,并且FVC中的原型数量通常很少,因此可以限制其代表性。为了减轻这种局限性,在这项工作中,我们打破了约定,即假定局部特征是从一些高斯分布之一中得出的。取而代之,我们采用一种组合机制,该机制假定局部特征是从高斯分布中提取的,其平均向量由多个关键成分的线性组合组成,并且组合权重是一个潜在的随机变量。通过这样做,我们极大地增强了FVC生成模型的代表性。为了实现我们的想法,我们按照这种组合方法设计了两个特定的生成模型。在我们的第一个模型中,均值矢量是从由一组基础跨越的子空间中采样的,并且组合权重是从拉普拉斯分布中得出的。在我们的第二个模型中,我们进一步假设局部特征是由可区分部分和残差部分组成的。结果,通过区分零件基础和剩余零件基础的线性组合生成局部特征。区分部分和残差部分的分解是通过预训练的监督编码方法的指导来实现的。通过计算所提出模型的梯度向量,我们得出了两种新的Fisher向量编码策略。第一个称为基于稀疏编码的Fisher矢量编码(SCFVC),可以用作传统基于GMM的FVC的替代。第二种方法称为基于混合稀疏编码的Fisher向量编码(HSCFVC),因为它结合了预训练监督编码方法和FVC的优点。使用预训练的卷积神经网络(CNN)激活作为局部特征,我们通过实验证明了所提出的方法优于传统的基于GMM的FVC,并在各种图像分类任务中实现了最新的性能。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号