首页> 外文期刊>Image and Vision Computing >How to use Bag-of-Words model better for image classification
【24h】

How to use Bag-of-Words model better for image classification

机译:如何更好地使用词袋模型进行图像分类

获取原文
获取原文并翻译 | 示例
           

摘要

The Bag-of-Words (BoW) framework is well-known in image classification. In the framework, there are two essential steps: 1) coding, which encodes local features by a visual vocabulary, and 2) pooling, which pools over the response of all features into image representation. Many coding and pooling methods are proposed, and how to apply them better in different conditions has become a practical problem. In this paper, to better use BoW in different applications, we study the relation between many typical coding methods and two popular pooling methods. Specifically, complete combinations of coding and pooling are evaluated based on an extremely large range of vocabulary sizes (16 to 1M) on five primary and popular datasets. Three typical ones are 15 Scenes, Caltech 101 and PASCAL VOC 2007, while the other two large-scale ones are Caltech 256 and ImageNet. Based on the systematic evaluation, some interesting conclusions are drawn. Some conclusions are the extensions of previous viewpoints, while some are different but important to understand BoW model. Based on these conclusions, we provide detailed application criterions by evaluating coding and pooling based on precision, efficiency and memory requirements in different applications. We hope that this study can be helpful to evaluate different coding and pooling methods, the conclusions can be beneficial to better understand BoW, and the application criterions can be valuable to use BoW better indifferent applications. (C) 2014 Elsevier B.V. All rights reserved.
机译:单词袋(BoW)框架在图像分类中是众所周知的。在该框架中,有两个基本步骤:1)编码,通过视觉词汇对局部特征进行编码,以及2)合并,将所有特征的响应合并为图像表示。提出了许多编码和合并方法,如何在不同条件下更好地应用它们已成为一个实际问题。在本文中,为了更好地在不同的应用中使用BoW,我们研究了许多典型的编码方法与两种流行的合并方法之间的关系。具体来说,将基于五个主要数据集和流行数据集上极大范围的词汇量(16到1M)来评估编码和合并的完整组合。三个典型的场景是15个场景,Caltech 101和PASCAL VOC 2007,另外两个大型场景是Caltech 256和ImageNet。在系统评价的基础上,得出了一些有趣的结论。一些结论是先前观点的扩展,而某些结论则不同,但对于理解BoW模型很重要。基于这些结论,我们通过根据不同应用程序中的精度,效率和内存要求评估编码和合并,从而提供详细的应用程序准则。我们希望这项研究可以帮助评估不同的编码和合并方法,得出的结论有助于更好地了解BoW,并且应用标准对于使用BoW更好的不同应用程序可能有价值。 (C)2014 Elsevier B.V.保留所有权利。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号