How to use Bag-of-Words model better for image classification

Wang Chong; Huang Kaiqi

首页> 外文期刊>Image and Vision Computing >How to use Bag-of-Words model better for image classification

【24h】

How to use Bag-of-Words model better for image classification

机译：如何更好地使用词袋模型进行图像分类

获取原文

获取原文并翻译 | 示例

掌桥外文数据库（机构版） >>

开具论文收录证明 >>

文献代查 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

The Bag-of-Words (BoW) framework is well-known in image classification. In the framework, there are two essential steps: 1) coding, which encodes local features by a visual vocabulary, and 2) pooling, which pools over the response of all features into image representation. Many coding and pooling methods are proposed, and how to apply them better in different conditions has become a practical problem. In this paper, to better use BoW in different applications, we study the relation between many typical coding methods and two popular pooling methods. Specifically, complete combinations of coding and pooling are evaluated based on an extremely large range of vocabulary sizes (16 to 1M) on five primary and popular datasets. Three typical ones are 15 Scenes, Caltech 101 and PASCAL VOC 2007, while the other two large-scale ones are Caltech 256 and ImageNet. Based on the systematic evaluation, some interesting conclusions are drawn. Some conclusions are the extensions of previous viewpoints, while some are different but important to understand BoW model. Based on these conclusions, we provide detailed application criterions by evaluating coding and pooling based on precision, efficiency and memory requirements in different applications. We hope that this study can be helpful to evaluate different coding and pooling methods, the conclusions can be beneficial to better understand BoW, and the application criterions can be valuable to use BoW better indifferent applications. (C) 2014 Elsevier B.V. All rights reserved.

机译：单词袋（BoW）框架在图像分类中是众所周知的。在该框架中，有两个基本步骤：1）编码，通过视觉词汇对局部特征进行编码，以及2）合并，将所有特征的响应合并为图像表示。提出了许多编码和合并方法，如何在不同条件下更好地应用它们已成为一个实际问题。在本文中，为了更好地在不同的应用中使用BoW，我们研究了许多典型的编码方法与两种流行的合并方法之间的关系。具体来说，将基于五个主要数据集和流行数据集上极大范围的词汇量（16到1M）来评估编码和合并的完整组合。三个典型的场景是15个场景，Caltech 101和PASCAL VOC 2007，另外两个大型场景是Caltech 256和ImageNet。在系统评价的基础上，得出了一些有趣的结论。一些结论是先前观点的扩展，而某些结论则不同，但对于理解BoW模型很重要。基于这些结论，我们通过根据不同应用程序中的精度，效率和内存要求评估编码和合并，从而提供详细的应用程序准则。我们希望这项研究可以帮助评估不同的编码和合并方法，得出的结论有助于更好地了解BoW，并且应用标准对于使用BoW更好的不同应用程序可能有价值。（C）2014 Elsevier B.V.保留所有权利。

著录项

来源
《Image and Vision Computing》 |2015年第6期|65-74|共10页
作者
Wang Chong; Huang Kaiqi;
展开▼
作者单位

Chinese Acad Sci, Inst Automat, Natl Lab Pattern Recognit, Beijing 100864, Peoples R China;

Chinese Acad Sci, Inst Automat, Natl Lab Pattern Recognit, Beijing 100864, Peoples R China;

展开▼
收录信息美国《科学引文索引》(SCI);美国《工程索引》(EI);
原文格式 PDF
正文语种 eng
中图分类
关键词
Image classification; Bag-of-Words; Visual vocabulary; Coding; Pooling;

机译：图像分类词袋视觉词汇编码池化;

相似文献

外文文献
中文文献
专利

1. Fetal Ultrasound Image Classification Using a Bag-of-words Model Trained on Sonographers’ Eye Movements [J] . M. Ahmed, J.A. Noble Procedia Computer Science . 2016,第1期

机译：使用超声医师眼动训练的词袋模型对胎儿超声图像进行分类
2. A Two-Level Recurrent Neural Network Language Model Based on the Continuous Bag-of-Words Model for Sentence Classification [J] . Lee Yo Han, Kim Dong W., Lim Myo Taeg International Journal of Artificial Intelligence Tools: Architectures, Languages, Algorithms . 2019,第1期

机译：一种基于句子分类连续袋式模型的两级反复性神经网络语言模型
3. Keypoint selection for efficient bag-of-words feature generation and effective image classification [J] . Lin Wei-Chao, Tsai Chih-Fong, Chen Zong-Yao, Information Sciences: An International Journal . 2016,第Null期

机译：关键点选择可实现有效的词袋特征生成和有效的图像分类
4. Bag-of-Words model based image classification and evaluation of image sample quality in remote sensing [C] . Zhang Hui, Li Bangyu, Wang Yan, IEEE Region 10 Conference . 2013

机译：基于词袋模型的遥感图像分类和图像样本质量评估
5. Large-scale Content-based Multimedia Analysis and Applications Using Bag-of-words Model. [D] . Zhang, Ning. 2012

机译：使用词袋模型的大规模基于内容的多媒体分析和应用。
6. The influence of preprocessing on text classification using a bag-of-words representation [O] . Yaakov HaCohen-Kerner, Daniel Miller, Yair Yigal, 2020

机译：使用袋式表示预处理预处理对文本分类的影响
7. An Image Classification Method Based on Optimized Fuzzy Bag-of-words Model [O] . Zilong Li, Yong Zhou, Rong Bao 2019

机译：一种基于优化模糊袋式模型的图像分类方法

How to use Bag-of-Words model better for image classification

摘要

著录项

相似文献

相关主题

期刊订阅