Contextual-Guided Bag-of-Visual-Words Model for Multi-class Object Categorization

机译：用于多类对象分类的上下文指导视觉词袋模型

获取原文

获取原文并翻译 | 示例

页面导航

摘要
著录项
相似文献
相关主题

摘要

Bag-of-words model (BOW) is inspired by the text classification problem, where a document is represented by an unsorted set of contained words. Analogously, in the object categorization problem, an image is represented by an unsorted set of discrete visual words (BOVW). In these models, relations among visual words are performed after dictionary construction. However, close object regions can have far descriptions in the feature space, being grouped as different visual words. In this paper, we present a method for considering geometrical information of visual words in the dictionary construction step. Object interest regions are obtained by means of the Harris-Affine detector and then described using the SIFT descriptor. Afterward, a contextual-space and a feature-space are defined, and a merging process is used to fuse feature words based on their proximity in the contextual-space. Moreover, we use the Error Correcting Output Codes framework to learn the new dictionary in order to perform multi-class classification. Results show significant classification improvements when spatial information is taken into account in the dictionary construction step.

机译：词袋模型（BOW）受到文本分类问题的启发，其中文档由一组未排序的包含词表示。类似地，在对象分类问题中，图像由未排序的离散视觉单词（BOVW）集表示。在这些模型中，视觉词之间的关系是在字典构建之后执行的。但是，接近的对象区域在特征空间中可能具有很长的描述，被分为不同的视觉单词。在本文中，我们提出了一种在词典构建步骤中考虑视觉单词的几何信息的方法。通过Harris-Affine检测器获得对象感兴趣区域，然后使用SIFT描述符进行描述。之后，定义上下文空间和特征空间，并基于特征词在上下文空间中的接近度，使用合并过程融合特征词。此外，我们使用纠错输出代码框架来学习新词典，以便执行多类分类。当在字典构建步骤中考虑空间信息时，结果显示出明显的分类改进。

著录项

来源
《Computer analysis of images and patterns.》|2009年|p.748-756|共9页
会议地点 Munster(DE);Munster(DE)
作者
Mehdi Mirza-Mohammadi; Sergio Escalera; Petia Radcva;
展开▼
作者单位

Dept. Matematica Aplicada i Analisi, Gran Via 585, 08007, Barcelona, Spain;

Dept. Matematica Aplicada i Analisi, Gran Via 585, 08007, Barcelona, Spain,Computer Vision Center, Campus UAB, Edifici O, 08193, Bellaterra, Barcelona;

Dept. Matematica Aplicada i Analisi, Gran Via 585, 08007, Barcelona, Spain,Computer Vision Center, Campus UAB, Edifici O, 08193, Bellaterra, Barcelona;

展开▼
会议组织
原文格式 PDF
正文语种 eng
中图分类信息处理（信息加工）;信息处理（信息加工）;
关键词

相似文献

外文文献
中文文献
专利

1. Incorporating Contextual Information into Bag-of-Visual-Words Framework for Effective Object Categorization [J] . Shuang BAI, Tetsuya MATSUMOTO, Yoshinori TAKEUCHI, IEICE transactions on information and systems . 2012,第12期

机译：将上下文信息整合到有效词目分类的可视化词袋框架中
2. Incorporating Contextual Information into Bag-of-Visual-Words Framework for Effective Object Categorization [J] . Shuang BAI, Tetsuya MATSUMOTO, Yoshinori TAKEUCH, IEICE Transactions on Information and Systems . 2012,第12期

机译：将上下文信息整合到有效词分类系统中
3. A feature binding computational model for multi-class object categorization and recognition [J] . Xishun Wang, Xi Liu, Zhongzhi Shi, Neural computing & applications . 2012,第6期

机译：用于多类对象分类和识别的特征绑定计算模型
4. Contextual-Guided Bag-of-Visual-Words Model for Multi-class Object Categorization [C] . Mehdi Mirza-Mohammadi, Sergio Escalera, Petia Radeva International Conference on Computer Analysis of Images and Patterns . 2009

机译：用于多类对象分类的上下文引导袋 - 视觉单词模型
5. Visual attention and object categorization: From psychophysics to computational models. [D] . Peters, Robert J. 2004

机译：视觉注意力和对象分类：从心理物理学到计算模型。
6. NEUROIMAGING EVIDENCE FOR OBJECT MODEL VERIFICATION THEORY: ROLE OF PREFRONTAL CONTROL IN VISUAL OBJECT CATEGORIZATION [O] . Giorgio Ganis, Haline E. Schendan, Stephen M. Kosslyn -1

机译：对象模型验证理论的神经影像学证据：前控制在视觉对象分类中的作用
7. Contextual-Guided Bag-of-Visual-Words Model for Multi-class Object Categorization [O] . Mehdi Mirza-mohammadi, Sergio Escalera, Petia Radeva 2014

机译：用于多类对象分类的上下文引导视觉词模型
8. Development of Multi-Class, Multi-Criteria Bicycle Traffic Assignment Models and Solution Algorithms. [R] . Ruy, S., Chen, A., Su, J., 2015

机译：多级，多标准自行车交通分配模型和解决方案算法的开发。

Contextual-Guided Bag-of-Visual-Words Model for Multi-class Object Categorization

摘要

著录项

相似文献

相关主题

期刊订阅