首页> 外文会议>Computer analysis of images and patterns. >Contextual-Guided Bag-of-Visual-Words Model for Multi-class Object Categorization
【24h】

Contextual-Guided Bag-of-Visual-Words Model for Multi-class Object Categorization

机译:用于多类对象分类的上下文指导视觉词袋模型

获取原文
获取原文并翻译 | 示例

摘要

Bag-of-words model (BOW) is inspired by the text classification problem, where a document is represented by an unsorted set of contained words. Analogously, in the object categorization problem, an image is represented by an unsorted set of discrete visual words (BOVW). In these models, relations among visual words are performed after dictionary construction. However, close object regions can have far descriptions in the feature space, being grouped as different visual words. In this paper, we present a method for considering geometrical information of visual words in the dictionary construction step. Object interest regions are obtained by means of the Harris-Affine detector and then described using the SIFT descriptor. Afterward, a contextual-space and a feature-space are defined, and a merging process is used to fuse feature words based on their proximity in the contextual-space. Moreover, we use the Error Correcting Output Codes framework to learn the new dictionary in order to perform multi-class classification. Results show significant classification improvements when spatial information is taken into account in the dictionary construction step.
机译:词袋模型(BOW)受到文本分类问题的启发,其中文档由一组未排序的包含词表示。类似地,在对象分类问题中,图像由未排序的离散视觉单词(BOVW)集表示。在这些模型中,视觉词之间的关系是在字典构建之后执行的。但是,接近的对象区域在特征空间中可能具有很长的描述,被分为不同的视觉单词。在本文中,我们提出了一种在词典构建步骤中考虑视觉单词的几何信息的方法。通过Harris-Affine检测器获得对象感兴趣区域,然后使用SIFT描述符进行描述。之后,定义上下文空间和特征空间,并基于特征词在上下文空间中的接近度,使用合并过程融合特征词。此外,我们使用纠错输出代码框架来学习新词典,以便执行多类分类。当在字典构建步骤中考虑空间信息时,结果显示出明显的分类改进。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号