首页> 外文会议>Computer Vision and Pattern Recognition (CVPR), 2012 IEEE Conference on >Visual stem mapping and Geometric Tense coding for Augmented Visual Vocabulary
【24h】

Visual stem mapping and Geometric Tense coding for Augmented Visual Vocabulary

机译:视觉词干映射和几何时态编码以增强视觉词汇

获取原文
获取原文并翻译 | 示例

摘要

This paper addresses the problem of affine distortions caused by viewpoint changes for the application of image retrieval. We study how to expand the visual words from a query image for better retrieval recall without the sacrifice of retrieval precision and efficiency. Our main contribution is the building of visual dictionaries that retain the mapping relationships between visual words extracted from different viewpoints of the same object. Additionally, in each mapping rule we record the affine transformation in which the two visual words are related, as a compact code of viewpoints relationships. By analogizing the concepts of verb stem and verb tense in text, we use Visual Stems to denote visual words extracted from robust local patches, and record the relationships between their affine variants as visual stem mapping rules, including the geometric relationships coded as Geometric Tenses. In this way, our method augments original visual vocabulary with sufficient and accurate expansion information. In query phase, only the objects corresponding to the same visual stems and coherent geometric tense codes will be regarded as similar ones. Moreover, the mapping rules can be learned offline with only one sample for each object. Experiments show that our method can support efficient object retrieval with high recall, requiring little extra time and space cost over traditional visual vocabularies.
机译:本文针对视点变化在图像检索中的应用解决了仿射失真的问题。我们研究了如何从查询图像中扩展视觉单词,以实现更好的检索召回率,而又不牺牲检索精度和效率。我们的主要贡献是建立了视觉词典,该词典保留了从同一对象的不同视点提取的视觉单词之间的映射关系。另外,在每个映射规则中,我们记录了两个视觉单词相关的仿射变换,作为视点关系的紧凑代码。通过在文本中模拟动词词干和动词时态的概念,我们使用视觉词干表示从健壮的局部补丁中提取的视觉词,并将其仿射变体之间的关系记录为视觉词干映射规则,包括编码为几何时态的几何关系。通过这种方式,我们的方法通过足够且准确的扩展信息来扩展原始视觉词汇。在查询阶段,只有对应于相同视觉茎和一致几何时态代码的对象才被视为相似对象。此外,每个对象仅需一个样本就可以离线学习映射规则。实验表明,我们的方法可以支持具有高召回率的有效对象检索,与传统视觉词汇相比,所需的时间和空间成本很少。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号