首页> 外文会议>Computer Vision and Pattern Recognition (CVPR), 2012 IEEE Conference on >Visual stem mapping and Geometric Tense coding for Augmented Visual Vocabulary

【24h】

Visual stem mapping and Geometric Tense coding for Augmented Visual Vocabulary

机译：视觉词干映射和几何时态编码以增强视觉词汇

获取原文

获取原文并翻译 | 示例

页面导航

摘要
著录项
相似文献
相关主题

摘要

This paper addresses the problem of affine distortions caused by viewpoint changes for the application of image retrieval. We study how to expand the visual words from a query image for better retrieval recall without the sacrifice of retrieval precision and efficiency. Our main contribution is the building of visual dictionaries that retain the mapping relationships between visual words extracted from different viewpoints of the same object. Additionally, in each mapping rule we record the affine transformation in which the two visual words are related, as a compact code of viewpoints relationships. By analogizing the concepts of verb stem and verb tense in text, we use Visual Stems to denote visual words extracted from robust local patches, and record the relationships between their affine variants as visual stem mapping rules, including the geometric relationships coded as Geometric Tenses. In this way, our method augments original visual vocabulary with sufficient and accurate expansion information. In query phase, only the objects corresponding to the same visual stems and coherent geometric tense codes will be regarded as similar ones. Moreover, the mapping rules can be learned offline with only one sample for each object. Experiments show that our method can support efficient object retrieval with high recall, requiring little extra time and space cost over traditional visual vocabularies.

机译：本文针对视点变化在图像检索中的应用解决了仿射失真的问题。我们研究了如何从查询图像中扩展视觉单词，以实现更好的检索召回率，而又不牺牲检索精度和效率。我们的主要贡献是建立了视觉词典，该词典保留了从同一对象的不同视点提取的视觉单词之间的映射关系。另外，在每个映射规则中，我们记录了两个视觉单词相关的仿射变换，作为视点关系的紧凑代码。通过在文本中模拟动词词干和动词时态的概念，我们使用视觉词干表示从健壮的局部补丁中提取的视觉词，并将其仿射变体之间的关系记录为视觉词干映射规则，包括编码为几何时态的几何关系。通过这种方式，我们的方法通过足够且准确的扩展信息来扩展原始视觉词汇。在查询阶段，只有对应于相同视觉茎和一致几何时态代码的对象才被视为相似对象。此外，每个对象仅需一个样本就可以离线学习映射规则。实验表明，我们的方法可以支持具有高召回率的有效对象检索，与传统视觉词汇相比，所需的时间和空间成本很少。

著录项

来源
《Computer Vision and Pattern Recognition (CVPR), 2012 IEEE Conference on 》|2012年|p.3234- 3241|共8页
会议地点 Providence RI(US)
作者
Gao Ke; Zhang Yongdong; Luo Ping; Zhang Wei; Xia Junhai; Lin Shouxun;
展开▼
作者单位

Advanced Computing Research Laboratory, Beijing Key Laboratory of Mobile Computing and Pervasive Device, Institute of Computing Technology, Chinese Academy of Sciences;

展开▼
会议组织
原文格式 PDF
正文语种 eng
中图分类 TP391.41;
关键词

相似文献

外文文献
中文文献
专利

1. Performance Improvement of Multi-View Video Coding Based on Geometric Prediction and Human Visual System [J] . Mian-Shiuan Li, Mei-Juan Chen, Chia-Hung Yeh, International journal of imaging systems and technology . 2015 ,第1期

机译：基于几何预测和人类视觉系统的多视点视频编码性能改进
2. Multimodal visualization of complementary color-coded FA map and tensor glyphs for interactive tractography ROI seeding [J] . Voltoline Raphael, Wu Shin-Ting Computers & Graphics . 2021 ,第May期

机译：互补颜色编码的多媒体可视化和交互式牵引ROI播种的张量晶文
3. Graphics, Audio-visuals and Interaction (GAI) based Handheld Augmented Reality System [J] . Edmund Ng Giap Weng, Md. Abdullah-Al-Jubair, Shahren Ahmad Zaidi Adruce, Procedia - Social and Behavioral Sciences . 2013 ,第2期

机译：基于图形，视听与交互（GAI）的手持增强现实系统
4. Visual stem mapping and Geometric Tense coding for Augmented Visual Vocabulary [C] . Gao Ke, Zhang Yongdong, Luo Ping, IEEE Conference on Computer Vision and Pattern Recognition . 2012

机译：用于增强视觉词汇的视觉阀杆映射和几何时态编码
5. Beyond the Receptive Field: An Analysis of Natural Scenes and a Geometric Interpretation of Efficient Coding Strategies by the Mammalian Visual System [D] . Vilankar, Kedarnath Padmakar. 2017

机译：超越接受领域：哺乳动物视觉系统对自然场景的分析和有效编码策略的几何解释
6. Typical Toddlers Participation in Just-in-Time Programming of Vocabulary for Visual Scene Display Augmentative and Alternative Communication Apps on Mobile Technology: A Descriptive Study [O] . Christine Holyfield, Kathryn Drager, Janice Light, -1

机译：典型的幼儿参与即时词汇编程的视觉技术在移动技术上的增强和替代通信应用程序：描述性研究
7. Geometric Visualization Interaction System Based on Augmented Reality [O] . Feng HAN, Xing-Qian LI, Jian-Jun WANG, 2017

机译：基于增强现实的几何可视化交互系统

Visual stem mapping and Geometric Tense coding for Augmented Visual Vocabulary

摘要

著录项

相似文献

相关主题

期刊订阅