...
首页> 外文期刊>IEEE transactions on multimedia >Discrete Multimodal Hashing With Canonical Views for Robust Mobile Landmark Search
【24h】

Discrete Multimodal Hashing With Canonical Views for Robust Mobile Landmark Search

机译:具有规范视图的离散多峰散列,可进行可靠的移动地标搜索

获取原文
获取原文并翻译 | 示例

摘要

Mobile landmark search (MLS) recently receives increasing attention for its great practical values. However, it still remains unsolved due to two important challenges. One is high bandwidth consumption of query transmission, and the other is the huge visual variations of query images sent from mobile devices. In this paper, we propose a novel hashing scheme, named as canonical view based discrete multimodal hashing (CV-DMH), to handle these problems. First, a submodular function is designed to measure visual representativeness and redundancy of a view set. With it, canonical views, which capture key visual appearances of landmark with limited redundancy, are efficiently discovered with an iterative mining strategy. Second, multimodal sparse coding is applied to transform visual features from multiple modalities into an intermediate representation. It can robustly and adaptively characterize visual contents of varied landmark images with certain canonical views. Finally, compact binary codes are learned on intermediate representation within a tailored discrete binary embedding model which preserves visual relations of images measured with canonical views and removes the involved noises. In this part, we develop a new augmented Lagrangian multiplier (ALM) based optimization method to directly solve the discrete binary codes. We can not only explicitly deal with the discrete constraint, but also consider the bit-uncorrelated constraint and balance constraint together. The proposed solution can desirably avoid accumulated quantization errors in conventional optimization method which simply adopts two-step ``relaxing+rounding'' framework. Experiments on real world landmark datasets demonstrate the superior performance of CV-DMH over several state-of-the-art methods.
机译:移动地标搜索(MLS)最近因其巨大的实用价值而受到越来越多的关注。但是,由于两个重要挑战,它仍然没有解决。一个是查询传输的高带宽消耗,另一个是从移动设备发送的查询图像的巨大视觉变化。在本文中,我们提出了一种新颖的哈希方案,称为基于规范视图的离散多峰哈希(CV-DMH),以解决这些问题。首先,设计了一个子模块函数,以测量视觉代表性和视图集的冗余度。有了它,就可以通过迭代挖掘策略有效地发现规范视图,这些规范视图以有限的冗余度捕获了地标的关键视觉外观。其次,应用多模式稀疏编码将视觉特征从多种模式转换为中间表示。它可以使用某些规范视图来鲁棒性和自适应地表征各种地标图像的视觉内容。最后,在定制的离散二进制嵌入模型内的中间表示上学习紧凑的二进制代码,该模型保留了使用规范视图测得的图像的视觉关系并消除了所涉及的噪声。在这一部分中,我们开发了一种新的基于增强拉格朗日乘数(ALM)的优化方法,可以直接求解离散二进制代码。我们不仅可以显式地处理离散约束,而且可以将位无关的约束和平衡约束一起考虑。所提出的解决方案可以理想地避免简单地采用两步``松弛+舍入''框架的常规优化方法中的累积量化误差。现实世界地标数据集上的实验证明了CV-DMH优于几种最新方法的性能。

著录项

  • 来源
    《IEEE transactions on multimedia》 |2017年第9期|2066-2079|共14页
  • 作者单位

    School of Information Technology and Electrical Engineering, University of Queensland, Brisbane, QLD, Australia;

    School of Information Technology and Electrical Engineering, University of Queensland, Brisbane, QLD, Australia;

    Department of Computer Science, San Diego State University, San Diego, CA, USA;

    School of Computing, National University of Singapore, Singapore;

    School of Information Science and Engineering, Shandong Normal University, Shandong, China;

    School of Information Technology and Electrical Engineering, University of Queensland, Brisbane, QLD, Australia;

  • 收录信息
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类
  • 关键词

    Visualization; Binary codes; Robustness; Optimization; Mobile communication; Mobile handsets; Redundancy;

    机译:可视化;二进制代码;稳健性;优化;移动通信;移动手机;冗余;

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号