首页> 外文会议>Chinese conference on pattern recognition and computer vision >Deep Voice-Visual Cross-Modal Retrieval with Deep Feature Similarity Learning
【24h】

Deep Voice-Visual Cross-Modal Retrieval with Deep Feature Similarity Learning

机译:具有深度特征相似性学习的深度语音视觉跨模态检索

获取原文

摘要

Thanks to the development of deep learning, voice-visual cross-modal retrieval has made remarkable progress in recent years. However, there still exist some bottlenecks: how to establish effective correlation between voices and images to improve the retrieval precision and how to reduce data storage and speed up retrieval in large-scale cross-modal data. In this paper, we propose a novel Voice-Visual Cross-Modal Hashing (V2CMH) method, which can generate hash codes with low storage memory and fast retrieval properties. Specially, the proposed V2CMH method can leverage deep feature similarity to establish the semantic relationship between voices and images. In addition, for hash codes learning, our method attempts to preserve the semantic similarity of binary codes and reduce the information loss of binary codes generation. Experiments illustrate that V2CMH algorithm can achieve better retrieval performance than other state-of-the-art cross-modal retrieval algorithms.
机译:得益于深度学习的发展,近年来,视听交叉模式检索取得了显着进展。但是,仍然存在一些瓶颈:如何在语音和图像之间建立有效的关联以提高检索精度,以及如何减少数据存储量并加快大规模跨模态数据的检索速度。在本文中,我们提出了一种新颖的Voice-Visual Cross-Modal Hashing(V2CMH)方法,该方法可以生成具有低存储内存和快速检索特性的哈希码。特别地,所提出的V2CMH方法可以利用深度特征相似性来建立语音和图像之间的语义关系。另外,对于哈希码学习,我们的方法尝试保留二进制码的语义相似性并减少二进制码生成的信息损失。实验表明,V2CMH算法比其他最新的交叉模式检索算法具有更好的检索性能。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号