Deep Voice-Visual Cross-Modal Retrieval with Deep Feature Similarity Learning

机译：具有深度特征相似性学习的深度语音视觉跨模态检索

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

Thanks to the development of deep learning, voice-visual cross-modal retrieval has made remarkable progress in recent years. However, there still exist some bottlenecks: how to establish effective correlation between voices and images to improve the retrieval precision and how to reduce data storage and speed up retrieval in large-scale cross-modal data. In this paper, we propose a novel Voice-Visual Cross-Modal Hashing (V2CMH) method, which can generate hash codes with low storage memory and fast retrieval properties. Specially, the proposed V2CMH method can leverage deep feature similarity to establish the semantic relationship between voices and images. In addition, for hash codes learning, our method attempts to preserve the semantic similarity of binary codes and reduce the information loss of binary codes generation. Experiments illustrate that V2CMH algorithm can achieve better retrieval performance than other state-of-the-art cross-modal retrieval algorithms.

机译：得益于深度学习的发展，近年来，视听交叉模式检索取得了显着进展。但是，仍然存在一些瓶颈：如何在语音和图像之间建立有效的关联以提高检索精度，以及如何减少数据存储量并加快大规模跨模态数据的检索速度。在本文中，我们提出了一种新颖的Voice-Visual Cross-Modal Hashing（V2CMH）方法，该方法可以生成具有低存储内存和快速检索特性的哈希码。特别地，所提出的V2CMH方法可以利用深度特征相似性来建立语音和图像之间的语义关系。另外，对于哈希码学习，我们的方法尝试保留二进制码的语义相似性并减少二进制码生成的信息损失。实验表明，V2CMH算法比其他最新的交叉模式检索算法具有更好的检索性能。

著录项

来源
《Chinese conference on pattern recognition and computer vision》|2019年|454-465|共12页
会议地点
作者
Yaxiong Chen; Xiaoqiang Lu; Yachuang Feng;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类
关键词
Cross-modal retrieval; Deep hashing; Deep feature similarity;

机译：跨模式检索;深度哈希;深度特征相似;

相似文献

外文文献
中文文献
专利

1. Deep attentional fine-grained similarity network with adversarial learning for cross-modal retrieval [J] . Qingrong Cheng, Xiaodong Gu Multimedia Tools and Applications . 2020,第41a42期

机译：深度预关注细粒度相似网络，对跨模型检索的对抗学习
2. Deep semantic similarity adversarial hashing for cross-modal retrieval [J] . Qiang Haopeng, Wan Yuan, Xiang Lun, Neurocomputing . 2020,第Auga4期

机译：跨模型检索的深度语义相似性伴随着
3. DHLBT: Efficient Cross-Modal Hashing Retrieval Method Based on Deep Learning Using Large Batch Training [J] . Xuewang Zhang, Jinzhao Lin, Yin Zhou International journal of software engineering and knowledge engineering . 2021,第7期

机译：DHLBT：基于大型批量训练的深度学习的高效跨模态散列检索方法
4. Deep Voice-Visual Cross-Modal Retrieval with Deep Feature Similarity Learning [C] . Yaxiong Chen, Xiaoqiang Lu, Yachuang Feng Chinese conference on pattern recognition and computer vision . 2019

机译：深度语音视觉跨模型检索，具有深度特征相似度学习
5. Cross-Modal Data Retrieval and Generation Using Deep Neural Networks [D] . Udaiyar, Premkumar. 2020

机译：使用深神经网络的跨模型数据检索和生成
6. Deep Unsupervised Hashing for Large-Scale Cross-Modal Retrieval Using Knowledge Distillation Model [O] . Mingyong Li, Qiqi Li, Lirong Tang, 2021

机译：使用知识蒸馏模型进行大规模交叉模态检索的深度无监督散列
7. Joint-modal Distribution-based Similarity Hashing for Large-scale Unsupervised Deep Cross-modal Retrieval [O] . Song Liu, Shengsheng Qian, Yang Guan, 2020

机译：基于联合模态分布的相似性散列大规模无监督的深度跨模型检索

Deep Voice-Visual Cross-Modal Retrieval with Deep Feature Similarity Learning

摘要

著录项

相似文献

相关主题

期刊订阅