Effective and efficient indexing in cross-modal hashing-based datasets

Chiu Chih-Yi; Markchit Sarawut



Abstract

To overcome storage and computation barriers, hashing has recently been widely used for nearest neighbor search in multimedia retrieval applications. In particular, cross-modal retrieval, which searches across different modalities, has become an active but challenging problem. Although numerous cross-modal hashing algorithms have been proposed to yield compact binary codes, exhaustive search is impractical for large-scale datasets, and Hamming distance computation suffers from inaccurate results. In this paper, we propose a novel search method that utilizes a probability-based indexing scheme over binary hash codes in cross-modal retrieval. The proposed indexing scheme employs a few binary bits from the hash code as the index code. We construct an inverted index table based on the index codes and train a neural network for ranking and indexing to improve retrieval accuracy. Experiments are performed on two benchmark datasets for retrieval across image and text modalities, where hash codes are generated and compared with several state-of-the-art cross-modal hashing methods. Results show that the proposed method effectively improves search accuracy, computation cost, and memory consumption across these datasets and hashing methods. The source code is available at https://github.com/msarawut/HCI.
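
The indexing idea in the abstract (a few bits of each hash code serve as an index code, and an inverted index table maps index codes to items) can be illustrated with a minimal sketch. This is not the authors' implementation: the function names, the choice of the leading bits as the index code, and the `n_probe` parameter are assumptions made for illustration, and the learned neural-network ranking described in the abstract is replaced here by a plain Hamming-distance ordering of buckets.

```python
from collections import defaultdict


def index_code(hash_code: int, code_bits: int, index_bits: int) -> int:
    """Return the leading `index_bits` bits of a `code_bits`-bit hash code.

    Assumption: the index code is a prefix of the hash code; the abstract
    only says that "a few binary bits" are used.
    """
    return hash_code >> (code_bits - index_bits)


def build_inverted_index(codes, code_bits, index_bits):
    """Map each index code to the list of item ids whose hash code has it."""
    table = defaultdict(list)
    for item_id, code in enumerate(codes):
        table[index_code(code, code_bits, index_bits)].append(item_id)
    return table


def hamming(a: int, b: int) -> int:
    """Number of differing bits between two integer-encoded binary codes."""
    return bin(a ^ b).count("1")


def search(query, codes, table, code_bits, index_bits, n_probe, top_k):
    """Probe the `n_probe` closest buckets, then rerank their items.

    Stand-in for the learned ranking: buckets are ordered by the Hamming
    distance between their index code and the query's index code.
    """
    q_idx = index_code(query, code_bits, index_bits)
    buckets = sorted(table, key=lambda b: (hamming(b, q_idx), b))[:n_probe]
    candidates = [i for b in buckets for i in table[b]]
    # Rerank only the retrieved candidates by full-code Hamming distance.
    candidates.sort(key=lambda i: (hamming(codes[i], query), i))
    return candidates[:top_k]
```

With 8-bit codes and a 3-bit index code, calling `build_inverted_index(codes, 8, 3)` and then `search(query, codes, table, 8, 3, n_probe=2, top_k=2)` compares the query against only the items in the two probed buckets rather than the whole dataset, which is the point of avoiding exhaustive search.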


Bibliographic details

Source
Signal Processing. Image Communication: A Publication of the European Association for Signal Processing | 2020, Issue 2020 | 13 pages
Authors
Chiu Chih-Yi; Markchit Sarawut
Author affiliations

Natl Chiayi Univ, Dept Comp Sci & Informat Engn, 300 Syuefu Rd, Chiayi 60004, Taiwan;
Natl Chiayi Univ, Dept Comp Sci & Informat Engn, 300 Syuefu Rd, Chiayi 60004, Taiwan

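Indexing information
Original format: PDF
Language of text: English (eng)
Chinese Library Classification: image communication, multimedia communication; communication
Keywords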
Binary embedding; Cross-modal retrieval; Inverted indexing; Learning to rank; Nearest neighbor search;


