Use of permutation prefixes for efficient and scalable approximate similarity search

Andrea Esuli

首页> 外文期刊>Information Processing & Management >Use of permutation prefixes for efficient and scalable approximate similarity search

【24h】

Use of permutation prefixes for efficient and scalable approximate similarity search

机译：使用置换前缀进行有效和可扩展的近似相似性搜索

获取原文

获取原文并翻译 | 示例

掌桥外文数据库（机构版） >>

开具论文收录证明 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

We present the Permutation Prefix Index (this work is a revised and extended version of Esuli (2009b), presented at the 2009 LSDS-IR Workshop, held in Boston) (PP-Index), an index data structure that supports efficient approximate similarity search.The PP-Index belongs to the family of the permutation-based indexes, which are based on representing any indexed object with "its view of the surrounding world", i.e., a list of the elements of a set of reference objects sorted by their distance order with respect to the indexed object.In its basic formulation, the PP-Index is strongly biased toward efficiency. We show how the effectiveness can easily reach optimal levels just by adopting two "boosting" strategies: multiple index search and multiple query search, which both have nice parallelization properties.We study both the efficiency and the effectiveness properties of the PP-Index, experimenting with collections of sizes up to one hundred million objects, represented in a very high-dimensional similarity space.

机译：我们介绍置换前缀索引（这项工作是Esuli（2009b）的修订和扩展版本，在波士顿举行的2009 LSDS-IR研讨会上进行了介绍）（PP-Index），该索引数据结构支持有效的近似相似性搜索.PP-Index属于基于置换的索引的族，这些索引基于具有“其周围环境的视图”的任何索引对象的表示，即，按其排序的一组参考对象的元素列表相对于索引对象的距离顺序.PP-Index在其基本公式中强烈偏向效率。我们展示了仅通过采用两种具有良好并行化特性的“提升”策略（即多索引搜索和多查询搜索）即可轻松达到最佳水平。我们研究了PP-Index的效率和有效性属性，进行了实验在一个非常高维的相似性空间中代表着多达一亿个对象的集合。

著录项

来源
《Information Processing & Management》 |2012年第5期|p.889-902|共14页
作者
Andrea Esuli;
展开▼
作者单位

Istituto di Scienza e Tecnologie dell'Informazione, Consiglio Nazionale delle Ricerche, via Giuseppe Moruzzi, 1. 56124 Pisa, Italy;

展开▼
收录信息
原文格式 PDF
正文语种 eng
中图分类
关键词
approximate similarity search; metric space; scalability;

机译：近似相似度搜索;公制空间可扩展性;
入库时间 2022-08-17 23:20:15

相似文献

外文文献
中文文献
专利

1. Metric Index: An efficient and scalable solution for precise and approximate similarity search [J] . David Novak, Michal Batko, Pavel Zezula Information Systems . 2011,第4期

机译：指标索引：高效，可扩展的解决方案，用于精确和近似的相似度搜索
2. MI-File: using inverted files for scalable approximate similarity search [J] . Giuseppe Amato, Claudio Gennaro, Pasquale Savino Multimedia Tools and Applications . 2014,第3期

机译：MI文件：使用反向文件进行可伸缩的近似相似度搜索
3. A hybrid harmony search algorithm with efficient job sequence scheme and variable neighborhood search for the permutation flow shop scheduling problems [J] . Fuqing Zhao, Yang Liu, Yi Zhang, Engineering Applications of Artificial Intelligence . 2017,第octa期

机译：具有有效作业序列方案和可变邻域搜索的混合式和谐搜索算法，用于置换流水车间调度问题
4. On Semantic Solutions for Efficient Approximate Similarity Search on Large-Scale Datasets [C] . Alexander Ocsa, Jose Luis Huillca, Cristian Lopez del Alamo Iberoamerican congress on pattern recognition . 2018

机译：大规模数据集上有效近似相似搜索的语义解决方案
5. Efficient Similarity Search with Cache-Conscious Data Traversal [D] . Tang, Xun 2015

机译：具有缓存意识的数据遍历的高效相似性搜索
6. CPSARST: an efficient circular permutation search tool applied to the detection of novel protein structural relationships [O] . Wei-Cheng Lo, Ping-Chiang Lyu 2008

机译：CPSARST：一种有效的循环排列搜索工具用于检测新型蛋白质结构关系
7. PP-Index: using permutation prefixes for efficient and scalable approximate similarity search [O] . Esuli Andrea 2009

机译：PP-Index：使用置换前缀进行有效和可扩展的近似相似性搜索
8. Use of an approximate similarity principle for the thermal scaling of a full-scale thrust augmenting ejector [R] . Barankiewicz, Wendy, Perusek, Gail P., Ibrahim, Mounir 1992

机译：使用近似相似性原理进行全尺寸推力增强喷射器的热缩放

Use of permutation prefixes for efficient and scalable approximate similarity search

摘要

著录项

相似文献

相关主题

期刊订阅