A Quantitative Analysis and Performance Study for Similarity-Search Methods in High-Dimensional Spaces

机译：高维空间中相似性搜索方法的定量分析与绩效研究

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

For similarity search in high-dimensional vector spaces (or 'HDVSs'), researchers have proposed a number of new methods (or adaptations of existing methods) based, in the main, on data-space partitioning. However, the performance of these methods generally degrades as dimensionality increases. Although this phenomenon-known as the 'dimensional curse'-is well known, little or no quantitative analysis of the phenomenon is available. In this paper, we provide a detailed analysis of partitioning and clustering techniques for similarity search in HDVSs. We show formally that these methods exhibit linear complexity at high dimensionality, and that existing methods are outperformed on average by a simple sequential scan if the number of dimensions exceeds around 10. Consequently, we come up with an alternative organization based on approximations to make the unavoidable sequential scan as fast as possible. We describe a simple vector approximation scheme, called VA-file, and report on an experimental evaluation of this and of two tree-based index methods (an R~*-tree and an X-tree).

机译：对于高维向量空间（或'HDVSS'）中的相似性搜索，研究人员在主要的数据空间分区中提出了许多基于主数据空间分区的新方法（或现有方法的适应）。然而，随着维度增加，这些方法的性能通常会降低。虽然这种现象称为“尺寸曲折 - 是众所周知的，但对于现象来说很少或没有定量分析。在本文中，我们提供了对HDVS中相似性搜索的分区和聚类技术的详细分析。我们正式展示这些方法在高维度下表现出线性复杂性，并且如果尺寸的数量超过10.因此，如果尺寸的数量超过10，则现有方法平均过于简单的顺序扫描。因此，我们基于近似的替代组织提出尽可能快地保持不可避免的连续扫描。我们描述了一种简单的向量近似方案，称为VA文件，并报告了这一点的实验评估和基于树的索引方法（AN r〜* -tree和X树）。

著录项

来源
《International conference on very large databases》|1998年||共12页
会议地点
作者
Roger Weber; Hans-J. Schek; Stephen Blott;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类各种专用数据库;
关键词

相似文献

外文文献
中文文献
专利

1. Reinforcement Learning in High-dimensional Continuous State Spaces - A State Space Compression Method Based on Multivariate Analysis - [J] . Hideki Satoh, 佐藤仁樹電子情報通信学会技術研究報告. 非線形問題. Nonlinear Problems . 2006,第547期

机译：高维连续状态空间的强化学习-一种基于多元分析的状态空间压缩方法-
2. A State Space Compression Method Based on Multivariate Analysis for Reinforcement Learning in High-Dimensional Continuous State Spaces [J] . Hideki SATOH IEICE Transactions on Fundamentals of Electronics, Communications and Computer Sciences . 2006,第8期

机译：基于多元分析的状态空间压缩方法在高维连续状态空间中的强化学习
3. Reinforcement Learning in High-dimensional Continuous State Spaces - A State Space Compression Method Based on Multivariate Analysis [J] . Hideki SATOH 電子情報通信学会技術研究報告. 非線形問題. Nonlinear Problems . 2005,第547期

机译：高维连续状态空间中的强化学习-基于多元分析的状态空间压缩方法
4. A Quantitative Analysis and Performance Study for Similarity-Search Methods in High-Dimensional Spaces [C] . Roger Weber, Hans-J. Schek, Stephen Blott International conference on very large databases . 1998

机译：高维空间中相似性搜索方法的定量分析与绩效研究
5. Intelligent motion planning and analysis with probabilistic roadmap methods for the study of complex and high-dimensional motions [D] . Tapia, Lydia 2009

机译：智能运动计划和概率路线图分析，用于研究复杂的高维运动
6. Improved bone bonding of hydroxyapatite spacers with a high porosity in a quantitative computed tomography‐image pixel analysis: A prospective 1‐year comparative study of the consecutive cohort undergoing double‐door cervical laminoplasty [O] . Yoshiki Takeoka, Takashi Yurube, Koichiro Maeno, 2020

机译：定量计算机断层扫描图像像素分析中具有高孔隙率的羟基磷灰石垫片的改善的骨结合性：连续队列接受双门颈椎椎体成形术的前瞻性一年比较研究
7. A State Space Compression Method Based on Multivariate Analysis for Reinforcement Learning in High-dimensional Continuous State Spaces [O] . 2006

机译：基于多元分析的状态空间压缩方法在高维连续状态空间中的强化学习

A Quantitative Analysis and Performance Study for Similarity-Search Methods in High-Dimensional Spaces

摘要

著录项

相似文献

相关主题

期刊订阅