首页> 外文会议>Proceedings of the Twenty-third international conference on very large data bases >M-tree: An Efficient Access Method for Similarity Search in Metric Spaces
【24h】

M-tree: An Efficient Access Method for Similarity Search in Metric Spaces

机译:M树:度量空间中相似性搜索的一种有效访问方法

获取原文
获取原文并翻译 | 示例

摘要

A new access method, called M-tree, is proposed to organize and search large data sets from a generic "metric space", i.e. where object proximity is only defined by a distance function satisfying the positivity, symmetry, and triangle inequality postulates. We detail algorithms for insertion of objects and split management, which keep the M-tree always balanced - several heuristic split alternatives are considered and experimentally evaluated. Algorithms for similarity (range and k-nearest neighbors) queries are also described. Results from extensive experimentation with a prototype system are reported, considering as the performance criteria the number of page I/O's and the number of distance computations. The results demonstrate that the M-tree indeed extends the domain of applicability beyond the traditional vector spaces, performs reasonably well in high-dimensional data spaces, and scales well in case of growing files.
机译:提出了一种新的访问方法,称为M树,用于从通用“度量空间”组织和搜索大型数据集,即对象接近度仅由满足正,对称和三角不等式假设的距离函数定义。我们详细介绍了对象插入和拆分管理的算法,这些算法使M树始终保持平衡-考虑并实验评估了几种启发式拆分方案。还描述了用于相似性(范围和k最近邻)查询的算法。报告了使用原型系统进行广泛实验的结果,其中将页I / O的数量和距离计算的数量作为性能标准。结果表明,M树确实将适用范围扩展到了传统向量空间之外,在高维数据空间中表现良好,并且在文件增长的情况下可很好地扩展。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号