首页> 外文会议>Pacific-Asia conference on knowledge discovery and data mining >Efficient Approximate Algorithms for the Closest Pair Problem in High Dimensional Spaces
【24h】

Efficient Approximate Algorithms for the Closest Pair Problem in High Dimensional Spaces

机译:高维空间中最接近对问题的高效近似算法

获取原文
获取外文期刊封面目录资料

摘要

The Closest Pair Problem (CPP) is one of the fundamental problems that has a wide range of applications in data mining, such as unsupervised data clustering, user pattern similarity search, etc. A number of exact and approximate algorithms have been proposed to solve it in the low dimensional space. In this paper, we address the problem when the metric space is of a high dimension. For example, the drug-target or movie-user interaction data could contain as many as hundreds of features. To solve this problem under the ℓ_2 norm, we present two novel approximate algorithms. Our algorithms are based on the novel idea of projecting the points into the real line. We prove high probability bounds on the run time and accuracy for both of the proposed algorithms. Both algorithms are evaluated via comprehensive experiments and compared with existing best-known approaches. The experiments reveal that our proposed approaches outperform the existing methods.
机译:最近对问题(CPP)是在数据挖掘中具有广泛应用的基本问题之一,例如无监督数据聚类,用户模式相似性搜索等。已提出了许多精确和近似算法来解决该问题。在低维空间中。在本文中,我们解决了度量空间为高维时的问题。例如,毒品目标或电影与用户的互动数据可能包含多达数百个特征。为了解决ℓ_2范数下的这个问题,我们提出了两种新颖的近似算法。我们的算法基于将点投影到实线上的新颖​​思想。我们证明了两种算法在运行时间和准确性上的高概率边界。两种算法均通过全面的实验进行了评估,并与现有的最佳方法进行了比较。实验表明,我们提出的方法优于现有方法。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号