...
首页> 外文期刊>Knowledge-Based Systems >Location difference of multiple distances based kappa-nearest neighbors algorithm
【24h】

Location difference of multiple distances based kappa-nearest neighbors algorithm

机译:基于kappa-最近邻算法的多距离位置差

获取原文
获取原文并翻译 | 示例
           

摘要

kappa-nearest neighbors (kNN) classifiers are commonly used in various applications due to their relative simplicity and the absence of necessary training. However, the time complexity of the basic algorithm is quadratic, which makes them inappropriate for large scale datasets. At the same time, the performance of most improved algorithms based on tree structures decreases rapidly with increase in dimensionality of dataset, and tree structures have different complexity in different datasets. In this paper, we introduce the concept of "location difference of multiple distances, and use it to measure the difference between different data points. In this way, location difference of multiple distances based nearest neighbors searching algorithm (LDMDBA) is proposed. LDMDBA has a time complexity of O(logdnlogn) and does not rely on a search tree. This makes LDMDBA the only kNN method that can be efficiently applied to high dimensional data and has very good stability on different datasets. In addition, most of the existing methods have a time complexity of O(n) to predict a data point outside the dataset By contrast, LDMDBA has a time complexity of O(logdlogn) to predict a query point in datasets of different dimensions, and, therefore, can be applied in real systems and large scale databases. The effectiveness and efficiency of LDMDBA are demonstrated in experiments involving public and artificial datasets. (C) 2015 Elsevier B.V. All rights reserved.
机译:由于其相对简单并且没有必要的训练,因此kappa最近邻(kNN)分类器通常用于各种应用程序中。但是,基本算法的时间复杂度是二次的,这使其不适用于大规模数据集。同时,大多数基于树结构的改进算法的性能随着数据集维数的增加而迅速下降,并且树结构在不同数据集中具有不同的复杂度。在本文中,我们引入了“多距离位置差异”的概念,并用它来度量不同数据点之间的差异。以此为基础,提出了基于多距离位置差异的最近邻搜索算法(LDMDBA)。 O(logdnlogn)的时间复杂度并且不依赖于搜索树,这使LDMDBA成为唯一一种可以有效地应用于高维数据并且在不同数据集上具有非常好的稳定性的kNN方法。具有O(n)的时间复杂度以预测数据集外部的数据点相比之下,LDMDBA具有O(logdlogn)的时间复杂度以预测不同维度的数据集中的查询点,因此可以实际应用系统和大型数据库。LDMDBA的有效性和效率在涉及公共和人工数据集的实验中得到了证明(C)2015 Elsevier BV保留所有权利。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号