首页> 外文会议>International Computer Engineering Conference >K-Means Tree for Fast Furthest Neighbor Approximation
【24h】

K-Means Tree for Fast Furthest Neighbor Approximation

机译:K-meace树快速最远邻近近似

获取原文
获取外文期刊封面目录资料

摘要

Searching for the furthest neighbor in a given dataset is a linear time complexity problem. This complexity rises to be quadratic when we need to find the furthest neighbor for each point (example) in a dataset. That is, the brute force algorithm takes O(n2) to find the furthest neighbor for all points. Such an algorithm is computationally expensive, particularly when the number of samples n in a dataset is large. In this paper, we introduce an approximate tree-based searching method mainly to reduce the time complexity of the search. The proposed method recursively utilizes the k-means approach in order to split the data into sub-groups and then arranges them as a tree structure. Using such a structure, the searching process consumes O(log(n)) to find the approximated furthest neighbor from a given example; and O(nlog(n)) to find it for all examples in the dataset. Our experiments show that the proposed method is reliable and efficient in approximating the furthest neighbor, therefore, can be used in practice, particularly for big data.
机译:在给定数据集中搜索最远邻居是一个线性时间复杂性问题。当我们需要在数据集中找到每个点(示例)的最远邻居时,这种复杂性升高。也就是说,蛮力算法需要o(n 2 )找到所有要点的最远的邻居。这种算法是计算昂贵的,特别是当数据集中的样本N的数量大时。在本文中,我们介绍了一种主要的基于树的搜索方法,主要是为了减少搜索的时间复杂性。所提出的方法递归地利用K-ulit方法,以便将数据分成子组,然后将它们排列为树结构。使用这样的结构,搜索过程消耗O(log(n))以从给定示例找到近似的最远邻居;和o(nlog(n))为数据集中的所有示例找到它。我们的实验表明,该方法可靠且有效地在近似最远邻居,因此可以在实践中使用,特别是对于大数据。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号