K-Means Tree for Fast Furthest Neighbor Approximation

机译：K-meace树快速最远邻近近似

获取原文

获取外文期刊封面目录资料

页面导航

摘要
著录项
引文网络
相似文献
相关主题

摘要

Searching for the furthest neighbor in a given dataset is a linear time complexity problem. This complexity rises to be quadratic when we need to find the furthest neighbor for each point (example) in a dataset. That is, the brute force algorithm takes O(n^{2) to find the furthest neighbor for all points. Such an algorithm is computationally expensive, particularly when the number of samples n in a dataset is large. In this paper, we introduce an approximate tree-based searching method mainly to reduce the time complexity of the search. The proposed method recursively utilizes the k-means approach in order to split the data into sub-groups and then arranges them as a tree structure. Using such a structure, the searching process consumes O(log(n)) to find the approximated furthest neighbor from a given example; and O(nlog(n)) to find it for all examples in the dataset. Our experiments show that the proposed method is reliable and efficient in approximating the furthest neighbor, therefore, can be used in practice, particularly for big data.}

机译：在给定数据集中搜索最远邻居是一个线性时间复杂性问题。当我们需要在数据集中找到每个点（示例）的最远邻居时，这种复杂性升高。也就是说，蛮力算法需要o（n^{2 ）找到所有要点的最远的邻居。这种算法是计算昂贵的，特别是当数据集中的样本N的数量大时。在本文中，我们介绍了一种主要的基于树的搜索方法，主要是为了减少搜索的时间复杂性。所提出的方法递归地利用K-ulit方法，以便将数据分成子组，然后将它们排列为树结构。使用这样的结构，搜索过程消耗O（log（n））以从给定示例找到近似的最远邻居;和o（nlog（n））为数据集中的所有示例找到它。我们的实验表明，该方法可靠且有效地在近似最远邻居，因此可以在实践中使用，特别是对于大数据。}

著录项

来源
《International Computer Engineering Conference》|2020年|77-82|共6页
会议地点
作者
Ahmad S. Tarawneh; Ahmad B. Hassanat; Issam Elkhadiri; Dmitry Chetverikov; Malek Alrashidi;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类
关键词
Visualization; Force; Clustering algorithms; Big Data; Approximation algorithms; Time complexity; Testing;

机译：可视化;力量;聚类算法;大数据;近似算法;时间复杂性;测试;

相似文献

外文文献
中文文献
专利

1. Furthest-Pair-Based Binary Search Tree for Speeding Big Data Classification Using K-Nearest Neighbors [J] . Hassanat Ahmad B. A. Big Data . 2018,第3期

机译：基于最远对的二进制搜索树，用于使用K最近邻加速大数据分类
2. Exploiting the structure of furthest neighbor search for fast approximate results [J] . Ryan R. Curtin, Javier Echauz, Andrew B. Gardner Information Systems . 2019,第FEBa期

机译：利用最远邻居搜索的结构获得快速的近似结果
3. A fast minimum spanning tree algorithm based on K-means [J] . Zhong Caiming, Malinen Mikko, Miao Duoqian, Information Sciences: An International Journal . 2015,第Null期

机译：基于K均值的快速最小生成树算法
4. Fast Approximate Furthest Neighbors with Data-Dependent Candidate Selection [C] . Ryan R. Curtin, Andrew B. Gardner International conference on similarity search and applications . 2016

机译：与数据相关的候选选择的快速近似最远邻居
5. Improving the approximation ratio of the maximum agreement forest (MAF) on k trees and estimating the approximation ratio of the acyclic-MAF on k trees. [D] . Bhabak, Puspal. 2011

机译：改进k棵树上最大一致性森林（MAF）的近似比率，并估计k棵树上无环MAF的近似比率。
6. A Fast Exact k-Nearest Neighbors Algorithm for High Dimensional Search Using k-Means Clustering and Triangle Inequality [O] . Xueyi Wang -1

机译：快速精确最近邻居法高维搜索使用K-均值聚类和三角不等式
7. Furthest-Pair-Based Binary Search Tree for Speeding Big Data Classification Using K-Nearest Neighbors [O] . Ahmad B.A. Hassanat 2018

机译：基于基于的基于对的二进制搜索树，用于使用k-intelt邻居加速大数据分类

K-Means Tree for Fast Furthest Neighbor Approximation

摘要

著录项

引文网络

相似文献

相关主题

期刊订阅