首页> 美国卫生研究院文献>other >Two-point-based binary search trees for accelerating big data classification using KNN

【2h】

Two-point-based binary search trees for accelerating big data classification using KNN

机译：基于两点的二进制搜索树用于使用KNN加速大数据分类

代理获取

本网站仅为用户提供外文OA文献查询和代理获取服务，本网站没有原文。下单后我们将采用程序或人工为您竭诚获取高质量的原文，但由于OA文献来源多样且变更频繁，仍可能出现获取不到、文献不完整或与标题不符等情况，如果获取不到我们将提供退款服务。请知悉。

页面导航

摘要
著录项
相似文献
相关主题

摘要

Big data classification is very slow when using traditional machine learning classifiers, particularly when using a lazy and slow-by-nature classifier such as the k-nearest neighbors algorithm (KNN). This paper proposes a new approach which is based on sorting the feature vectors of training data in a binary search tree to accelerate big data classification using the KNN approach. This is done using two methods, both of which utilize two local points to sort the examples based on their similarity to these local points. The first method chooses the local points based on their similarity to the global extreme points, while the second method chooses the local points randomly. The results of various experiments conducted on different big datasets show reasonable accuracy rates compared to state-of-the-art methods and the KNN classifier itself. More importantly, they show the high classification speed of both methods. This strong trait can be used to further improve the accuracy of the proposed methods.

机译：当使用传统的机器学习分类器时，大数据分类非常慢，尤其是当使用诸如k近邻算法（KNN）的懒惰和慢速分类器时。本文提出了一种新的方法，该方法基于在二叉搜索树中对训练数据的特征向量进行排序以使用KNN方法加速大数据分类。这使用两种方法完成，两种方法都利用两个局部点根据与这些局部点的相似性对示例进行排序。第一种方法是根据局部点与全局极端点的相似性来选择，而第二种方法则是随机选择局部点。与最先进的方法和KNN分类器本身相比，在不同的大型数据集上进行的各种实验的结果显示出合理的准确率。更重要的是，它们显示了这两种方法的高分类速度。这种强大的特性可以用来进一步提高所提出方法的准确性。

著录项

期刊名称 other
作者
Ahmad B. A. Hassanat;
展开▼
作者单位

展开▼
年(卷),期 -1(13),11
年度 -1
页码 e0207772
总页数 15
原文格式 PDF
正文语种
中图分类
关键词

相似文献

外文文献
中文文献
专利

1. Norm-Based Binary Search Trees for Speeding Up KNN Big Data Classification [J] . Ahmad B. A. Hassanat Computers . 2018,第4期

机译：基于范数的二进制搜索树，用于加快KNN大数据分类
2. Furthest-Pair-Based Binary Search Tree for Speeding Big Data Classification Using K-Nearest Neighbors [J] . Hassanat Ahmad B. A. Big Data . 2018,第3期

机译：基于最远对的二进制搜索树，用于使用K最近邻加速大数据分类
3. Compressed Binary Bit Trees: A New Data Structure For Accelerating Database Searching [J] . Smellie A Journal of chemical information and modeling . 2009,第2期

机译：压缩二进制位树：用于加速数据库搜索的新数据结构
4. A New Fuzzy Decision Tree Classification Method for Mining High-Speed Data Streams Based on Binary Search Trees [C] . Zhoujun Li, Tao Wang, Ruoxue Wang, Annual International Workshop on Frontiers in Algorithmics . 2007

机译：一种基于二元搜索树的高速数据流的新模糊决策树分类方法
5. Data structures, binary search trees: A study of random Weyl trees [D] . Goudjil, Amar 1999

机译：数据结构，二叉搜索树：随机Weyl树的研究
6. Improving GPU-accelerated adaptive IDW interpolation algorithm using fast kNN search [O] . Gang Mei, Nengxiong Xu, Liangliang Xu -1

机译：使用快速kNN搜索改进GPU加速的自适应IDW插值算法
7. Two-point-based binary search trees for accelerating big data classification using KNN [O] . Ahmad B. A. Hassanat 2018

机译：基于两点的二进制搜索树，用于加速使用KNN加速大数据分类
8. Pipeline synthetic aperture radar data compression utilizing systolic binary tree-searched architecture for vector quantization [R] . 1995

机译：利用收缩二叉树搜索结构进行矢量量化的流水线合成孔径雷达数据压缩

Two-point-based binary search trees for accelerating big data classification using KNN

摘要

著录项

相似文献

相关主题

期刊订阅