首页> 外文会议>Advances in Knowledge Discovery and Data Mining >k-nearest Neighbor Classification on Spatial Data Streams Using P-trees
【24h】

k-nearest Neighbor Classification on Spatial Data Streams Using P-trees

机译:使用P树在空间数据流上进行k最近邻分类

获取原文

摘要

Classification of spatial data streams is crucial, since the training dataset changes often. Building a new classifier each time can be very costly with most techniques. In this situation, k-nearest neighbor (KNN) classification is a very good choice, since no residual classifier needs to be built ahead of time. KNN is extremely simple to implement and lends itself to a wide variety of variations. We propose a new method of KNN classification for spatial data using a new, rich, data-mining-ready structure, the Peano-count-tree (P-tree). We merely perform some AND/OR operations on P-trees to find the nearest neighbors of a new sample and assign the class label. We have fast and efficient algorithms for the AND/OR operations, which reduce the classification time significantly. Instead of taking exactly the k nearest neighbors we form a closed-KNN set. Our experimental results show closed-KNN yields higher classification accuracy as well as significantly higher speed.
机译:空间数据流的分类至关重要,因为训练数据集经常更改。对于大多数技术而言,每次构建一个新的分类器都可能会非常昂贵。在这种情况下,k近邻(KNN)分类是一个很好的选择,因为不需要提前构建任何残差分类器。 KNN的实现非常简单,并且可以实现多种变体。我们提出了一种新的空间数据KNN分类方法,该方法采用了一种新的,丰富的数据挖掘就绪结构,即Peano-count-tree(P-tree)。我们仅对P树执行一些AND / OR操作,以查找新样本的最近邻居并分配类别标签。我们拥有用于AND / OR运算的快速高效算法,可显着减少分类时间。而不是精确地获取k个最近的邻居,我们形成一个封闭的KNN集。我们的实验结果表明,封闭式KNN具有更高的分类精度以及明显更高的速度。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号