...
首页> 外文期刊>International journal of applied evolutionary computation >Density and Distance Based KNN Approach to Classification
【24h】

Density and Distance Based KNN Approach to Classification

机译:基于密度和距离的KNN分类方法

获取原文
获取原文并翻译 | 示例
   

获取外文期刊封面封底 >>

       

摘要

KNN algorithm is a simple and efficient algorithm developed to solve classification problems. However, it encounters problems when classifying datasets with non-uniform density distributions. The existing KNN voting mechanism may lose essential information by considering majority only and get degraded performance when a dataset has uneven distribution. The other drawback comes from the way that KNN treat all the participating candidates equally when judging upon one test datum. To overcome the weaknesses of KNN, a Region of Influence Based KNN (RI-KNN) is proposed. RI-KNN computes for each training datum region of influence information based on their nearby data (i.e. locality information) so that each training datum can encode some locality information from its region. Information coming from both training and testing stages will contribute to the formation of weighting formula. By solving these two problems, RI-KNN is shown to outperform KNN upon several artificial datasets and real datasets without sacrificing time cost much in nearly all tested datasets.
机译:KNN算法是一种为解决分类问题而开发的简单高效的算法。但是,在对密度分布不​​均匀的数据集进行分类时会遇到问题。现有的KNN投票机制可能仅考虑多数就可能丢失重要信息,并且在数据集分布不均时会降低性能。另一个缺点来自于KNN在判断一个测试数据时平等对待所有参与的候选人的方式。为了克服KNN的弱点,提出了一种基于影响区域的KNN(RI-KNN)。 RI-KNN根据其附近的数据(即位置信息)为每个训练数据计算影响信息区域,以便每个训练数据都可以对来自其区域的一些位置信息进行编码。来自训练和测试阶段的信息将有助于加权公式的形成。通过解决这两个问题,在几乎所有测试数据集上,RI-KNN在几个人工数据集和真实数据集上均表现出优于KNN的效果,而不会花费太多时间。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号