首页> 外文会议>International Conference on Enterprise Information Systems >ANALYSIS OF THE ITERATED PROBABILISTIC WEIGHTED K NEAREST NEIGHBOR METHOD, A NEW DISTANCE-BASED ALGORITHM
【24h】

ANALYSIS OF THE ITERATED PROBABILISTIC WEIGHTED K NEAREST NEIGHBOR METHOD, A NEW DISTANCE-BASED ALGORITHM

机译:迭代概率加权k最近邻法,基于新的距离算法分析

获取原文

摘要

The k-Nearest Neighbor (k-NN) classification method assigns to an unclassified point the class of the nearest of a set of previously classified points. A problem that arises when aplying this technique is that each labeled sample is given equal importance in deciding the class membership of the pattern to be classified, regardless of the typicalness of each neighbor. We report on the application of a new hybrid version named Iterated Probabilistic Weighted k Nearest Neighbor algorithm (IPW-k-NN) which classifies new cases based on the probability distribution each case has to belong to each class. These probabilities are computed for each case in the training database according to the k Nearest Neighbors it has in this database; this is a new way to measure the typicalness of a given case with regard to every class. Experiments have been carried out using UCI Machine Learning Repository well-known databases and performing 10-fold cross-validation to validate the results obtained in each of them. Three different distances (Euclidean, Camberra and Chebychev) are used in the comparison done.
机译:k最近邻(k-nn)分类方法分配给未分类的点,最近的一组先前分类的点。在A1POORTING的情况下,出现的问题是每个标记的样本在决定要分类的模式的班级成员资格时,无论每个邻居的典型性如何,都会给出同等重要的。我们报告了一个名为迭代概率加权k最近邻域算法(IPW-k-Nn)的新的混合版本的应用程序,该概率基于概率分布对每个类​​别属于每个类的新案例。根据它在该数据库中的K最近邻居,在培训数据库中计算这些概率;这是一种衡量给定案例的典型性的新方法。使用UCI机器学习存储库众所周知的数据库进行实验,并执行10倍的交叉验证以验证在每个中获得的结果。在比较中使用了三个不同的距离(欧几里德,Camberra和Chebychev)。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号