首页> 外文期刊>Expert Systems with Application >A generalized mean distance-based k-nearest neighbor classifier
【24h】

A generalized mean distance-based k-nearest neighbor classifier

机译:基于广义平均距离的k最近邻分类器

获取原文
获取原文并翻译 | 示例
获取外文期刊封面目录资料

摘要

K-nearest neighbor (KNN) rule is a well-known non-parametric classifier that is widely used in pattern recognition. However, the sensitivity of the neighborhood size k always seriously degrades the KNN-based classification performance, especially in the case of the small sample size with the existing outliers. To overcome this issue, in this article we propose a generalized mean distance-based k-nearest neighbor classifier (GMDKNN) by introducing multi-generalized mean distances and the nested generalized mean distance that are based on the characteristic of the generalized mean. In the proposed method, multi-local mean vectors of the given query sample in each class are calculated by adopting its class-specific k nearest neighbors. Using the achieved k local mean vectors per class, the corresponding k generalized mean distances are calculated and then used to design the categorical nested generalized mean distance. In the classification phase, the categorical nested generalized mean distance is used as the classification decision rule and the query sample is classified into the class with the minimum nested generalized mean distance among all the classes. Extensive experiments on the UCI and KEEL data sets, synthetic data sets, the KEEL noise data sets and the UCR time series data sets are conducted by comparing the proposed method to the state-of-art KNN-based methods. The experimental results demonstrate that the proposed GMDKNN performs better and has the less sensitiveness to k. Thus, our proposed GMDKNN with the robust and effective classification performanrP could-be a promising method for-pattern-recognition-in some expert and intelligence systems. (C) 2018 Elsevier Ltd. All rights reserved.
机译:K最近邻(KNN)规则是众所周知的非参数分类器,广泛用于模式识别。但是,邻域大小k的敏感度始终会严重降低基于KNN的分类性能,尤其是在样本量较小且存在异常值的情况下。为了克服这个问题,在本文中,我们通过引入基于广义均值特征的多重广义均值距离和嵌套广义均值距离,提出了一种基于广义均值距离的k最近邻分类器(GMDKNN)。在提出的方法中,通过采用特定于类别的k个最近邻居来计算每个类别中给定查询样本的多局部均值向量。使用每个类别获得的k个局部平均向量,计算相应的k个广义平均距离,然后将其用于设计分类嵌套的广义平均距离。在分类阶段,将分类嵌套的广义平均距离用作分类决策规则,并将查询样本分类为所有类别中嵌套最小化的平均平均距离最小的类别。通过将提出的方法与基于KNN的最新方法进行比较,对UCI和KEEL数据集,合成数据集,KEEL噪声数据集和UCR时间序列数据集进行了广泛的实验。实验结果表明,提出的GMDKNN性能更好,对k的敏感性较小。因此,我们提出的具有强大而有效的分类性能的GMDKNN在某些专家和智能系统中可能是一种有前途的模式识别方法。 (C)2018 Elsevier Ltd.保留所有权利。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号