首页> 外文期刊>Expert Systems with Application >Nearest neighbor classification of categorical data by attributes weighting
【24h】

Nearest neighbor classification of categorical data by attributes weighting

机译:通过属性加权对分类数据进行最近邻分类

获取原文
获取原文并翻译 | 示例
       

摘要

Subspace classification of categorical data is an essential process for many real-world applications such as computer-aided medical diagnosis and collaborative recommendation. The nearest neighbor classifiers have sparked wide interest from these applications because of their simplicity and flexibility. However, they become ineffective when applied to categorical data, due to the lack of a well-defined distance measure used to compute dissimilarities between categorical samples in the projected subspaces. In this paper, we tackle the problem by defining a series of weighted distance functions for categorical attributes, and applying them to derive new nearest neighbor classifiers. Four attribute-weighting measures are proposed, with two defined on global feature-ranking approaches while the other two on local approaches. The experimental results conducted on real categorical data sets demonstrate that all four classifiers outperform consistently the traditional methods, and show the suitability of the proposal for the real applications in terms of automated feature selection.
机译:分类数据的子空间分类是许多现实应用(例如计算机辅助医疗诊断和协作推荐)中必不可少的过程。由于它们的简单性和灵活性,最近的邻居分类器引起了这些应用的广泛兴趣。但是,由于缺少用于计算投影子空间中分类样本之间差异的明确定义的距离度量,因此将它们应用于分类数据时变得无效。在本文中,我们通过为分类属性定义一系列加权距离函数并将其应用到新的最近邻分类器上来解决该问题。提出了四个属性加权度量,其中两个定义在全局特征排名方法上,而另外两个在局部方法上定义。在真实分类数据集上进行的实验结果表明,所有四个分类器的性能均优于传统方法,并且从自动特征选择的角度证明了该建议对于实际应用的适用性。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号