首页> 外文期刊>Decision support systems >A novel similarity classifier with multiple ideal vectors based on k-means clustering
【24h】

A novel similarity classifier with multiple ideal vectors based on k-means clustering

机译:基于k均值聚类的具有多个理想向量的新型相似度分类器

获取原文
获取原文并翻译 | 示例
       

摘要

In the literature, researchers and practitioners can find a manifold of algorithms to perform a classification task. The similarity classifier is one of the more recently suggested classification algorithms. In this paper, we suggest a novel similarity classifier with multiple ideal vectors per class that are generated with k-means clustering in combination with the jump method. Two approaches for pre-processing, via simple standardization and via principal component analysis in combination with the MAP test and Parallel Analysis, are presented. On the artificial data sets, the novel classifier with standardization and with transformation power Y = 1 for the jump method results in significantly higher mean classification accuracies than the standard classifier. The results of the artificial data sets demonstrate that in contrast to the standard similarity classifier, the novel approach has the ability to cope with more complex data structures. For the real-world credit data sets, the novel similarity classifier with standardization and Y = 1 achieves competitive results or even outperforms the k-nearest neighbour classifier, the Naive Bayes algorithm, decision trees, random forests and the standard similarity classifier.
机译:在文献中,研究人员和从业人员可以找到多种算法来执行分类任务。相似度分类器是最近提出的分类算法之一。在本文中,我们提出了一种新颖的相似性分类器,该分类器具有每类多个理想向量,这些向量是通过k-means聚类结合跳转方法生成的。提出了两种预处理方法,分别是简单的标准化和主成分分析,以及MAP测试和并行分析。在人工数据集上,具有跳跃方法的标准化和转换能力Y = 1的新型分类器比标准分类器具有更高的平均分类精度。人工数据集的结果表明,与标准相似性分类器相比,该新方法具有应付更复杂数据结构的能力。对于现实世界的信用数据集,具有标准化且Y = 1的新颖相似度分类器可达到竞争结果,甚至优于k最近邻分类器,朴素贝叶斯算法,决策树,随机森林和标准相似度分类器。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号