【24h】

Unsupervised feature selection based on clustering

机译:基于聚类的无监督特征选择

获取原文
获取外文期刊封面目录资料

摘要

Feature selection plays an important part in improving the classification accuracy and the quality of clustering in many applications. Feature selection has been widely studied in supervised learning, but in unsupervised learning it is still relatively rare. In this paper, a novel definition of feature differentiation for identifying (determining) the relatively important features is presented, and a one-pass clustering-based feature selection approach is introduced. The new method with nearly linear time complexity selects the optimal subset according to the variation of the feature differentiation. Experimental results on UCI datasets show that our method, by removing the irrelevant or redundant features, can achieve promising classification and clustering results for most datasets. Compared with other traditional feature selection approaches the proposed algorithm has obtained similar or even better performance in terms of dimensionality reduction and classification accuracy.
机译:在许多应用中,特征选择在提高分类准确度和聚类质量方面起着重要作用。特征选择已在监督学习中得到了广泛的研究,但是在无监督学习中,它仍然相对较少。在本文中,提出了一种新的特征区分定义,用于识别(确定)相对重要的特征,并介绍了一种基于一遍聚类的特征选择方法。具有近似线性时间复杂度的新方法根据特征差异的变化选择最佳子集。在UCI数据集上的实验结果表明,通过删除不相关或多余的特征,我们的方法可以为大多数数据集实现有希望的分类和聚类结果。与其他传统特征选择方法相比,该算法在降维和分类精度方面获得了相似甚至更好的性能。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号