首页> 外文会议>International conference on computer aided systems theory >Feature Selection for Unsupervised Learning via Comparison of Distance Matrices
【24h】

Feature Selection for Unsupervised Learning via Comparison of Distance Matrices

机译:通过距离矩阵的比较,无监督学习的特征选择

获取原文

摘要

Feature selection for unsupervised learning is generally harder than for supervised learning, because the former lacks the class information of the latter, and thus an obvious way by which to measure the quality of a feature subset. In this paper, we propose a new method based on representing data sets by their distance matrices, and judging feature combinations by how well the distance matrix using only these features resembles the distance matrix of the full data set. Using articial data for which the relevant features were known, we observed that the results depend on the data dimensionality, the fraction of relevant features, the overlap between clusters in the relevant feature subspaces, and how to measure the similarity of distance matrices. Our method consistently achieved higher than 80% detection rates of relevant features for a wide variety of experimental configurations.
机译:无监督学习的特征选择通常比监督学习更难,因为前者缺乏后者的课堂信息,因此可以测量特征子集的质量的明显方式。在本文中,我们提出了一种基于其距离矩阵表示数据集的新方法,并通过仅使用这些特征的距离矩阵的判断特征组合类似于完整数据集的距离矩阵。使用相关特征的曲面数据,我们观察到结果取决于数据维度,相关特征的分数,相关特征子空间中的簇之间的重叠,以及如何测量距离矩阵的相似性。我们的方法始终如一地实现了各种实验配置的相关特征的检测率高于80%。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号