首页> 外文会议>International Conference on Machine Learning, Optimization, and Data Science >Using Clustering for Supervised Feature Selection to Detect Relevant Features
【24h】

Using Clustering for Supervised Feature Selection to Detect Relevant Features

机译:使用群集进行监督功能选择以检测相关功能

获取原文

摘要

In many applications in machine learning, large quantities of features and information are available, but these can be of low quality. A novel filter method for feature selection for classification termed COLD is presented that uses class-wise clustering to reduce the dimensionality of the data. The idea behind this approach is that if a relevant feature would be removed from the set of features, the separation of clusters belonging to different classes will deteriorate. Four artificial examples and two real-world data sets are presented on which COLD is compared with several popular filter methods. For the artificial examples, only COLD is capable to consistently rank the features according to their contribution to the separation of the classes. For the real-world Dermatology and Arrhythmia dataset, COLD demonstrates the ability to remove a large number of features and improve the classification accuracy or, at a minimum, not degrade the performance considerably.
机译:在机器学习中的许多应用中,可以使用大量的特性和信息,但这些功能可以很低。提出了一种用于分类的特征选择的新颖滤波器方法,其使用类 - WISE聚类来减少数据的维度。这种方法背后的想法是,如果将从该集合中删除相关功能,则属于不同类别的群集的分离将恶化。介绍了四个人工例子和两个现实数据集,其中与几种流行的过滤方法进行了比较了寒冷。对于人工示例,只有冷能够根据它们对类别的分离的贡献始终如一地将特征级别。对于真实世界的皮肤病学和心律失常数据集,COLD展示了消除大量特征并提高分类精度的能力,或者最小不会降低性能。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号