首页> 外文会议>International Conference on Applied Mathematics, Modelling and Statistics Application >Feature Selection on the Basis of Rough Set Theory and Univariate Marginal Distribution Algorithm
【24h】

Feature Selection on the Basis of Rough Set Theory and Univariate Marginal Distribution Algorithm

机译:基于粗糙集理论和单变量边际分布算法的特征选择

获取原文

摘要

Feature selection is an important preprocessing step in machine learning. The aim of feature selection is to find an optimal subset from original features that satisfies a criterion. Rough set theory (RST) is one of the most effective ways to solve feature selection problem, but RST is inefficient in large scale datasets. In order to solve this problem, in this paper, we proposed a novel feature selection algorithm RSUMDA on the basis of univariate marginal distribution algorithm. RST was used to obtain the significance of each feature as the original probability of UMDA and then UMDA was to search the optimal feature subset that using the number of the selected feature and the accuracy of the classifier as fitness function. Experimentation was carried out in 4 UCI datasets. The results showed that our algorithm could effectively reduce the number of the features, improve the accuracy of the classifier and quicken the convergence rate.
机译:特征选择是机器学习的重要预处理步骤。特征选择的目的是从满足标准的原始功能找到最佳子集。粗糙集理论(RST)是解决特征选择问题的最有效的方法之一,但RST在大型数据集中效率低。为了解决这个问题,在本文中,我们提出了一种基于单变量边际分布算法的新特征选择算法RSUMDA。 RST用于获得每个特征作为UMDA的原始概率的每个特征的重要性,然后UMDA是搜索使用所选特征的数量和分类器的准确性作为适合度功能的最佳特征子集。实验是在4个UCI数据集中进行的。结果表明,我们的算法可以有效地减少特征的数量,提高分类器的准确性并加快收敛速度​​。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号