首页> 外文会议>Pacific Rim international conference on artificial intelligence >Fuzzy Rough Based Feature Selection by Using Random Sampling
【24h】

Fuzzy Rough Based Feature Selection by Using Random Sampling

机译:基于随机抽样的模糊粗糙集特征选择

获取原文

摘要

Feature selection, i.e., Attribute reduction, is one of the most important applications of fuzzy rough set theory. The application of attribute reduction based on fuzzy rough set is inefficient or even unfeasible on large scale data. Considering the random sampling technique is an effective method to statistically reduce the calculation on large scale data, we introduce it into the fuzzy rough based feature selection algorithm. This paper thus proposes a random reduction algorithm based on random sampling. The main contribution of this paper is the introduction of the idea of random sampling in the selection of attributes based on minimum redundancy and maximum correlation. First, in each iteration the significance of attribute is not computed on all the objects in the whole datasets, but on part of randomly selected objects. By this way, the maximum relevant attribute is chosen on the condition of less calculation. Secondly, in the process of choosing attribute in each iteration, the sample is different so as to select the minimum redundancy attribute. Finally, the experimental results show that the reduction algorithm can obviously reduce the running time of the reduction algorithm on the condition of limited classification accuracy loss.
机译:特征选择,即属性约简,是模糊粗糙集理论的最重要应用之一。在大型数据上,基于模糊粗糙集的属性约简的应用效率低下甚至不可行。考虑到随机抽样技术是一种统计上减少大规模数据计算的有效方法,我们将其引入基于模糊粗糙集的特征选择算法。因此,本文提出了一种基于随机采样的随机约简算法。本文的主要贡献是在基于最小冗余和最大相关性的属性选择中引入了随机抽样的思想。首先,在每次迭代中,属性的重要性不是在整个数据集中的所有对象上计算的,而是在随机选择的对象的一部分上计算的。通过这种方式,在较少计算的情况下选择最大相关属性。其次,在每次迭代中选择属性的过程中,样本是不同的,以选择最小冗余属性。最后,实验结果表明,在分类精度损失有限的情况下,该约简算法可以明显减少该约简算法的运行时间。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号