...
首页> 外文期刊>Applied Soft Computing >Feature selection based on artificial bee colony and gradient boosting decision tree
【24h】

Feature selection based on artificial bee colony and gradient boosting decision tree

机译:基于人工蜜蜂殖民地和梯度提升决策树的特征选择

获取原文
获取原文并翻译 | 示例
   

获取外文期刊封面封底 >>

       

摘要

Data from many real-world applications can be high dimensional and features of such data are usually highly redundant. Identifying informative features has become an important step for data mining to not only circumvent the curse of dimensionality but to reduce the amount of data for processing. In this paper, we propose a novel feature selection method based on bee colony and gradient boosting decision tree aiming at addressing problems such as efficiency and informative quality of the selected features. Our method achieves global optimization of the inputs of the decision tree using the bee colony algorithm to identify the informative features. The method initializes the feature space spanned by the dataset. Less relevant features are suppressed according to the information they contribute to the decision making using an artificial bee colony algorithm. Experiments are conducted with two breast cancer datasets and six datasets from the public data repository. Experimental results demonstrate that the proposed method effectively reduces the dimensions of the dataset and achieves superior classification accuracy using the selected features. (C) 2018 Elsevier B.V. All rights reserved.
机译:来自许多真实世界应用的数据可以是高维度的,并且这些数据的特征通常是高度冗余的。识别信息特征已成为数据挖掘的重要步骤,不仅规避维度的诅咒,而是减少处理的数据量。在本文中,我们提出了一种基于蜜蜂殖民地和渐变升压决策树的新颖特征选择方法,旨在解决所选功能的效率和信息性质量等问题。我们的方法使用BEE菌落算法来识别信息特征的全局优化决策树的输入。该方法初始化数据集跨越的特征空间。根据使用人造蜜蜂菌落算法的决策,根据他们有助于决策的信息抑制了较少的相关特征。实验用两个乳腺癌数据集和来自公共数据存储库的六个数据集进行。实验结果表明,所提出的方法有效地减少了数据集的尺寸,并使用所选功能实现卓越的分类精度。 (c)2018 Elsevier B.v.保留所有权利。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号