...
首页> 外文期刊>Soft computing: A fusion of foundations, methodologies and applications >Evolutionary induction of a decision tree for large-scale data: a GPU-based approach
【24h】

Evolutionary induction of a decision tree for large-scale data: a GPU-based approach

机译:大规模数据的决策树进化诱导:基于GPU的方法

获取原文
获取原文并翻译 | 示例
           

摘要

Evolutionary induction of decision trees is an emerging alternative to greedy top-down approaches. Its growing popularity results from good prediction performance and less complex output trees. However, one of the major drawbacks associated with the application of evolutionary algorithms is the tree induction time, especially for large-scale data. In the paper, we design and implement a graphics processing unit (GPU)-based parallelization of evolutionary induction of decision trees. We apply a Compute Unified Device Architecture programming model, which supports general-purpose computation on a GPU (GPGPU). The selection and genetic operators are performed sequentially on a CPU, while the evaluation process for the individuals in the population is parallelized. The data-parallel approach is applied, and thus, the parts of a dataset are spread over GPU cores. Each core processes the assigned chunk of the data. Finally, the results from all GPU cores are merged and the sought tree metrics are sent to the CPU. Computational performance of the proposed approach is validated experimentally on artificial and real-life datasets. A comparison with the traditional CPU version shows that evolutionary induction of decision trees supported by GPGPU can be accelerated significantly (even up to 800 times) and allows for processing of much larger datasets.
机译:决策树的进化诱导是一种新兴的贪婪自上而下方法的替代方案。其越来越多的普及从良好的预测性能和更复杂的输出树产生。然而,与应用进化算法相关的主要缺点之一是树诱导时间,特别是对于大规模数据。在本文中,我们设计并实现了基于决策树的进化诱导的基于分布的图形处理单元(GPU)。我们应用了一个计算统一设备架构编程模型,支持GPU上的通用计算(GPGPU)。选择和遗传算子在CPU上顺序进行,而群体中的个体的评估过程是平行化的。应用数据并行方法,因此,数据集的部分遍布GPU核心。每个核心处理数据分配的数据块。最后,将所有GPU核心的结果合并,并将所寻求的树度量发送到CPU。所提出的方法的计算性能在人工和现实生活数据集上实验验证。与传统的CPU版本的比较显示,GPGPU支持的决策树的进化诱导可以显着加速(甚至高达800次),并允许处理更大的数据集。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号