首页> 外文OA文献 >Random forest training on reconfigurable hardware
【2h】

Random forest training on reconfigurable hardware

机译:关于可重构硬件的随机森林培训

摘要

Random Forest (RF) is one of the most widely used supervised learning methods available. An RF is ensemble of decision tree classifiers with injection of several sources of randomness. It demonstrates a set of improvement over single decision and regression trees and is comparable or superior to major classification tools such as support vector machine (SVM) and adaptive boosting (Adaboost) with respect to accuracy, interpretability, robustness and processing speed. RF can be generally divided into training process and predicting process.udRecently with emergence of large-scale data mining applications, the RF training process implemented in software on a single computer can no longer induce a complex RF model within reasonable amount of time. Alternative solutions involving computer clusters and GPUs usually come with disadvantages with respect to Performance/Power ratio and are not feasible for portable/embedded applications.udIn this work a set of FPGA-based implementations of the RF training process are proposed. FPGA devices allow construction of efficient custom hardware architectures and feature lower power consumption than typical GPPs or GPUs therefore are suitable for portable/embedded applications. The proposed hardware training architectures take advantage of different types of inherent parallelism in the RF training algorithm and distribute the workload to a set of parallel workers. Combining the parallel processing techniques with custom hardware designs featuring low latency, the architectures are able to accelerate the training process without loss in accuracy.
机译:随机森林(RF)是可用的最广泛使用的监督学习方法之一。 RF通过注入多个随机性源来集成决策树分类器。它展示了对单决策树和回归树的一系列改进,并且在准确性,可解释性,鲁棒性和处理速度方面,可以与主要分类工具(如支持向量机(SVM)和自适应增强(Adaboost))相比或更高。 RF通常可以分为训练过程和预测过程。 ud随着大规模数据挖掘应用程序的出现,在单台计算机上以软件实现的RF训练过程不再能够在合理的时间内生成复杂的RF模型。涉及计算机集群和GPU的替代解决方案通常在性能/功率比方面具有劣势,并且对于便携式/嵌入式应用不可行。 ud在这项工作中,提出了一组基于FPGA的RF训练过程实现。与典型的GPP或GPU相比,FPGA器件允许构建高效的定制硬件架构,并具有较低的功耗,因此适用于便携式/嵌入式应用。所提出的硬件训练体系结构在RF训练算法中利用了不同类型的固有并行性,并将工作量分配给一组并行工作者。将并行处理技术与具有低延迟的定制硬件设计相结合,这些架构能够加速训练过程而不会降低准确性。

著录项

  • 作者

    Cheng Chuan;

  • 作者单位
  • 年度 2015
  • 总页数
  • 原文格式 PDF
  • 正文语种
  • 中图分类

相似文献

  • 外文文献
  • 中文文献
  • 专利

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号