首页> 外文会议>IEEE Non-Volatile Memory Systems and Applications Symposium >Replanting Your Forest: NVM-friendly Bagging Strategy for Random Forest
【24h】

Replanting Your Forest: NVM-friendly Bagging Strategy for Random Forest

机译:植树造林:随机森林的NVM友好套袋策略

获取原文

摘要

Random forest is effective and accurate in making predictions for classification and regression problems, which constitute the majority of machine learning applications or systems nowadays. However, as the data are being generated explosively in this big data era, many machine learning algorithms, including the random forest algorithm, may face the difficulty in maintaining and processing all the required data in the main memory. Instead, intensive data movements (i.e., data swappings) between the faster-but-smaller main memory and the slowerbut-larger secondary storage may occur excessively and largely degrade the performance. To address this challenge, the emerging non-volatile memory (NVM) technologies are placed great hopes to substitute the traditional random access memory (RAM) and to build a larger-than-ever main memory space because of its higher cell density, lower power consumption, and comparable read performance as traditional RAM. Nevertheless, the limited write endurance of NVM cells and the read-write asymmetry of NVMs may still limit the feasibility of performing machine learning algorithms directly on NVMs. Such dilemma inspires this study to develop an NVM-friendly bagging strategy for the random forest algorithm, in order to trade the “randomness” of the sampled data for the reduced data movements in the memory hierarchy without hurting the prediction accuracy. The evaluation results show that the proposed design could save up to 72% of the write accesses on the representative traces with nearly no degradation on the prediction accuracy.
机译:随机森林在预测分类和回归问题方面是有效且准确的,而分类和回归问题已成为当今机器学习应用程序或系统的主要组成部分。但是,由于在这个大数据时代爆炸性地生成数据,许多机器学习算法(包括随机森林算法)可能会面临在主存储器中维护和处理所有必需数据的困难。取而代之的是,在速度更快但较小的主存储器和速度较慢但较大的辅助存储器之间的密集数据移动(即数据交换)可能会过度发生,从而大大降低性能。为了应对这一挑战,新兴的非易失性存储器(NVM)技术寄希望于替代传统的随机存取存储器(RAM),并由于其更高的单元密度,更低的功耗而建立了比以往更大的主存储器空间。消耗量以及与传统RAM相当的读取性能然而,NVM单元的有限写入耐久性和NVM的读写不对称性仍可能限制直接在NVM上执行机器学习算法的可行性。这种困境激发了这项研究,为随机森林算法开发了一种NVM友好的装袋策略,以便以采样数据的“随机性”为代价,减少内存层次结构中的数据移动,而又不损害预测精度。评估结果表明,所提出的设计可以节省多达72%的代表性迹线上的写访问,而预测精度几乎不会降低。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号