Replanting Your Forest: NVM-friendly Bagging Strategy for Random Forest

Abstract

Random forest is effective and accurate in making predictions for classification and regression problems, which constitute the majority of today's machine learning applications and systems. However, as data are generated explosively in the big data era, many machine learning algorithms, including random forest, may have difficulty keeping and processing all the required data in main memory. Instead, intensive data movement (i.e., data swapping) between the faster-but-smaller main memory and the slower-but-larger secondary storage may occur and severely degrade performance. To address this challenge, emerging non-volatile memory (NVM) technologies are expected to replace traditional random access memory (RAM) and enable a larger-than-ever main memory space, thanks to their higher cell density, lower power consumption, and read performance comparable to that of traditional RAM. Nevertheless, the limited write endurance of NVM cells and the read-write asymmetry of NVMs may still limit the feasibility of running machine learning algorithms directly on NVMs. This dilemma motivates the study to develop an NVM-friendly bagging strategy for the random forest algorithm, which trades the "randomness" of the sampled data for reduced data movement in the memory hierarchy without hurting prediction accuracy. The evaluation results show that the proposed design can save up to 72% of the write accesses on representative traces with almost no degradation in prediction accuracy.
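The trade-off named in the abstract, giving up some sampling randomness in exchange for less data movement, can be illustrated with a small sketch. This is not the paper's algorithm, which the abstract does not describe. It only contrasts classic bootstrap sampling with a hypothetical block-wise variant. The function names, the block size, and the "block switch" count used as a rough proxy for data movement are all assumptions made for illustration.

```python
import random

def bootstrap_indices(n, rng):
    """Classic bagging: draw n row indices uniformly with replacement."""
    return [rng.randrange(n) for _ in range(n)]

def block_bootstrap_indices(n, block_size, rng):
    """Hypothetical locality-aware variant (an assumption, not the paper's
    method): draw whole blocks with replacement and keep each block's rows
    together, so consecutive accesses stay in the same memory region."""
    num_blocks = (n + block_size - 1) // block_size
    indices = []
    while len(indices) < n:
        start = rng.randrange(num_blocks) * block_size
        indices.extend(range(start, min(start + block_size, n)))
    return indices[:n]

def block_switches(indices, block_size):
    """Count how often consecutive accesses land in a different block.
    This is only a crude stand-in for swaps between memory and storage."""
    blocks = [i // block_size for i in indices]
    return sum(1 for a, b in zip(blocks, blocks[1:]) if a != b)

rng = random.Random(0)          # fixed seed so the run is repeatable
n, block_size = 10_000, 100     # illustrative sizes, chosen arbitrarily

plain = bootstrap_indices(n, rng)
blocked = block_bootstrap_indices(n, block_size, rng)

print("plain bootstrap block switches:  ", block_switches(plain, block_size))
print("block-wise bootstrap block switches:", block_switches(blocked, block_size))
```

Under these assumptions the block-wise sample changes block far less often than the plain bootstrap, at the cost of rows within a block always being drawn together. Whether prediction accuracy survives such a change is the question the paper evaluates, and this sketch does not address it.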

Bibliographic details

Source
IEEE Non-Volatile Memory Systems and Applications Symposium | 2019 | pp. 1-6 | 6 pages
Authors
Yu Ting Ho; Chun-Feng Wu; Ming-Chang Yang; Tseng-Yi Chen; Yuan-Hao Chang
Original format: PDF
Keywords
Nonvolatile memory; Machine learning algorithms; Random access memory; Bagging; Memory management; Training

Similar literature

1. Replanting Strategies of Cashew by Tamilnadu Forest Plantation Corporation [J]. M. Govinda Rao. The Cashew & Cocoa Journal. 2015, No. 2
2. Consistent estimation of residual variance with random forest Out-Of-Bag errors [J]. Ramosaj Burim, Pauly Markus. Statistics & Probability Letters. 2019
3. Points of Significance: Ensemble methods: bagging and random forests [J]. Altman Naomi, Krzywinski Martin. Nature Methods. 2017, No. 10
4. Replanting Your Forest: NVM-friendly Bagging Strategy for Random Forest [C]. Yu Ting Ho, Chun-Feng Wu, Ming-Chang Yang. IEEE Non-Volatile Memory Systems and Applications Symposium. 2019
5. Replanting the Douglas fir forest: Forest science and forest practice in the Pacific Northwest, 1890–1945 [D]. Brock, Emily Katherine. 2004
6. On the overestimation of random forest's out-of-bag error [O]. Silke Janitza, Roman Hornung. 2012
7. Ovarian cancer data classification using bagging and random forest [O]. A. Arfiani, Z. Rustam. 2019
