首页> 外文期刊>Journal of Statistical Software >wsrf: An R Package for Classification with Scalable Weighted Subspace Random Forests
【24h】

wsrf: An R Package for Classification with Scalable Weighted Subspace Random Forests

机译:wsrf:用于可伸缩加权子空间随机森林分类的​​R包

获取原文
           

摘要

We describe a parallel implementation in R of the weighted subspace random forest algorithm (Xu, Huang, Williams, Wang, and Ye 2012) available as the wsrf package. A novel variable weighting method is used for variable subspace selection in place of the traditional approach of random variable sampling. This new approach is particularly useful in building models for high dimensional data - often consisting of thousands of variables. Parallel computation is used to take advantage of multi-core machines and clusters of machines to build random forest models from high dimensional data in considerably shorter times. A series of experiments presented in this paper demonstrates that wsrf is faster than existing packages whilst retaining and often improving on the classification performance, particularly for high dimensional data.
机译:我们描述了加权子空间随机森林算法(Xu,Huang,Williams,Wang和Ye 2012)在R中的并行实现,该算法可作为wsrf包使用。一种新颖的变量加权方法用于变量子空间选择,代替了传统的随机变量采样方法。这种新方法在构建通常由数千个变量组成的高维数据模型中特别有用。并行计算用于利用多核计算机和计算机集群的优势,从而在相当短的时间内从高维数据构建随机森林模型。本文提出的一系列实验证明,wsRF比现有软件包更快,同时保留并经常改善分类性能,尤其是对于高维数据。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号