首页> 外文期刊>JMLR: Workshop and Conference Proceedings >Evaluation of Ensemble Methods in Imbalanced Regression Tasks
【24h】

Evaluation of Ensemble Methods in Imbalanced Regression Tasks

机译:不平衡回归任务中集成方法的评估

获取原文
           

摘要

Ensemble methods are well known for providing an advantage over single models in a large range of data mining and machine learning tasks. Their benefits are commonly associated to the ability of reducing the bias and/or variance in learning tasks. Ensembles have been studied both for classification and regression tasks with uniform domain preferences. However, only for imbalanced classification these methods were thoroughly studied. In this paper we present an empirical study concerning the predictive ability of ensemble methods bagging and boosting in regression tasks, using 20 data sets with imbalanced distributions, and assuming non-uniform domain preferences. Results show that ensemble methods are capable of providing improvements in predictive ability towards under-represented values, and that this improvement influences the predictive ability of models concerning the average behaviour of the data. Results also show that the smaller data sets are prone to larger improvements in predictive accuracy and that no conclusion could be drawn when considering the percentage of rare cases alone.
机译:众所周知,集合方法在大量数据挖掘和机器学习任务中提供了优于单个模型的优势。它们的好处通常与减少学习任务中的偏见和/或差异的能力有关。已经针对具有统一域首选项的分类和回归任务研究了合奏。但是,仅针对不平衡分类,对这些方法进行了深入研究。在本文中,我们使用20个分布不平衡的数据集并假设非均匀的领域偏好,对回归任务中套袋方法和增强方法的预测能力进行了实证研究。结果表明,集成方法能够提高针对代表性不足的值的预测能力,并且这种改进会影响模型有关数据平均行为的预测能力。结果还表明,较小的数据集倾向于较大程度地提高预测准确性,仅考虑罕见病例的百分比就无法得出结论。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号