首页> 外文会议>Canadian conference on artificial intelligence >Improvements to Boosting with Data Streams
【24h】

Improvements to Boosting with Data Streams

机译:数据流增强的改进

获取原文

摘要

Data Streams (DS) pose a challenge for any machine learning algorithm, because of high volume of data - on the order of millions of instances for a typical data set. Various algorithms were proposed, in particular, OzaBoost - a parallel adaptation of AdaBoost - creates various "weak" learners in parallel and updates each of them with new instances during training. At any moment, OzaBoost can stop and output the final model. OzaBoost suffers with memory consumption, which avoids its use for certain types of problems. This work introduces OzaBoost Dynamic, which changes the weight calculation and the number of boosted "weak" learners used by OzaBoost to improve its performance in terms of memory consumption. This work presents the empirical results showing the performance of all algorithms using data sets with 50 and 60 million instances.
机译:数据流(DS)对任何机器学习算法都构成了挑战,因为数据量很大-典型数据集的实例数百万个实例。提出了各种算法,特别是OzaBoost-AdaBoost的并行改编-并行创建各种“弱”学习者,并在训练过程中使用新实例更新每个学习者。 OzaBoost随时可以停止并输出最终模型。 OzaBoost会消耗内存,从而避免将其用于某些类型的问题。这项工作介绍了OzaBoost Dynamic,它可以更改权重计算和OzaBoost使用的增强型“弱”学习者的数量,以提高其在内存消耗方面的性能。这项工作提出了经验结果,该结果显示了使用具有50和6000万个实例的数据集的所有算法的性能。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号