首页> 外文会议>Multiple Classifier Systems >Random Aggregated and Bagged Ensembles of SVMs: An Empirical Bias-Variance Analysis
【24h】

Random Aggregated and Bagged Ensembles of SVMs: An Empirical Bias-Variance Analysis

机译:支持向量机的随机聚集和袋装集成:经验偏差-方差分析

获取原文

摘要

Bagging can be interpreted as an approximation of random aggregating, an ideal ensemble method by which base learners are trained using data sets randomly drawn according to an unknown probability distribution. An approximate realization of random aggregating can be obtained through subsampled bagging, when large training sets are available. In this paper we perform an experimental bias-variance analysis of bagged and random aggregated ensembles of Support Vector Machines, in order to quantitatively evaluate their theoretical variance reduction properties. Experimental results with small samples show that random aggregating, implemented through subsampled bagging, reduces the variance component of the error by about 90%, while bagging, as expected, achieves a lower reduction. Bias-variance analysis explains also why ensemble methods based on subsampling techniques can be successfully applied to large data mining problems.
机译:套袋可以解释为随机聚集的近似值,这是一种理想的集成方法,通过该方法,可以使用根据未知概率分布随机绘制的数据集来训练基础学习者。当有大量的训练集可用时,可以通过子采样的装袋获得随机聚集的近似实现。在本文中,我们对支持向量机的袋装和随机集合集成进行实验性偏差方差分析,以定量评估其理论方差减少性质。小样本的实验结果表明,通过子采样装袋实现的随机汇总可将误差的方差分量减少约90%,而装袋则可以达到预期的降低效果。偏差方差分析还解释了为什么基于子采样技术的集成方法可以成功地应用于大数据挖掘问题。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号