首页> 外文会议>International Joint Conference on Neural Networks >Crogging (cross-validation aggregation) for forecasting — A novel algorithm of neural network ensembles on time series subsamples
【24h】

Crogging (cross-validation aggregation) for forecasting — A novel algorithm of neural network ensembles on time series subsamples

机译:交叉预测(交叉验证聚合)—一种新的神经网络算法,对时间序列子样本进行集成

获取原文

摘要

In classification, regression and time series prediction alike, cross-validation is widely employed to estimate the expected accuracy of a predictive algorithm by averaging predictive errors across mutually exclusive subsamples of the data. Similarly, bootstrapping aims to increase the validity of estimating the expected accuracy by repeatedly sub-sampling the data with replacement, creating overlapping samples of the data. Estimates are then used to anticipate of future risk in decision making, or to guide model selection where multiple candidates are feasible. Beyond error estimation, bootstrapping has recently been extended to combine each of the diverse models created for estimation, and aggregating over each of their predictions (rather than their errors), coined bootstrap aggregation or bagging. However, similar extensions of cross-validation to create diverse forecasting models have not been considered. In accordance with bagging, we propose to combine the benefits of cross-validation and forecast aggregation, i.e. crogging. We assesses different levels of cross-validation, including a (single-fold) hold-out approach, 2-fold and 10-fold cross validation and Monte-Carlos cross validation, to create diverse base-models of neural networks for time series prediction trained on different data subsets, and average their individual multiple-step ahead predictions. Results of forecasting the 111 time series of the NN3 competition indicate significant improvements accuracy through Crogging relative to Bagging or individual model selection of neural networks.
机译:在分类,回归和时间序列预测中,交叉验证通过在数据互斥的子样本中平均预测误差的平均值,广泛用于估计预测算法的预期准确性。类似地,自举的目的是通过对替换后的数据重复进行二次采样,以创建重叠的数据样本,从而提高估计预期准确性的有效性。然后,将估计值用于预测决策中的未来风险,或在可行多个候选对象的情况下指导模型选择。除了错误估计之外,最近还扩展了引导程序,以结合为估计而创建的每个不同模型,并汇总其每个预测(而不是其错误),精简的自举聚合或装袋。但是,尚未考虑交叉验证的类似扩展以创建各种预测模型。根据装袋的要求,我们建议将交叉验证和预测汇总(即combine割)的好处结合起来。我们评估了不同级别的交叉验证,包括(单倍)保留方法,2倍和10倍交叉验证以及Monte-Carlos交叉验证,以创建用于时间序列预测的神经网络的多种基本模型在不同的数据子集上进行训练,并对他们各自的多步预测进行平均。预测NN3竞赛的111个时间序列的结果表明,相对于袋装或神经网络的单个模型选择,Crogging可以显着提高准确性。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号