首页> 外文会议>IEEE International Conference on Fuzzy Systems >Efficient sample selection in data stream regression employing evolving generalized fuzzy models
【24h】

Efficient sample selection in data stream regression employing evolving generalized fuzzy models

机译:使用演化的广义模糊模型进行数据流回归的有效样本选择

获取原文

摘要

In this paper, we propose two criteria for efficient sample selection in case of data stream regression problems. The selection becomes apparent whenever the target values, which guide the update of the regressors as well as the implicit model structures, are costly to measure. Reducing the samples used for model updates as much as possible while keeping the predictive accuracy of the models on a high level is thus a central challenge, especially in non-stationary environments where (permanent) system changes or expansion can be expected. Our selection criteria rely on two aspects: 1.) the extrapolation degree of the model combined with its non-linearity degree, 2.) the uncertainty in model outputs which can be measured in terms of confidence intervals reflected by so-called adaptive error bars, which are updated over time synchronously to the model. The selection criteria are developed in combination with evolving generalized Takagi-Sugeno (TS) fuzzy models (containing rules in arbitrarily rotated position), which could be shown to outperform conventional evolving TS models (containing axis-parallel rules) and other stream regression techniques in previous publications. The results based on two high-dimensional real-world streaming problems show that a decrease of the number of model updates by about 80-85% (as only 15-20% of samples are selected) can still achieve similar accumulated model errors over time to the case when performing a full update on all samples. This may yield a significant reduction of computational demands and of costs whenever targets are costly to measure.
机译:在本文中,我们提出了两个标准,以在数据流回归出现问题时有效地选择样本。每当指导回归变量和隐式模型结构更新的目标值的测量成本很高时,选择就变得显而易见。因此,尽可能地减少用于模型更新的样本,同时将模型的预测精度保持在较高水平上,这是一个主要挑战,尤其是在非平稳环境中(可能会发生(永久性)系统更改或扩展)。我们的选择标准取决于两个方面:1.)模型的外推度及其非线性度; 2.)模型输出中的不确定性,可以通过所谓的自适应误差线反映的置信区间来测量,这些信息会随着时间的推移与模型同步更新。选择标准是与不断发展的广义Takagi-Sugeno(TS)模糊模型(包含任意旋转位置的规则)相结合而开发的,可以证明其优于传统的不断发展的TS模型(包含轴平行规则)和其他流回归技术。以前的出版物。基于两个高维现实流问题的结果表明,随着时间的流逝,模型更新次数减少约80-85%(因为仅选择了15-20%的样本)仍可以实现类似的累积模型误差对所有样本执行完全更新时的情况。每当目标的测量成本很高时,这可能会大大减少计算需求和成本。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号