首页> 外文会议>International Joint Conference on Neural Networks >Quantifying the limited and gradual concept drift assumption
【24h】

Quantifying the limited and gradual concept drift assumption

机译:量化有限且渐进的概念漂移假设

获取原文

摘要

Nonstationary environments, where underlying distributions change over time, are becoming increasingly common in real-world applications. A specific example of such an environment is concept drift, where the joint probability distributions of observed data drift over time. Such environments call for a model that can update its parameters to adapt to the changing environment. An extreme case of this scenario, referred to as extreme verification latency, is where labeled data are only available at initialization, with unlabeled data becoming available in a streaming fashion thereafter. In such a scenario, the classifier must update its hypothesis based on only unlabeled data drawn from the drifting distributions. In our prior work, we described a framework, called COMPOSE, that works well in this type of environment, provided that the data distributions experience limited (or gradual) drift. Limited drift assumption is common in many concept drift algorithms yet — surprisingly — there is little or no formal definition of this assumption. In this contribution, we describe a mechanism to formally quantify limited drift. We define two metrics, one that represents the normalized class separation drift, and the other that uses the ratio of between-class separations and within class drift through time. We test these metrics on both synthetic and real world problems, and argue that the latter can be more suitably used.
机译:底层分布随时间变化的非平稳环境在实际应用中变得越来越普遍。这种环境的一个特定示例是概念漂移,其中观察到的数据的联合概率分布随时间漂移。这样的环境需要一个可以更新其参数以适应不断变化的环境的模型。这种情况的极端情况称为极端验证延迟,其中标记的数据仅在初始化时可用,而未标记的数据此后以流方式变为可用。在这种情况下,分类器必须仅基于从漂移分布中提取的未标记数据来更新其假设。在我们之前的工作中,我们描述了一个名为COMPOSE的框架,只要数据分布受到有限的(或渐进的)漂移,该框架就可以在这种类型的环境中很好地工作。有限的漂移假设在许多概念漂移算法中很常见,但令人惊讶的是,对此假设几乎没有正式定义。在此贡献中,我们描述了一种正式量化有限漂移的机制。我们定义了两个度量,一个表示标准化的类分离漂移,另一个使用类间分离和类内漂移随时间的比率。我们对综合问题和现实问题都测试了这些指标,并认为后者可以更适当地使用。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号