Quantifying the limited and gradual concept drift assumption

机译：量化有限且渐进的概念漂移假设

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

Nonstationary environments, where underlying distributions change over time, are becoming increasingly common in real-world applications. A specific example of such an environment is concept drift, where the joint probability distributions of observed data drift over time. Such environments call for a model that can update its parameters to adapt to the changing environment. An extreme case of this scenario, referred to as extreme verification latency, is where labeled data are only available at initialization, with unlabeled data becoming available in a streaming fashion thereafter. In such a scenario, the classifier must update its hypothesis based on only unlabeled data drawn from the drifting distributions. In our prior work, we described a framework, called COMPOSE, that works well in this type of environment, provided that the data distributions experience limited (or gradual) drift. Limited drift assumption is common in many concept drift algorithms yet — surprisingly — there is little or no formal definition of this assumption. In this contribution, we describe a mechanism to formally quantify limited drift. We define two metrics, one that represents the normalized class separation drift, and the other that uses the ratio of between-class separations and within class drift through time. We test these metrics on both synthetic and real world problems, and argue that the latter can be more suitably used.

机译：底层分布随时间变化的非平稳环境在实际应用中变得越来越普遍。这种环境的一个特定示例是概念漂移，其中观察到的数据的联合概率分布随时间漂移。这样的环境需要一个可以更新其参数以适应不断变化的环境的模型。这种情况的极端情况称为极端验证延迟，其中标记的数据仅在初始化时可用，而未标记的数据此后以流方式变为可用。在这种情况下，分类器必须仅基于从漂移分布中提取的未标记数据来更新其假设。在我们之前的工作中，我们描述了一个名为COMPOSE的框架，只要数据分布受到有限的（或渐进的）漂移，该框架就可以在这种类型的环境中很好地工作。有限的漂移假设在许多概念漂移算法中很常见，但令人惊讶的是，对此假设几乎没有正式定义。在此贡献中，我们描述了一种正式量化有限漂移的机制。我们定义了两个度量，一个表示标准化的类分离漂移，另一个使用类间分离和类内漂移随时间的比率。我们对综合问题和现实问题都测试了这些指标，并认为后者可以更适当地使用。

著录项

来源
《International Joint Conference on Neural Networks》|2015年|1-8|共8页
会议地点
作者
Sarnelle Joseph; Sanchez Anthony; Capo Robert; Haas Joshua; Polikar Robi;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类
关键词

相似文献

外文文献
中文文献
专利

1. The Gradual Resampling Ensemble for mining imbalanced data streams with concept drift [J] . Ren Siqi, Liao Bo, Zhu Wen, Neurocomputing . 2018,第apra19期

机译：用于概念漂移的不平衡数据流的逐步重采样集合
2. Early detection of gradual concept drifts by text categorization and Support Vector Machine techniques: The TRIO algorithm [J] . M. Marseguerra Reliability Engineering & System Safety . 2014,第sepa期

机译：通过文本分类和支持向量机技术对渐变概念漂移进行早期检测：TRIO算法
3. Using unsupervised incremental learning to cope with gradual concept drift [J] . David Hadas, Galit Yovel, and Nathan Intrator Connection Science . 2011,第1期

机译：使用无人监督的增量学习来应对渐进的概念漂移
4. Quantifying the Limited and Gradual Concept Drift Assumption [C] . Joseph Sarnelle, Anthony Sanchez, Robert Capo, International Joint Conference on Neural Networks . 2015

机译：量化有限和渐进的概念漂移假设
5. Teaching algebra-based concepts to students with learning disabilities: The effects of preteaching using a gradual instructional sequence [D] . Watt, Sarah Jean 2013

机译：以学习障碍的学生教基于代数的概念：使用渐进教学序列的自由侵入的影响
6. Drift by drift: effective population size is limited by advection [O] . John P Wares, James M Pringle 2008

机译：漂移漂移：有效人口规模受平流限制
7. Fuzzy time windowing for gradual concept drift adaptation [O] . Anjin Liu, Guangquan Zhang, Jie Lu 2017

机译：用于逐步概念漂移适应的模糊时间窗口
8. Inertial orientation tracker having gradual automatic drift compensation for tracking human head and other similarly sized body [R] . 2002

机译：惯性定向跟踪器具有逐渐自动漂移补偿，用于跟踪人体头部和其他类似尺寸的身体

Quantifying the limited and gradual concept drift assumption

摘要

著录项

相似文献

相关主题

期刊订阅