Incremental Learning of Concept Drift from Streaming Imbalanced Data

Ditzler; Gregory; Polikar; Robi

首页> 外文期刊>IEEE Transactions on Knowledge and Data Engineering >Incremental Learning of Concept Drift from Streaming Imbalanced Data

【24h】

Incremental Learning of Concept Drift from Streaming Imbalanced Data

机译：通过流式传输不平衡数据增量学习概念漂移

获取原文

获取原文并翻译 | 示例

掌桥外文数据库（机构版） >>

开具论文收录证明 >>

文献代查 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

Learning in nonstationary environments, also known as learning concept drift, is concerned with learning from data whose statistical characteristics change over time. Concept drift is further complicated if the data set is class imbalanced. While these two issues have been independently addressed, their joint treatment has been mostly underexplored. We describe two ensemble-based approaches for learning concept drift from imbalanced data. Our first approach is a logical combination of our previously introduced Learn++.NSE algorithm for concept drift, with the well-established SMOTE for learning from imbalanced data. Our second approach makes two major modifications to Learn++.NSE-SMOTE integration by replacing SMOTE with a subensemble that makes strategic use of minority class data; and replacing Learn++.NSE and its class-independent error weighting mechanism with a penalty constraint that forces the algorithm to balance accuracy on all classes. The primary novelty of this approach is in determining the voting weights for combining ensemble members, based on each classifier's time and imbalance-adjusted accuracy on current and past environments. Favorable results in comparison to other approaches indicate that both approaches are able to address this challenging problem, each with its own specific areas of strength. We also release all experimental data as a resource and benchmark for future research.

机译：非平稳环境中的学习（也称为学习概念漂移）与从统计特性随时间变化的数据中学习有关。如果数据集类别不平衡，则概念漂移将变得更加复杂。尽管这两个问题已得到独立解决，但对它们的联合处理却大多未得到充分研究。我们描述了两种基于整体的方法来从不平衡数据中学习概念漂移。我们的第一种方法是将先前引入的Learn ++。NSE算法（用于概念漂移）与完善的SMOTE（用于从不平衡数据中学习）的逻辑组合。我们的第二种方法对Learn ++。NSE-SMOTE集成进行了两个主要修改，方法是将SMOTE替换为一个子群，该子群可以战略性地使用少数类数据。并使用惩罚约束替换Learn ++。NSE及其独立于类的错误加权机制，该约束迫使算法平衡所有类的准确性。这种方法的主要新颖之处在于，根据每个分类器的时间以及当前和过去环境中不平衡调整后的准确性，确定用于合并合奏成员的投票权重。与其他方法相比，良好的结果表明这两种方法都能够解决这一具有挑战性的问题，每种方法都有其自己的特定优势领域。我们还将发布所有实验数据，作为将来研究的资源和基准。

著录项

来源
《IEEE Transactions on Knowledge and Data Engineering》 |2013年第10期|2283-2301|共19页
作者
Ditzler; Gregory; Polikar; Robi;
展开▼
作者单位

Drexel University, Philadelphia|c|;

展开▼
收录信息
原文格式 PDF
正文语种 eng
中图分类
关键词
Incremental learning; class imbalance; concept drift; multiple classifier systems;

机译：增量学习;班级失衡;概念漂移;多个分类器系统;

相似文献

外文文献
中文文献
专利

1. Incremental learning imbalanced data streams with concept drift: The dynamic updated ensemble algorithm [J] . Li Zeng, Huang Wenchao, Xiong Yan, Knowledge-Based Systems . 2020,第May11期

机译：增量学习具有概念漂移的简单数据流：动态更新的集合算法
2. A comprehensive active learning method for multiclass imbalanced data streams with concept drift [J] . Liu W., Zhang H., Ding Z., Knowledge-Based Systems . 2021,第Mara5期

机译：具有概念漂移的多种数据流数据流的全面主动学习方法
3. GPU-Accelerated Extreme Learning Machines for Imbalanced Data Streams with Concept Drift [J] . Bartosz Krawczyk Procedia Computer Science . 2016,第1期

机译：GPU加速的极限学习机，用于概念漂移的不平衡数据流
4. Dynamic Weighted Majority for Incremental Learning of Imbalanced Data Streams with Concept Drift [C] . Yang Lu, Yiu-ming Cheung, Yuan Yan Tang International Joint Conference on Artificial Intelligence . 2019

机译：具有概念漂移的不平衡数据流的动态加权多数
5. The GC3 framework grid density based clustering for classification of streaming data with concept drift. [D] . Sethi, Tegjyot Singh. 2013

机译：基于GC3框架网格密度的聚类，用于通过概念漂移对流数据进行分类。
6. Cost-Sensitive Classification for Evolving Data Streams with Concept Drift and Class Imbalance [O] . Yange Sun, Meng Li, Lei Li, 2021

机译：具有与概念漂移和类不平衡的演化数据流的成本敏感分类
7. Two-Stage Cost-Sensitive Learning for Data Streams With Concept Drift and Class Imbalance [O] . Yange Sun, Yi Sun, Honghua Dai 2020

机译：具有概念漂移和级别不平衡的数据流的两阶段成本敏感学习

Incremental Learning of Concept Drift from Streaming Imbalanced Data

摘要

著录项

相似文献

相关主题

期刊订阅