首页> 外文期刊>IEEE Transactions on Knowledge and Data Engineering >Incremental Learning of Concept Drift from Streaming Imbalanced Data
【24h】

Incremental Learning of Concept Drift from Streaming Imbalanced Data

机译:通过流式传输不平衡数据增量学习概念漂移

获取原文
获取原文并翻译 | 示例
           

摘要

Learning in nonstationary environments, also known as learning concept drift, is concerned with learning from data whose statistical characteristics change over time. Concept drift is further complicated if the data set is class imbalanced. While these two issues have been independently addressed, their joint treatment has been mostly underexplored. We describe two ensemble-based approaches for learning concept drift from imbalanced data. Our first approach is a logical combination of our previously introduced Learn++.NSE algorithm for concept drift, with the well-established SMOTE for learning from imbalanced data. Our second approach makes two major modifications to Learn++.NSE-SMOTE integration by replacing SMOTE with a subensemble that makes strategic use of minority class data; and replacing Learn++.NSE and its class-independent error weighting mechanism with a penalty constraint that forces the algorithm to balance accuracy on all classes. The primary novelty of this approach is in determining the voting weights for combining ensemble members, based on each classifier's time and imbalance-adjusted accuracy on current and past environments. Favorable results in comparison to other approaches indicate that both approaches are able to address this challenging problem, each with its own specific areas of strength. We also release all experimental data as a resource and benchmark for future research.
机译:非平稳环境中的学习(也称为学习概念漂移)与从统计特性随时间变化的数据中学习有关。如果数据集类别不平衡,则概念漂移将变得更加复杂。尽管这两个问题已得到独立解决,但对它们的联合处理却大多未得到充分研究。我们描述了两种基于整体的方法来从不平衡数据中学习概念漂移。我们的第一种方法是将先前引入的Learn ++。NSE算法(用于概念漂移)与完善的SMOTE(用于从不平衡数据中学习)的逻辑组合。我们的第二种方法对Learn ++。NSE-SMOTE集成进行了两个主要修改,方法是将SMOTE替换为一个子群,该子群可以战略性地使用少数类数据。并使用惩罚约束替换Learn ++。NSE及其独立于类的错误加权机制,该约束迫使算法平衡所有类的准确性。这种方法的主要新颖之处在于,根据每个分类器的时间以及当前和过去环境中不平衡调整后的准确性,确定用于合并合奏成员的投票权重。与其他方法相比,良好的结果表明这两种方法都能够解决这一具有挑战性的问题,每种方法都有其自己的特定优势领域。我们还将发布所有实验数据,作为将来研究的资源和基准。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号