首页> 外文会议>ACM conference on information and knowledge management >Partial Drift Detection Using a Rule Induction Framework
【24h】

Partial Drift Detection Using a Rule Induction Framework

机译:使用规则感应框架的部分漂移检测

获取原文

摘要

The major challenge in mining data streams is the issue of concept drift, the tendency of the underlying data generation process to change over time. In this paper, we propose a general rule learning framework that can efficiently handle concept-drifting data streams and maintain a highly accurate classification model. The main idea is to focus on partial drifts by allowing individual rules to monitor the stream and detect if there is a drift in the regions they cover. A rule quality measure then decides whether the affected rules are inconsistent with the concept drift. The model is accordingly updated to only include rules that are consistent with the newly arrived concept. A dynamically maintained set of instances deemed relevant to the most recent concept is also kept at memory. Learning a new concept from a larger set of instances reduces the variance of data distribution and allows for a more accurate, stable classification model. Our experiments show that this approach not only handles the drift efficiently, but it also can provide higher classification accuracy compared to other competitive approaches on a variety of real and synthetic data sets.
机译:采矿数据流中的主要挑战是概念漂移问题,底层数据生成过程随时间变化的趋势。在本文中,我们提出了一般规则学习框架,可以有效地处理概念漂移数据流并维持高度准确的分类模型。主要思想是通过允许单个规则监视流并检测它们覆盖的区域是否存在漂移来聚焦部分漂移。然后,规则质量措施决定受影响的规则是否与概念漂移不一致。因此,该模型的更新仅包括与新到达概念一致的规则。一种动态维护的一组与最新概念相关的实例也会保持在记忆中。从更大一组实例学习新概念会降低数据分布的方差,并允许更准确,稳定的分类模型。我们的实验表明,这种方法不仅有效地处理漂移,而且还可以提供与各种实际和合成数据集的其他竞争方法相比更高的分类准确性。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号