首页> 外文期刊>Applied Artificial Intelligence >A RANDOM DECISION TREE ENSEMBLE FOR MINING CONCEPT DRIFTS FROM NOISY DATA STREAMS
【24h】

A RANDOM DECISION TREE ENSEMBLE FOR MINING CONCEPT DRIFTS FROM NOISY DATA STREAMS

机译:可用于从嘈杂数据流中挖掘概念的概念的随机决策树

获取原文
获取原文并翻译 | 示例
           

摘要

Detecting concept drifts and reducing the impact from the noise in real applications of data streams are challenging but valuable for inductive learning. It is especially a challenge in a light demand on the overheads of lime, and space. However, though a great number of inductive learning algorithms based on ensemble classification models have been proposed for handling concept drifting data streams, little attention has been focused on the- detection of the diversity of concept drifts and the influence from noise in data streams simultaneously. Motivated by this, we present a new light-weighted inductive algorithm for concept drifting detection in virtue of an ensemble model of random decision trees (named, CDRDT) to distinguish various types of concept drifts from noisy data streams in this article. We use variably small data chunks to generate, random decision trees incrementally. Meanwhile, rue introduce the inequality of Hoeffding bounds and the principle of statistical quality control to detect the different types of concept drifts and noise. Extensive studies on synthetic and real streaming data demonstrate, that CDRDT could effectively and efficiently delect concept drifts from the noisy streaming data. Therefore, our algorithm provides a feasible reference framework of classification for concept drifting data streams with noise.
机译:在数据流的实际应用中,检测概念漂移并减少噪声的影响是具有挑战性的,但对于归纳学习很有用。在对石灰和空间顶部的轻需求方面,这尤其是一个挑战。然而,尽管已经提出了大量基于整体分类模型的归纳学习算法来处理概念漂移数据流,但很少有注意力集中在概念漂移多样性的检测以及数据流中噪声的影响上。因此,本文提出了一种新的轻量级归纳算法,用于概念漂移检测,该模型借助随机决策树(命名为CDRDT)的集成模型来区分嘈杂数据流中各种类型的概念漂移。我们使用可变小的数据块来逐步生成随机决策树。同时,rue介绍了Hoeffding边界的不等式以及统计质量控制的原理,以检测概念漂移和噪声的不同类型。对合成和真实流数据的大量研究表明,CDRDT可以有效地从嘈杂的流数据中发现概念漂移。因此,我们的算法为带噪声的概念漂移数据流提供了可行的分类参考框架。

著录项

  • 来源
    《Applied Artificial Intelligence》 |2010年第7期|p.680-710|共31页
  • 作者单位

    School of Computer Science and Information Engineering, Hefei University of Technology, Hefei 230009, China;

    rnSchool of Computer Science and Information Engineering, Hefei University of Technology, Hefei, China Department of Computer Science, University of Vermont, Vermont, USA;

    rnSchool of Computer Science and Information Engineering, Hefei University of Technology, Hefei, China;

    rnSchool of Information Systems, Singapore Management University, Singapore;

    rnCollege of Computer Science, Zhejiang University, China;

  • 收录信息
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类
  • 关键词

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号