首页> 外文会议>Pacific-Asia Conference on Knowledge Discovery and Data Mining >PEARL: Probabilistic Exact Adaptive Random Forest with Lossy Counting for Data Streams
【24h】

PEARL: Probabilistic Exact Adaptive Random Forest with Lossy Counting for Data Streams

机译:PEARL:对数据流进行有损计数的概率精确自适应随机森林

获取原文

摘要

In order to adapt random forests to the dynamic nature of data streams, the state-of-the-art technique discards trained trees and grows new trees when concept drifts are detected. This is particularly wasteful when recurrent patterns exist. In this work, we introduce a novel framework called PEARL, which uses both an exact technique and a probabilistic graphical model with Lossy Counting, to replace drifted trees with relevant trees built from the past. The exact technique utilizes pattern matching to find the set of drifted trees, that co-occurred in predictions in the past. Meanwhile, a probabilistic graphical model is being built to capture the tree replacements among recurrent concept drifts. Once the graphical model becomes stable, it replaces the exact technique and finds relevant trees in a probabilistic fashion. Further, Lossy Counting is applied to the graphical model which brings an added theoretical guarantee for both error rate and space complexity. We empirically show our technique outperforms baselines in terms of cumulative accuracy on both synthetic and real-world datasets.
机译:为了使随机森林适应数据流的动态性质,最新技术会在检测到概念漂移时丢弃经过训练的树木,并种植新的树木。当存在重复模式时,这特别浪费。在这项工作中,我们介绍了一个称为PEARL的新颖框架,该框架同时使用精确技术和带有“有损计数”的概率图形模型,用过去构建的相关树替换了漂移的树。精确的技术利用模式匹配来找到在过去的预测中共同出现的漂移树集。同时,正在建立一个概率图形模型来捕获周期性概念漂移中的树替换。一旦图形模型变得稳定,它将替换精确的技术并以概率方式找到相关的树。此外,将有损计数应用于图形模型,这为错误率和空间复杂性都带来了额外的理论保证。从经验来看,我们的技术在合成数据集和实际数据集上的累积准确性均优于基线。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号