...
首页> 外文期刊>International journal of machine learning and cybernetics >Adaptive robust local online density estimation for streaming data
【24h】

Adaptive robust local online density estimation for streaming data

机译:用于流数据的自适应强大的本地在线密度估计

获取原文
获取原文并翻译 | 示例
           

摘要

Accurate online density estimation is crucial to numerous applications that are prevalent with streaming data. Existing online approaches for density estimation somewhat lack prompt adaptability and robustness when facing concept-drifting and noisy streaming data, resulting in delayed or even deteriorated approximations. To alleviate this issue, in this work, we first propose an adaptive local online kernel density estimator (ALoKDE) for real-time density estimation on data streams. ALoKDE consists of two tightly integrated strategies: (1) a statistical test for concept drift detection and (2) an adaptive weighted local online density estimation when a drift does occur. Specifically, using a weighted form, ALoKDE seeks to provide an unbiased estimation by factoring in the statistical hallmarks of the latest learned distribution and any potential distributional changes that could be introduced by each incoming instance. A robust variant of ALoKDE, i.e., R-ALoKDE, is further developed to effectively handle data streams with varied types/levels of noise. Moreover, we analyze the asymptotic properties of ALoKDE and R-ALoKDE, and also derive their theoretical error bounds regarding bias, variance, MSE and MISE. Extensive comparative studies on various artificial and real-world (noisy) streaming data demonstrate the efficacies of ALoKDE and R-ALoKDE in online density estimation and real-time classification (with noise).
机译:准确的在线密度估计对于具有流数据流动的许多应用是至关重要的。在面对概念漂移和嘈杂的流数据时,存在在密度估计的现有在线估计方法略微缺乏迅速的适应性和鲁棒性,从而导致延迟甚至劣化的近似。为了减轻这个问题,在这项工作中,我们首先提出了一个适应性本地在线内核密度估计器(Alokde),用于数据流上的实时密度估计。 Alokde由两个紧密综合的策略组成:(1)概念漂移检测的统计测试和(2)当发生漂移时的自适应加权本地在线密度估计。具体地,使用加权形式,Alokde寻求通过在最新学习分发的统计标志中提供无偏见的估计,以及每个进入的实例可以引入的任何潜在分布变化。进一步开发了一种Alokde,即R-Alokde的鲁棒变体,以有效地处理具有不同类型/噪声水平的数据流。此外,我们分析了Alokde和R-Alokde的渐近性质,并导出了其关于偏差,方差,MSE和Mise的理论误差范围。关于各种人工和现实世界(嘈杂)流数据的广泛比较研究证明了Alokde和R-Alokde在在线密度估算和实时分类中的疗效(具有噪音)。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号