【24h】

Continuous Adaptive Outlier Detection on Distributed Data Streams

机译:分布式数据流的连续自适应离群值检测

获取原文
获取原文并翻译 | 示例

摘要

In many applications, stream data are too voluminous to be collected in a central fashion and often transmitted on a distributed network. In this paper, we focus on the outlier detection over distributed data streams in real time, firstly, we formalize the problem of outlier detection using the kernel density estimation technique. Then, we adopt the fading strategy to keep pace with the transient and evolving natures of stream data, and mico-cluster technique to conquer the data partition and "one-pass" scan. Furthermore, our extensive experiments with synthetic and real data show that the proposed algorithm is efficient and effective compared with existing outlier detection algorithms, and more suitable for data streams.
机译:在许多应用中,流数据量太大,无法以集中方式收集,并且经常在分布式网络上传输。在本文中,我们专注于实时地对分布式数据流进行离群值检测,首先,我们使用核密度估计技术对离群值检测问题进行形式化。然后,我们采用衰落策略来跟上流数据的瞬态和不断发展的特性,并采用mico-cluster技术来征服数据分区和“一次通过”扫描。此外,我们对合成和真实数据进行的大量实验表明,与现有的离群值检测算法相比,该算法是有效且有效的,并且更适合于数据流。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号