首页> 外文会议>Advances in intelligent data analysis VIII >Change (Detection) You Can Believe in: Finding Distributional Shifts in Data Streams
【24h】

Change (Detection) You Can Believe in: Finding Distributional Shifts in Data Streams

机译:您可以相信的更改(检测):查找数据流中的分布偏移

获取原文
获取原文并翻译 | 示例

摘要

Data streams are dynamic, with frequent distributional changes. In this paper, we propose a statistical approach to detecting distributional shifts in multi-dimensional data streams. We use relative entropy, also known as the Kullback-Leibler distance, to measure the statistical distance between two distributions. In the context of a multidimensional data stream, the distributions are generated by data from two sliding windows. We maintain a sample of the data from the stream inside the windows to build the distributions.rnOur algorithm is streaming, nonparametric, and requires no distributional or model assumptions. It employs the statistical theory of hypothesis testing and bootstrapping to determine whether the distributions are statistically different. We provide a full suite of experiments on synthetic data to validate the method and demonstrate its effectiveness on data from real-life applications.
机译:数据流是动态的,具有频繁的分布变化。在本文中,我们提出了一种统计方法来检测多维数据流中的分布偏移。我们使用相对熵(也称为Kullback-Leibler距离)来度量两个分布之间的统计距离。在多维数据流的上下文中,分布是由来自两个滑动窗口的数据生成的。我们维护窗口内流中数据的样本以构建分布。rn我们的算法是流式的,非参数性的,不需要任何分布或模型假设。它使用假设检验和自举的统计理论来确定分布是否在统计上不同。我们提供了一整套关于合成数据的实验,以验证该方法并证明其对来自实际应用程序的数据的有效性。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号