首页> 外文会议>International Conference on Intelligent Data Analysis >Change (Detection) You Can Believe in: Finding Distributional Shifts in Data Streams
【24h】

Change (Detection) You Can Believe in: Finding Distributional Shifts in Data Streams

机译:更改(检测)您可以相信:在数据流中查找分配转移

获取原文

摘要

Data streams are dynamic, with frequent distributional changes. In this paper, we propose a statistical approach to detecting distributional shifts in multi-dimensional data streams. We use relative entropy, also known as the Kullback-Leibler distance, to measure the statistical distance between two distributions. In the context of a multidimensional data stream, the distributions are generated by data from two sliding windows. We maintain a sample of the data from the stream inside the windows to build the distributions. Our algorithm is streaming, nonparametric, and requires no distributional or model assumptions. It employs the statistical theory of hypothesis testing and bootstrapping to determine whether the distributions are statistically different. We provide a full suite of experiments on synthetic data to validate the method and demonstrate its effectiveness on data from real-life applications.
机译:数据流是动态的,具有频繁的分布变化。在本文中,我们提出了一种统计方法来检测多维数据流中的分布偏移。我们使用相对熵,也称为kullback-leibler距离,测量两个分布之间的统计距离。在多维数据流的上下文中,分布由来自两个滑动窗口的数据生成。我们从Windows内的流中维护数据的样本以构建分布。我们的算法是流式传输,非参数,并且不需要分布或模型假设。它采用假设检测的统计理论和自动启动,以确定分布是统计上不同的。我们为合成数据提供了一套全套实验,以验证该方法,并展示其对现实生活应用程序的数据的有效性。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号