首页> 外文会议>International Conference on Database Systems for Advanced Applications >M-kernel merging: towards density estimation over data streams
【24h】

M-kernel merging: towards density estimation over data streams

机译:M-Kernel合并:朝着数据流估算密度估计

获取原文

摘要

Density estimation is a costly operation for computing distribution information of data sets underlying many important data mining applications, such as clustering and biased sampling. However, traditional density estimation methods are inapplicable for streaming data, which are continuously arriving large volume of data, because of their request for linear storage and square size calculation. The shortcoming limits the application of many existing effective algorithms on data streams, for which the mining problem is an emergency for applications and a challenge for research. In this paper, the problem of computing density functions over data streams is examined. A novel method attacking this shortcoming of existing methods is developed to enable density estimation for large volume of data in linear time, fixed size memory, and without lose of accuracy. The method is based on M-Kernel merging, so that limited kernel functions to be maintained are determined intelligently. The application of the new method on different streaming data models is discussed, and the result of intensive experiments is presented. The analytical and empirical result show that this new density estimation algorithm for data streams can calculate density functions on demand at any time with high accuracy for different streaming data models.
机译:密度估计是用于计算数据集的分发信息的昂贵操作,这些数据集的许多重要数据挖掘应用程序,例如聚类和偏置采样。然而,传统的密度估计方法是不适用的,用于流传输数据,这是连续到达大量数据的流量,因为它们请求线性存储和方尺寸计算。缺点限制了许多现有的有效算法在数据流上的应用,其中挖掘问题是应用的紧急情况以及研究的挑战。在本文中,检查了在数据流上计算密度函数的问题。开发了一种攻击现有方法缺点的新方法,以使线性时间,固定尺寸存储器中的大量数据能够实现密度估计,并且没有减少精度。该方法基于M-kernel合并,从而智能地确定要维护的有限的内核函数。讨论了新方法在不同流数据模型中的应用,并提出了密集实验的结果。分析和经验结果表明,对于数据流的这种新密度估计算法可以在不同时间为不同流数据模型的高精度计算密度函数。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号