首页> 外文会议>Electrical and Computer Engineering, 2005. Canadian Conference on >Approximate frequency counts in sliding window over data stream
【24h】

Approximate frequency counts in sliding window over data stream

机译:数据流滑动窗口中的近似频率计数

获取原文
获取外文期刊封面目录资料

摘要

The sliding window model is useful for discounting stale data in data stream applications. In this model, data elements arrive continually and only the most recent N elements are used when answering queries. We consider the important problem in sliding window - computing frequency counts being not less than a user-specified threshold. We present a one-pass algorithm for dynamically computing frequency counts in sliding window over a data stream. Our algorithm employs exponential histograms to maintain the frequency of elements, and carries out compression process periodically to delete the needless elements. Once the synopsis over the most recent N elements has been constructed, our algorithm supports approximate frequency counts over the most recent M elements as long as M does not exceed N. The actual space bounds obtained on experimental data are significantly better than both the worst case guarantees of our algorithm and the observed space requirements of earlier algorithms. The analytical and experimental results show that our algorithm is accurate and effective in dynamically computing the frequency counts in sliding window
机译:滑动窗口模型对于在数据流应用程序中折旧数据很有用。在此模型中,数据元素连续到达,并且在回答查询时仅使用最新的N个元素。我们认为滑动窗口中的重要问题-计算频率计数不少于用户指定的阈值。我们提出了一种单遍算法,用于动态计算数据流上滑动窗口中的频率计数。我们的算法使用指数直方图来保持元素的频率,并定期执行压缩过程以删除不必要的元素。一旦构造了有关最近N个元素的提要,只要M不超过N,我们的算法就可以支持最近M个元素的近似频率计数。根据实验数据获得的实际空间界限比两个最坏情况都好得多保证我们的算法以及较早算法观察到的空间要求。分析和实验结果表明,我们的算法在动态计算滑动窗口中的频率计数方面是准确有效的

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号