【24h】

Reducing Data Stream Sliding Windows by Cyclic Tree-Like Histograms

机译:通过循环树状直方图减少数据流滑动窗口

获取原文
获取原文并翻译 | 示例

摘要

Data reduction is a basic step in a KDD process useful for delivering to successive stages more concise and meaningful data. When mining is applied to data streams, that are continuous data flows, the issue of suitably reducing them is highly interesting, in order to arrange effective approaches requiring multiple scans on data, that, in such a way, may be performed over one or more reduced sliding windows. A class of queries, whose importance in the context of KDD is widely accepted, corresponds to sum range queries. In this paper we propose a histogram-based technique for reducing sliding windows supporting approximate arbitrary (i.e., non biased) sum range queries. The histogram, based on a hierarchical structure (opposed to the flat structure of traditional ones), results suitable for directly supporting hierarchical queries, and, thus, drill-down and roll-up operations. In addition, the structure well supports sliding window shifting and quick query answering (both these operations are logarithmic in the sliding window size). Experimental analysis shows the superiority of our method in terms of accuracy w.r.t. the state-of-the-art approaches in the context of histogram-based sliding window reduction techniques.
机译:数据减少是KDD流程中的基本步骤,有助于将更简洁,有意义的数据传递到后续阶段。当对连续的数据流即数据流进行挖掘时,为了减少需要对数据进行多次扫描的有效方法,可以适当地减少它们的问题是非常有趣的,这样可以对一个或多个对象执行多次扫描减少滑动窗口。一类查询,其在KDD上下文中的重要性已被广泛接受,它对应于总和范围查询。在本文中,我们提出了一种基于直方图的技术来减少支持近似任意(即无偏)和范围查询的滑动窗口。基于分层结构(与传统的平面结构相反)的直方图适用于直接支持分层查询的结果,因此可以进行向下和向上滚动操作。另外,该结构很好地支持滑动窗口移动和快速查询应答(这两个操作在滑动窗口大小上都是对数的)。实验分析表明我们的方法在w.r.t.基于直方图的滑动窗口缩小技术的最新方法。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号