...
首页> 外文期刊>The VLDB Journal >Sliding-window top-k queries on uncertain streams
【24h】

Sliding-window top-k queries on uncertain streams

机译:不确定流上的滑动窗口top-k查询

获取原文
获取原文并翻译 | 示例
           

摘要

Recently, due to the imprecise nature of the data generated from a variety of streaming applications, such as sensor networks, query processing on uncertain data streams has become an important problem. However, all the existing works on uncertain data streams study unbounded streams. In this paper, we take the first step towards the important and challenging problem of answering sliding-window queries on uncertain data streams, with a focus on one of the most important types of queries—top-k queries. It is nontrivial to find an efficient solution for answering sliding-window top-k queries on uncertain data streams, because challenges not only stem from the strict space and time requirements of processing both arriving and expiring tuples in high-speed streams, but also rise from the exponential blowup in the number of possible worlds induced by the uncertain data model. In this paper, we design a unified framework for processing sliding-window top-k queries on uncertain streams. We show that all the existing top-k definitions in the literature can be plugged into our framework, resulting in several succinct synopses that use space much smaller than the window size, while they are also highly efficient in terms of processing time. We also extend our framework to answering multiple top-k queries. In addition to the theoretical space and time bounds that we prove for these synopses, we present a thorough experimental report to verify their practical efficiency on both synthetic and real data.
机译:最近,由于从各种流应用程序(例如传感器网络)生成的数据的不精确特性,对不确定数据流的查询处理已成为一个重要问题。但是,关于不确定数据流的所有现有工作都研究无界流。在本文中,我们朝着解决不确定数据流上的滑动窗口查询这一重要且具有挑战性的问题迈出了第一步,重点是最重要的查询类型之一-top-k查询。找到有效的解决方案来回答不确定数据流上的滑动窗口top-k查询并非易事,因为挑战不仅源于处理高速流中到达和终止元组的严格的空间和时间要求,而且还会增加由不确定数据模型引起的可能世界数量的指数爆炸。在本文中,我们设计了一个统一的框架来处理不确定流上的滑动窗口top-k查询。我们证明了文献中所有现有的top-k定义都可以插入我们的框架中,从而产生几个简洁的提要,这些提要使用的空间远小于窗口大小,同时在处理时间方面也非常高效。我们还将框架扩展为回答多个top-k查询。除了我们为这些提要证明的理论空间和时限外,我们还提供了一份详尽的实验报告,以验证它们在合成数据和实际数据上的实际效率。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号