首页> 外文会议>Proceedings of the Sixteenth international world wide web conference(WWW2007) >Optimized Query Planning of Continuous Aggregation Queries in Dynamic Data Dissemination Networks
【24h】

Optimized Query Planning of Continuous Aggregation Queries in Dynamic Data Dissemination Networks

机译:动态数据分发网络中连续聚合查询的优化查询计划

获取原文
获取原文并翻译 | 示例

摘要

Continuous queries are used to monitor changes to time varying data and to provide results useful for online decision making. Typically a user desires to obtain the value of some aggregation function over distributed data items, for example, to know (a) the average of temperatures sensed by a set of sensors (b) the value of index of mid-cap stocks. In these queries a client specifies a coherency requirement as part of the query. In this paper we present a low-cost, scalable technique to answer continuous aggregation queries using a content distribution network of dynamic data items. In such a network of data aggregators, each data aggregator serves a set of data items at specific coherencies. Just as various fragments of a dynamic web-page are served by one or more nodes of a content distribution network, our technique involves decomposing a client query into sub-queries and executing sub-queries on judiciously chosen data aggregators with their individual sub-query incoherency bounds. We provide a technique of getting the optimal query plan (i.e., set of sub-queries and their chosen data aggregators) which satisfies client query's coherency requirement with least cost, measured in terms of the number of refresh messages sent from aggregators to the client. For estimating query execution cost, we build a continuous query cost model which can be used to estimate the number of messages required to satisfy the client specified incoherency bound. Performance results using real-world traces show that our cost based query planning leads to queries being executed using less than one third the number of messages required by existing schemes.
机译:连续查询用于监视时变数据的变化,并提供可用于在线决策的结果。通常,用户希望获得分布式数据项上某些聚合函数的值,例如,以了解(a)一组传感器感测到的平均温度(b)中型股指数的值。在这些查询中,客户端将一致性要求指定为查询的一部分。在本文中,我们提出了一种低成本,可扩展的技术,该技术使用动态数据项的内容分发网络来回答连续聚合查询。在这样的数据聚合器网络中,每个数据聚合器以特定的一致性服务一组数据项。正如动态网页的各个片段由内容分发网络的一个或多个节点提供服务一样,我们的技术包括将客户端查询分解为子查询,并在经过明智选择的数据聚合器及其子查询上执行子查询。不连贯范围。我们提供了一种获取最佳查询计划(即子查询集及其选择的数据聚合器)的技术,该技术可以以最小的成本来满足客户端查询的一致性要求,以从聚合器发送到客户端的刷新消息数来衡量。为了估算查询执行成本,我们建立了一个连续查询成本模型,该模型可用于估算满足客户端指定的不一致性限制所需的消息数。使用实际跟踪得出的性能结果表明,我们基于成本的查询计划导致执行查询时所用消息的数量不到现有方案所需消息的三分之一。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号