...
首页> 外文期刊>Knowledge and Data Engineering, IEEE Transactions on >Query Planning for Continuous Aggregation Queries over a Network of Data Aggregators
【24h】

Query Planning for Continuous Aggregation Queries over a Network of Data Aggregators

机译:通过数据聚合器网络进行连续聚合查询的查询计划

获取原文
获取原文并翻译 | 示例
           

摘要

Continuous queries are used to monitor changes to time varying data and to provide results useful for online decision making. Typically a user desires to obtain the value of some aggregation function over distributed data items, for example, to know value of portfolio for a client; or the AVG of temperatures sensed by a set of sensors. In these queries a client specifies a coherency requirement as part of the query. We present a low-cost, scalable technique to answer continuous aggregation queries using a network of aggregators of dynamic data items. In such a network of data aggregators, each data aggregator serves a set of data items at specific coherencies. Just as various fragments of a dynamic webpage are served by one or more nodes of a content distribution network, our technique involves decomposing a client query into subqueries and executing subqueries on judiciously chosen data aggregators with their individual subquery incoherency bounds. We provide a technique for getting the optimal set of subqueries with their incoherency bounds which satisfies client query's coherency requirement with least number of refresh messages sent from aggregators to the client. For estimating the number of refresh messages, we build a query cost model which can be used to estimate the number of messages required to satisfy the client specified incoherency bound. Performance results using real-world traces show that our cost-based query planning leads to queries being executed using less than one third the number of messages required by existing schemes.
机译:连续查询用于监视时变数据的变化,并提供可用于在线决策的结果。通常,用户希望获得分布式数据项上某些聚合函数的值,例如,了解客户的投资组合的值;或一组传感器感测到的温度的平均值。在这些查询中,客户端将一致性要求指定为查询的一部分。我们提出了一种低成本,可扩展的技术,可使用动态数据项的聚合器网络来回答连续聚合查询。在这样的数据聚合器网络中,每个数据聚合器以特定的一致性服务一组数据项。就像内容分发网络的一个或多个节点为动态网页的各个片段提供服务一样,我们的技术包括将客户端查询分解为子查询,并在经过明智选择的数据聚合器上以其各自的子查询不一致性限制执行子查询。我们提供了一种技术,用于获得具有其不相关性范围的子查询的最佳集合,该子查询可以满足客户查询的相关性要求,并且从聚合器发送到客户的刷新消息数量最少。为了估计刷新消息的数量,我们建立了一个查询成本模型,该模型可用于估计满足客户端指定的不一致性限制所需的消息数量。使用实际跟踪得出的性能结果表明,我们基于成本的查询计划导致执行查询所用的消息少于现有方案所需消息的三分之一。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号