首页> 外文会议>International workshop on approximation and online algorithms >A Communication-Efficient Distributed Data Structure for Top-k and k-Select Queries
【24h】

A Communication-Efficient Distributed Data Structure for Top-k and k-Select Queries

机译:用于前k个查询和k个选择查询的一种高效通信的分布式数据结构

获取原文

摘要

We consider the scenario of n sensor nodes observing streams of data. The nodes are connected to a central server whose task it is to compute some function over all data items observed by the nodes. In our case, there exists a total order on the data items observed by the nodes. Our goal is to compute the k currently lowest observed values or a value with rank in [(1 —ε)k, (l + ε)k] with probability (1—δ). We propose solutions for these problems in an extension of the distributed monitoring model where the server can send broadcast messages to all nodes for unit cost. We want to minimize communication over multiple time steps where there are m updates to a node's value in between queries. The result is composed of two main parts, which each may be of independent interest: 1. Protocols which answer Top-k and k-Select queries. These protocols are memoryless in the sense that they gather all information at the time of the request. 2. A dynamic data structure which tracks for every k an element close to k. We describe how to combine the two parts to receive a protocol answering the stated queries over multiple time steps. Overall, for Top-k queries we use O(k + log m + log log n) and for k-Select queries O(1/ε~2 log 1/δ + log m + log~2 log n) messages in expectation. These results are shown to be asymptotically tight if m is not too small.
机译:我们考虑n个传感器节点观察数据流的情况。节点连接到中央服务器,该服务器的任务是对节点观察到的所有数据项计算某些功能。在我们的情况下,节点观察到的数据项存在总顺序。我们的目标是计算k个当前最低的观测值或具有[(1-ε)k,(l +ε)k]的概率为(1-δ)的值。我们在分布式监视模型的扩展中提出了针对这些问题的解决方案,在该模型中,服务器可以将广播消息发送给所有节点,并收取单位成本。我们希望最大程度地减少多个时间步之间的通信,在两次查询之间,节点的值有m个更新。结果由两个主要部分组成,每个部分可能具有独立的意义:1.回答Top-k和k-Select查询的协议。这些协议在它们在请求时收集所有信息的意义上说是无记忆的。 2.一种动态数据结构,它为每个k跟踪一个接近k的元素。我们描述了如何结合这两个部分来接收一个协议,该协议在多个时间步上回答所陈述的查询。总体而言,对于前k个查询,我们期望使用O(k + log m + log log n),对于k-Select查询,我们期望使用O(1 /ε〜2 log 1 /δ+ log m + log〜2 log n)消息。如果m不太小,这些结果将显示为渐近严格的。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号