...
首页> 外文期刊>Distributed Computing >Approximate Distributed Top-k Queries
【24h】

Approximate Distributed Top-k Queries

机译:近似分布式Top-k查询

获取原文
获取原文并翻译 | 示例

摘要

We consider a distributed system where each node keeps a local count for items (similar to elections where nodes are ballot boxes and items are candidates). A top-k query in such a system asks which are the k items whose global count, across all nodes in the system, is the largest. In this paper, we present a Monte Carlo algorithm that outputs, with high probability, a set of k candidates which approximates the top-k items. The algorithm is motivated by sensor networks in that it focuses on reducing the individual communication complexity. In contrast to previous algorithms, the communication complexity depends only on the global scores and not on the partition of scores among nodes. If the number of nodes is large, our algorithm dramatically reduces the communication complexity when compared with deterministic algorithms. We show that the complexity of our algorithm is close to a lower bound on the cell-probe complexity of any non-interactive top-k approximation algorithm. We show that for some natural global distributions (such as the Geometric or Zipf distributions), our algorithm needs only polylogarith-mic number of communication bits per node.
机译:我们考虑一个分布式系统,其中每个节点都保留一个项目的本地计数(类似于选举中节点是投票箱而项目是候选对象的选举)。在这样的系统中,前k个查询会询问在系统中所有节点上的全局计数最大的k个项目中的哪些。在本文中,我们提出了一种蒙特卡洛算法,该算法以较高的概率输出一组k个候选值,这些值近似于前k个项。该算法是由传感器网络驱动的,因为它着重于降低单个通信的复杂性。与以前的算法相比,通信复杂度仅取决于全局得分,而不取决于节点之间的得分分配。如果节点数量很大,与确定性算法相比,我们的算法将大大降低通信复杂性。我们表明,我们的算法的复杂度接近任何非交互式top-k近似算法的单元探针复杂度的下限。我们表明,对于某些自然的全局分布(例如几何分布或Zipf分布),我们的算法仅需要每个节点多对数的通信位数。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号