首页> 外文会议>IEEE Annual Symposium on Foundations of Computer Science >Element Distinctness, Frequency Moments, and Sliding Windows
【24h】

Element Distinctness, Frequency Moments, and Sliding Windows

机译:元素区别度,频率矩和滑动窗口

获取原文

摘要

We derive new time-space tradeoff lower bounds and algorithms for exactly computing statistics of input data, including frequency moments, element distinctness, and order statistics, that are simple to calculate for sorted data. In particular, we develop a randomized algorithm for the element distinctness problem whose time T and space S satisfy T in O(n3/2/S1/2), smaller than previous lower bounds for comparison-based algorithms, showing that element distinctness is strictly easier than sorting for randomized branching programs. This algorithm is based on a new time- and space-efficient algorithm for finding all collisions of a function f from a finite set to itself that are reachable by iterating f from a given set of starting points. We further show that our element distinctness algorithm can be extended at only a polylogarithmic factor cost to solve the element distinctness problem over sliding windows, where the task is to take an input of length 2n-1 and produce an output for each window of length n, giving n outputs in total. In contrast, we show a time-space tradeoff lower bound of T in Omega(n2/S) for randomized multi-way branching programs, and hence standard RAM and word-RAM models, to compute the number of distinct elements, F_0, over sliding windows. The same lower bound holds for computing the low-order bit of F_0 and computing any frequency moment F_k for k not equal 1. This shows that frequency moments F_k not equal 1 and even the decision problem F_0bmod 2 are strictly harder than element distinctness. We provide even stronger separations on average for inputs from [n]. We complement this lower bound with a T in O(n2/S) comparison-based deterministic RAM algorithm for exactly computing F_k over sliding windows, nearly matching both our general lower bound for the sliding-window version and the comparison-based lower bounds for a single instance of the problem. We also consider the computations of order statistics over sliding windows.
机译:我们推导出新的时空折衷下界和算法,以精确计算输入数据的统计信息,包括频率矩,元素差异和阶次统计信息,这些对于分类数据而言很容易计算。特别是,我们针对元素唯一性问题开发了一种随机算法,该问题的时间T和空间S满足O(n3 / 2 / S1 / 2)中的T,小于以前基于比较的算法的下界,这表明元素唯一性严格比对随机分支程序进行排序更容易。该算法基于一种新的节省时间和空间的算法,用于从一个有限的集合到其自身查找函数f的所有碰撞,这些碰撞可以通过从给定的一组起始点迭代f来解决。我们进一步表明,我们的元素唯一性算法仅需以多对数因子成本即可扩展,以解决滑动窗口上的元素唯一性问题,其中的任务是获取长度为2n-1的输入并为每个长度为n的窗口生成输出,总共给出n个输出。相比之下,我们显示了Omega(n2 / S)中随机多路分支程序的T的时空折衷下界,因此显示了标准RAM和word-RAM模型,以计算不同元素F_0的数量,推拉窗。相同的下限适用于计算F_0的低阶位和计算k不等于1的任何频率矩F_k。这表明频率矩F_k不等于1甚至决策问题F_0bmod 2都比元素唯一性严格。我们平均为[n]的输入提供了更强的分离。我们在基于O(n2 / S)比较的确定性RAM算法中使用T来补充此下限,以在滑动窗口上精确计算F_k,几乎与我们的滑动窗口版本的下限和基于比较的下限相对应。问题的单个实例。我们还考虑了滑动窗口上订单统计的计算。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号