【24h】

Low Discrepancy Sets Yield Approximate Min-Wise Independent Permutation Families

机译:低差异设置可产生大约最小明智独立排列族

获取原文
获取原文并翻译 | 示例

摘要

Motivated by a problem of filtering near-duplicate Web documents, Broder, Charikar, Frieze & Mitzenmacher defined the following notion of ε-approximate min-wise independent permutation families. A multiset F of permutations of {0, 1, ... , n - 1} is such a family if for all K is contained in {0, 1, ... , n - 1} and any x ∈ K, a permutation π chosen uniformly at random from F satisfies | Pr[min{π(K)} = π(x)] - 1/(|K|) ≤ε/(|K|). We show connections of such families with low discrepancy sets for geometric rectangles, and give explicit constructions of such families F of size n~(O(log n)~(1/2)) for ε = 1~Θ, improving upon the previously best-known bound of Indyk. We also present polynomial-size constructions when the min-wise condition is required only for |K| ≤ 2~(O(log~(2/3)n), with ε≥ 2~(-O(log~(2/3)n)).
机译:由于过滤几乎重复的Web文档的问题,Broder,Charikar,Frieze和Mitzenmacher定义了以下ε近似最小独立排列族的概念。如果{0,1,...,n-1}中包含所有K且任何x∈K,a为{0,1,...,n-1}的排列的多集F就是这样的族。 F均匀随机选择的置换π满足| Pr [min {π(K)} =π(x)]-1 /(| K |)≤ε/(| K |)。我们显示了此类族的几何矩形的低差异集的连接,并给出了ε= 1 / n〜Θ的大小为n〜(O(log n)〜(1/2))的此类族F的显式构造,以前最著名的Indyk范围。当仅要求| K |的最小方向条件时,我们还给出了多项式大小的构造。 ≤2〜(O(log〜(2/3)n),ε≥2〜(-O(log〜(2/3)n))。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号