首页> 外文会议>International workshop on randomization and approximation techniques in computer science >Low Discrepancy Sets Yield Approximate Min-Wise Independent Permutation Families
【24h】

Low Discrepancy Sets Yield Approximate Min-Wise Independent Permutation Families

机译:低差异集产量近似闽独立排列族

获取原文

摘要

Motivated by a problem of filtering near-duplicate Web documents, Broder, Charikar, Frieze & Mitzenmacher defined the following notion of ε-approximate min-wise independent permutation families. A multiset F of permutations of {0, 1, ... , n - 1} is such a family if for all K is contained in {0, 1, ... , n - 1} and any x ∈ K, a permutation π chosen uniformly at random from F satisfies | Pr[min{π(K)} = π(x)] - 1/(|K|) ≤ε/(|K|). We show connections of such families with low discrepancy sets for geometric rectangles, and give explicit constructions of such families F of size n~(O(log n)~(1/2)) for ε = 1/n~Θ, improving upon the previously best-known bound of Indyk. We also present polynomial-size constructions when the min-wise condition is required only for |K| ≤ 2~(O(log~(2/3)n), with ε≥ 2~(-O(log~(2/3)n)).
机译:通过过滤近重型Web文档的问题,Broder,Charikar,Frieze&Mitzenmacher定义了以下ε-近似闽独立排列家庭的概念。如果所有k包含在{0,1,...,n - 1}中,则{0,1,...,n - 1}的序列是这样的家庭的多目标F是{0,1,...,n - 1}和任何x k,a从f满足的随机均匀选择偏置π Pr [min {π(k)} =π(x)] - 1 /(| k |)≤ε/(| k |)。我们展示了对几何矩形的低差异集的这些家庭的连接,并为ε= 1 / n~θ的尺寸N〜(O(log n)〜(1/2))提供了明确的结构f = 1 / n〜θ。以前最着名的indyk的界限。当仅适用于| k |时,我们还存在多项式尺寸结构≤2〜(O(log〜(2/3)n),ε≥2〜(-o(log〜(2/3)n))))。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号