首页> 外文期刊>JMLR: Workshop and Conference Proceedings >Domain Compression and its Application to Randomness-Optimal Distributed Goodness-of-Fit
【24h】

Domain Compression and its Application to Randomness-Optimal Distributed Goodness-of-Fit

机译:域压缩及其在随机性的应用 - 最佳分布式适合性

获取原文
           

摘要

We study goodness-of-fit of discrete distributions in the distributed setting, where samples are divided between multiple users who can only release a limited amount of information about their samples due to various information constraints. Recently, a subset of the authors showed that having access to a common random seed (i.e., shared randomness) leads to a significant reduction in the sample complexity of this problem. In this work, we provide a complete understanding of the interplay between the amount of shared randomness available, the stringency of information constraints, and the sample complexity of the testing problem by characterizing a tight trade-off between these three parameters. We provide a general distributed goodness-of-fit protocol that as a function of the amount of shared randomness interpolates smoothly between the private- and public-coin sample complexities. We complement our upper bound with a general framework to prove lower bounds on the sample complexity of this testing problems under limited shared randomness. Finally, we instantiate our bounds for the two archetypal information constraints of communication and local privacy, and show that our sample complexity bounds are optimal as a function of all the parameters of the problem, including the amount of shared randomness. A key component of our upper bounds is a new primitive of extit{domain compression}, a tool that allows us to map distributions to a much smaller domain size while preserving their pairwise distances, using a limited amount of randomness.
机译:我们研究了分布式环境中的离散分布的拟合,其中样本在多个用户之间划分,在多个用户之间,由于各种信息约束,只能释放有关其样本的有限数量的信息。最近,作者的子集表明,具有通用随机种子(即共享随机性)导致该问题的样本复杂性的显着降低。在这项工作中,我们通过在这三个参数之间表征紧张的权衡,完全了解可用的共享随机性的数量,信息约束的严格性,以及测试问题的样本复杂性。我们提供了一般的分布式拟合优性协议,作为共享随机性的数量的函数,在私有和公共硬币样本复杂性之间平稳地插入。我们与一般框架补充了我们的上限,以证明在有限的共享随机性下该测试问题的样本复杂性下限。最后,我们将我们的界限实例化了两个原型信息的通信和本地隐私的限制,并表明我们的示例复杂性范围是作为问题所有参数的函数的最佳状态,包括共享随机性的量。我们上限的一个关键组成部分是 extent {域压缩}的新基材,该工具允许我们将分布映射到更小的域大小,同时使用有限量的随机性保持其成对距离。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号