首页> 外文期刊>Electronic Colloquium on Computational Complexity >Which Distribution Distances are Sublinearly Testable?
【24h】

Which Distribution Distances are Sublinearly Testable?

机译:哪些分布距离可以进行亚线性检验?

获取原文
           

摘要

Given samples from an unknown distribution p and a description of a distribution q , are p and q close or far? This question of "identity testing" has received significant attention in the case of testing whether p and q are equal or far in total variation distance. However, in recent work, the following questions have been been critical to solving problems at the frontiers of distribution testing: -Alternative Distances: Can we test whether p and q are far in other distances, say Hellinger? -Tolerance: Can we test when p and q are close, rather than equal? And if so, close in which distances? Motivated by these questions, we characterize the complexity of distribution testing under a variety of distances, including total variation, 2 , Hellinger, Kullback-Leibler, and 2 . For each pair of distances d 1 and d 2 , we study the complexity of testing if p and q are close in d 1 versus far in d 2 , with a focus on identifying which problems allow strongly sublinear testers (i.e., those with complexity O ( n 1 ? ) for some 0"> 0 where n is the size of the support of the distributions p and q ). We provide matching upper and lower bounds for each case. We also study these questions in the case where we only have samples from q (equivalence testing), showing qualitative differences from identity testing in terms of when tolerance can be achieved. Our algorithms fall into the classical paradigm of 2 -statistics, but require crucial changes to handle the challenges introduced by each distance we consider. Finally, we survey other recent results in an attempt to serve as a reference for the complexity of various distribution testing problems.
机译:给定来自未知分布p的样本和分布q的描述,p和q是接近还是远?在检验p和q的总变化距离是否相等或相距较远的情况下,“身份测试”这一问题受到了极大的关注。但是,在最近的工作中,以下问题对于解决分布测试前沿的问题至关重要:-替代距离:Hellinger说,我们能否测试p和q是否在其他距离上较远? -容差:我们可以测试p和q何时接近而不是相等吗?如果是这样,请在哪些距离内关闭?受这些问题的启发,我们描述了各种距离下的分布测试的复杂性,包括总变化2,Hellinger,Kullback-Leibler和2。对于每对距离d 1和d 2,我们研究了d 1中p和q相对于d 2中远时测试p和q的复杂度,重点在于确定哪些问题允许强次线性测试器(即,那些复杂度为O的问题) (n 1?)对于0“> 0,其中n是分布p和q的支撑大小。我们为每种情况提供匹配的上下限,我们还研究了只有来自q(等效性测试)的样本显示了与何时进行容忍度相同的测试,我们的算法属于2统计的经典范式,但需要进行重大更改以应对我们考虑的每个距离所带来的挑战。最后,我们调查了其他最新结果,以作为各种分布测试问题的复杂性的参考。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号