...
首页> 外文期刊>LIPIcs : Leibniz International Proceedings in Informatics >Near-Optimal Closeness Testing of Discrete Histogram Distributions
【24h】

Near-Optimal Closeness Testing of Discrete Histogram Distributions

机译:离散直方图分布的近最佳近立测试

获取原文
   

获取外文期刊封面封底 >>

       

摘要

We investigate the problem of testing the equivalence between two discrete histograms. A k-histogram over [n] is a probability distribution that is piecewise constant over some set of k intervals over [n]. Histograms have been extensively studied in computer science and statistics. Given a set of samples from two k-histogram distributions p, q over [n], we want to distinguish (with high probability) between the cases that p = q and ||p ? q||_1 >= epsilon. The main contribution of this paper is a new algorithm for this testing problem and a nearly matching information-theoretic lower bound. Specifically, the sample complexity of our algorithm matches our lower bound up to a logarithmic factor, improving on previous work by polynomial factors in the relevant parameters. Our algorithmic approach applies in a more general setting and yields improved sample upper bounds for testing closeness of other structured distributions as well.
机译:我们调查在两个离散直方图之间测试等效的问题。 [n]的K-tiblogram是一种概率分布,其在[n]上的某些组k间隔内是分段常数。直方图在计算机科学和统计中被广泛研究过。给定一组来自​​两个k-tiotogram分布p的样本,q在[n]上,我们希望在p = q和|| p的情况下区分(具有高概率)? q || _1> = epsilon。本文的主要贡献是该测试问题的新算法,以及几乎匹配的信息 - 理论下限。具体而言,我们的算法的样本复杂性与我们的下限与对数因子相匹配,通过相关参数中的多项式因子改善了以前的工作。我们的算法方法适用于更常规的设置,并产生改进的样本上限,用于测试其他结构化分布的近距离。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号