首页> 外文期刊>JMLR: Workshop and Conference Proceedings >A Family of Exact Goodness-of-Fit Tests for High-Dimensional Discrete Distributions
【24h】

A Family of Exact Goodness-of-Fit Tests for High-Dimensional Discrete Distributions

机译:一个精确的高维分散分布测试的家庭精确的健康测试

获取原文
           

摘要

The objective of goodness-of-fit testing is to assess whether a dataset of observations is likely to have been drawn from a candidate probability distribution. This paper presents a rank-based family of goodness-of-fit tests that is specialized to discrete distributions on high-dimensional domains. The test is readily implemented using a simulation-based, linear-time procedure. The testing procedure can be customized by the practitioner using knowledge of the underlying data domain. Unlike most existing test statistics, the proposed test statistic is distribution-free and its exact (non-asymptotic) sampling distribution is known in closed form. We establish consistency of the test against all alternatives by showing that the test statistic is distributed as a discrete uniform if and only if the samples were drawn from the candidate distribution. We illustrate its efficacy for assessing the sample quality of approximate sampling algorithms over combinatorially large spaces with intractable probabilities, including random partitions in Dirichlet process mixture models and random lattices in Ising models.
机译:健美测试的目的是评估观察的数据集是否可能已经从候选概率分布中汲取。本文介绍了一种基于级别的健康测试,专门用于在高维结构上的离散分布。使用基于仿真的线性时间过程容易地实现测试。可以使用底层数据域的知识来定制测试程序。与大多数现有的测试统计不同,所提出的测试统计是无分布的,其精确的(非渐近)采样分布以封闭形式已知。我们通过表明如果且仅当样本被从候选分布中汲取的样本且仅当样本被从候选分布中抽出时,测试统计数据可以将测试统计学分配的一致性。我们说明了在具有难以处理的概率的组合大型空间上评估近似采样算法的样本质量的功效,包括Dirichlet过程混合模型中的随机分区和在ising模型中的随机格子。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号