Implementing Monte Carlo tests with p-value buckets

Gandy Axel; Hahn Georg; Ding Dong

首页> 外文期刊>Scandinavian journal of statistics >Implementing Monte Carlo tests with p-value buckets

【24h】

Implementing Monte Carlo tests with p-value buckets

机译：使用P值桶实施Monte Carlo测试

获取原文

获取原文并翻译 | 示例

掌桥外文数据库（机构版） >>

开具论文收录证明 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

Software packages usually report the results of statistical tests using p-values. Users often interpret these values by comparing them with standard thresholds, for example, 0.1, 1, and 5%, which is sometimes reinforced by a star rating (***, **, and *, respectively). We consider an arbitrary statistical test whose p-value p is not available explicitly, but can be approximated by Monte Carlo samples, for example, by bootstrap or permutation tests. The standard implementation of such tests usually draws a fixed number of samples to approximate p. However, the probability that the exact and the approximated p-value lie on different sides of a threshold (the resampling risk) can be high, particularly for p-values close to a threshold. We present a method to overcome this. We consider a finite set of user-specified intervals that cover [0, 1] and that can be overlapping. We call these p-value buckets. We present algorithms that, with arbitrarily high probability, return a p-value bucket containing p. We prove that for both a bounded resampling risk and a finite runtime, overlapping buckets need to be employed, and that our methods both bound the resampling risk and guarantee a finite runtime for such overlapping buckets. To interpret decisions with overlapping buckets, we propose an extension of the star rating system. We demonstrate that our methods are suitable for use in standard software, including for low p-value thresholds occurring in multiple testing settings, and that they can be computationally more efficient than standard implementations.

机译：软件包通常使用p值报告统计测试结果。用户通常通过将它们与标准阈值进行比较来解释这些值，例如，0.1,1和5％，有时被星级评级（***，**和*分别）加强。我们考虑一个任意统计测试，其P值P不明确可用，但可以通过Monte Carlo样本来近似，例如，通过引导或排列测试。这种测试的标准实施通常绘制固定数量的样本以近似p。然而，精确和近似p值位于阈值（重采样风险）的不同侧的概率可以很高，特别是对于接近阈值的p值。我们提出了一种克服这一点的方法。我们考虑一个有限的用户指定的间隔，覆盖[0,1]，可以重叠。我们称之为这些p值桶。我们提供了任意高概率的算法，返回包含p的p值桶。我们证明，对于有界重采样风险和有限的运行计划，需要采用重叠桶，并且我们的方法都绑定了重采样风险并保证了这种重叠桶的有限运行时。用重叠桶解释决策，我们提出了星级评级系统的延伸。我们证明我们的方法适用于标准软件，包括在多个测试设置中发生的低p值阈值，并且它们可以比标准实现更有效地效率。

著录项

来源
《Scandinavian journal of statistics》 |2020年第3期|950-967|共18页
作者
Gandy Axel; Hahn Georg; Ding Dong;
展开▼
作者单位

Imperial Coll London Dept Math London England;

Imperial Coll London Dept Math London England;

Imperial Coll London Dept Math London England;

展开▼
收录信息
原文格式 PDF
正文语种 eng
中图分类
关键词
algorithm; bootstrap; hypothesis testing; p-value; resampling; sampling;

机译：算法;举止;假设检测;p值;重新采样;抽样;
入库时间 2022-08-18 21:23:31

相似文献

外文文献
中文文献
专利

1. A hybrid method of the sequential Monte Carlo and the Edgeworth expansion for computation of very small p-values in permutation tests [J] . Yang James J., Trucco Elisa M., Buu Anne Statistical methods in medical research . 2019,第10a11期

机译：顺序蒙特卡罗的混合方法和边缘沃思扩展，以计算极小的P值在排列测试中
2. MMCTest-A Safe Algorithm for Implementing Multiple Monte Carlo Tests [J] . AXEL GANDY, GEORG HAHN Scandinavian journal of statistics . 2014,第4期

机译：MMCTest-一种实现多个蒙特卡洛测试的安全算法
3. On using truncated sequential probability ratio test boundaries for Monte Carlo implementation of hypothesis tests [J] . Fay MP, Kim HJ, Hachey M Journal of computational and graphical statistics: A joint publication of American Statistical Association, Institute of Mathematical Statistics, Interface Foundation of North America . 2007,第4期

机译：关于使用截断的顺序概率比检验边界进行假设检验的蒙特卡洛
4. Ein Vergleich von statistischen Tests und Monte-Carlo-Methoden zur Konformitatsbewertung: A comparison of statistical testing and Monte Carlo methods for Conformity assessment [C] . Jos van der Grinten VDI-Fachtagung Messunsicherheits . 2019

机译：统计试验和蒙特卡罗方法的比较符合评级：统计检测与蒙特卡罗符合性评估方法的比较
5. Methods in Hypothesis Testing, Markov Chain Monte Carlo and Neuroimaging Data Analysis [D] . Xu, Xiaojin 2013

机译：假设检验方法，马尔可夫链蒙特卡洛法和神经影像数据分析
6. On Using Truncated Sequential Probability Ratio Test Boundaries for Monte Carlo Implementation of Hypothesis Tests [O] . Michael P. Fay, Hyune-Ju Kim, Mark Hachey -1

机译：关于使用截断的顺序概率比率检验边界进行假设检验的蒙特卡洛方法
7. Implementing Monte Carlo tests with p‐value buckets [O] . Axel Gandy, Georg Hahn, Dong Ding 2019

机译：使用P值桶实施Monte Carlo测试
8. Thermal Scattering Law Data: Implementation and Testing Using the Monte Carlo Neutron Transport Codes COG, MCNP and TART [R] . Cullen, D. E., Hansen, L. F., Lent, E. M., 2003

机译：热散射法则数据：使用蒙特卡罗中子传输代码COG，mCNp和TaRT的实现和测试

Implementing Monte Carlo tests with p-value buckets

摘要

著录项

相似文献

相关主题

期刊订阅