Implementing Monte Carlo tests with p‐value buckets


Abstract

Software packages usually report the results of statistical tests using p-values. Users often interpret these by comparing them to standard thresholds, e.g. 0.1%, 1% and 5%, which is sometimes reinforced by a star rating (***, **, *). In this article, we consider an arbitrary statistical test whose p-value p is not available explicitly, but can be approximated by Monte Carlo samples, e.g. by bootstrap or permutation tests. The standard implementation of such tests usually draws a fixed number of samples to approximate p. However, the probability that the exact and the approximated p-value lie on different sides of a threshold (the resampling risk) can be high, particularly for p-values close to a threshold. We present a method to overcome this. We consider a finite set of user-specified intervals which cover [0,1] and which can be overlapping. We call these p-value buckets. We present algorithms that, with arbitrarily high probability, return a p-value bucket containing p. We prove that for both a bounded resampling risk and a finite runtime, overlapping buckets need to be employed, and that our methods both bound the resampling risk and guarantee a finite runtime for such overlapping buckets. To interpret decisions with overlapping buckets, we propose an extension of the star rating system. We demonstrate that our methods are suitable for use in standard software, including for low p-values occurring in multiple testing settings, and that they can be computationally more efficient than standard implementations.
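
The idea of returning a bucket rather than a point estimate can be illustrated with a minimal sketch. It is not the algorithm of the article: it swaps in a simple Hoeffding confidence interval with a union bound over the number of draws, and stops as soon as that interval fits inside one bucket. The bucket endpoints, the function name `find_bucket`, the callback `draw_exceedance`, and the parameters `eps` and `max_draws` are all assumptions made for illustration.

```python
import math
import random

# Hypothetical overlapping buckets covering [0, 1]; the endpoints are an
# illustrative assumption, not taken from the article.
BUCKETS = [
    (0.0, 0.001), (0.0005, 0.002), (0.001, 0.01), (0.008, 0.012),
    (0.01, 0.05), (0.045, 0.055), (0.05, 1.0),
]


def find_bucket(draw_exceedance, buckets=BUCKETS, eps=0.001, max_draws=10_000_000):
    """Draw until a confidence interval for p fits inside one bucket.

    draw_exceedance() should return True when a resampled test statistic is
    at least as extreme as the observed one, i.e. it is a Bernoulli(p) draw.
    Returns (bucket, number_of_draws); bucket is None if the cap is reached.
    """
    exceed = 0
    for n in range(1, max_draws + 1):
        exceed += bool(draw_exceedance())
        # Hoeffding half-width with error budget eps / (n (n + 1)) at step n.
        # The budgets sum to eps over all n, so by a union bound p lies in
        # every interval simultaneously with probability at least 1 - eps.
        half = math.sqrt(math.log(2.0 * n * (n + 1) / eps) / (2.0 * n))
        lo = max(0.0, exceed / n - half)
        hi = min(1.0, exceed / n + half)
        for a, b in buckets:
            if a <= lo and hi <= b:
                return (a, b), n
    return None, max_draws  # safety cap reached without a decision


# Usage: simulate a test whose true (unknown) p-value is 0.3.
rng = random.Random(1)
bucket, draws = find_bucket(lambda: rng.random() < 0.3)
print(bucket, draws)
```

Because the interval width shrinks to zero and every point of [0,1] lies in the interior (relative to [0,1]) of at least one of these overlapping buckets, the loop eventually stops for any p. With non-overlapping buckets, a p-value exactly on a threshold would never be enclosed. The Hoeffding bound is loose, so very small p-values can need a large number of draws under this sketch.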


Bibliographic details

Authors
Axel Gandy; Georg Hahn; Dong Ding
Year 2019
Original format PDF

Similar literature


1. Implementing Monte Carlo tests with p-value buckets [J]. Gandy Axel, Hahn Georg, Ding Dong. Scandinavian Journal of Statistics. 2020, No. 3
2. MMCTest-A Safe Algorithm for Implementing Multiple Monte Carlo Tests [J]. Axel Gandy, Georg Hahn. Scandinavian Journal of Statistics. 2014, No. 4
3. On using truncated sequential probability ratio test boundaries for Monte Carlo implementation of hypothesis tests [J]. Fay MP, Kim HJ, Hachey M. Journal of Computational and Graphical Statistics: A joint publication of American Statistical Association, Institute of Mathematical Statistics, Interface Foundation of North America. 2007, No. 4
4. Monte Carlo testing and verification of numerical algorithm implementations [C]. Pokrajac David D., Imran Abdullah-Al-Zubaer, Bakic Predrag R. International Conference on Telecommunications in Modern Satellite, Cable and Broadcasting Services. 2015
5. Methods in Hypothesis Testing, Markov Chain Monte Carlo and Neuroimaging Data Analysis [D]. Xu, Xiaojin. 2013
6. On Using Truncated Sequential Probability Ratio Test Boundaries for Monte Carlo Implementation of Hypothesis Tests [O]. Michael P. Fay, Hyune-Ju Kim, Mark Hachey
7. MMCTest – A Safe Algorithm for Implementing Multiple Monte Carlo Tests [O]. Axel G, Georg Hahn. 2016
8. Thermal Scattering Law Data: Implementation and Testing Using the Monte Carlo Neutron Transport Codes COG, MCNP and TART [R]. Cullen, D. E., Hansen, L. F., Lent, E. M. 2003
