首页> 美国卫生研究院文献>NPG Open Access >The cost of large numbers of hypothesis tests on power effect size and sample size
【2h】

The cost of large numbers of hypothesis tests on power effect size and sample size

机译:关于功效效应量和样本量的大量假设检验的成本

代理获取
本网站仅为用户提供外文OA文献查询和代理获取服务,本网站没有原文。下单后我们将采用程序或人工为您竭诚获取高质量的原文,但由于OA文献来源多样且变更频繁,仍可能出现获取不到、文献不完整或与标题不符等情况,如果获取不到我们将提供退款服务。请知悉。

摘要

Advances in high-throughput biology and computer science are driving an exponential increase in the number of hypothesis tests in genomics and other scientific disciplines. Studies using current genotyping platforms frequently include a million or more tests. In addition to the monetary cost, this increase imposes a statistical cost owing to the multiple testing corrections needed to avoid large numbers of false-positive results. To safeguard against the resulting loss of power, some have suggested sample sizes on the order of tens of thousands that can be impractical for many diseases or may lower the quality of phenotypic measurements. This study examines the relationship between the number of tests on the one hand and power, detectable effect size or required sample size on the other. We show that once the number of tests is large, power can be maintained at a constant level, with comparatively small increases in the effect size or sample size. For example at the 0.05 significance level, a 13% increase in sample size is needed to maintain 80% power for ten million tests compared with one million tests, whereas a 70% increase in sample size is needed for 10 tests compared with a single test. Relative costs are less when measured by increases in the detectable effect size. We provide an interactive Excel calculator to compute power, effect size or sample size when comparing study designs or genome platforms involving different numbers of hypothesis tests. The results are reassuring in an era of extreme multiple testing.
机译:高通量生物学和计算机科学的进步正在推动基因组学和其他科学学科中的假设检验数量呈指数增长。使用当前基因分型平台的研究通常包括一百万或更多的测试。除了金钱成本外,由于需要进行多次测试校正才能避免大量假阳性结果,因此这种增加会带来统计成本。为了防止由此造成的功率损失,一些建议的样本数量可能会达到数万,这对于许多疾病而言可能是不切实际的,或者可能降低表型测量的质量。这项研究一方面检验了测试数量与功效,可检测到的效应量或所需样本量之间的关系。我们表明,一旦测试数量很多,功效就可以保持在恒定水平,而效应量或样本量的增加相对较小。例如,在0.05的显着性水平下,与一百万个测试相比,一百万个测试需要增加13%的样本量才能保持80%的功效,而与单个测试相比,十个测试需要增加70%的样本量。当通过可检测的效应量的增加来衡量时,相对成本会更低。当比较涉及不同数量假设检验的研究设计或基因组平台时,我们提供了一个交互式Excel计算器来计算功效,效应大小或样本大小。在极端多重测试的时代,结果令人放心。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
代理获取

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号