...
首页> 外文期刊>Contemporary clinical trials >A small p-value from an observed data is not evidence of adequate power for future similar-sized studies: a cautionary note.
【24h】

A small p-value from an observed data is not evidence of adequate power for future similar-sized studies: a cautionary note.

机译:注意事项:从观察到的数据中得出的小p值并不足以证明未来进行类似规模的研究具有足够的功效。

获取原文
获取原文并翻译 | 示例
           

摘要

BACKGROUND: p-values are ubiquitous in medical research, but are often misunderstood. In addition to being misused or perhaps even abused at post-statistical analysis stage of making scientific inference and interpretations, p-values can also be a source of confusion at the design stage. METHODS: Application of standard test statistic on observed data may result in a small p-value which in turn may give the impression that a new study that has the same sample size as the observed data, perhaps even smaller, would have adequate power. We used re-sampling method and computed statistical power to illustrate the fallacy of this conclusion. We have also calculated power using analytical formulae. RESULTS: We analyzed data consisting of two group comparisons with binary as well as continuous outcome variables. For the binary outcome, the event rates for the outcome of interest in the illustrative data were 15/43 (35%) and 22/34 (65%), respectively (p-value=0.0093). Using these data, a bootstrap-based empirical power was estimated to be 75.4%. One random sample with only two-third of the original data had a p-value of 0.0066, but only an empirical power of 57.4%. Similar results were observed for a continuous outcome. CONCLUSION: Our results show that the number of zeros after the decimal point in a p-value from an observed sample cannot and should not be used to gauge the adequacy of sample size for a future study that is expected to have sufficient power to detect an effect as big as the observed.
机译:背景:p值在医学研究中无处不在,但常常被误解。 p值除了在进行科学推断和解释的后统计分析阶段被滥用甚至可能被滥用之外,在设计阶段p值也可能引起混乱。方法:在观察到的数据上应用标准检验统计量可能会产生较小的p值,这反过来可能给人的印象是,与观察到的数据具有相同样本量甚至可能更小的新研究将具有足够的功效。我们使用重采样方法和计算的统计功效来说明该结论的谬误。我们还使用解析公式计算了功率。结果:我们分析了数据,该数据由两组比较以及二元和连续结果变量组成。对于二元结果,说明性数据中感兴趣结果的事件发生率分别为15/43(35%)和22/34(65%)(p值= 0.0093)。使用这些数据,基于引导的经验能力估计为75.4%。一个只有原始数据三分之二的随机样本的p值为0.0066,但经验功效仅为57.4%。观察到连续结果相似的结果。结论:我们的结果表明,观察到的样本的p值中小数点后的零个数不能且不应用于衡量样本量是否足够用于未来的研究,该研究有望具有足够的检测力。效果与观察到的一样大。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号