Efficiently measuring recognition performance with sparse data



Abstract

We examine methods for measuring performance in signal-detection-like tasks when each participant provides only a few observations. Monte Carlo simulations demonstrate that standard statistical techniques applied to a d′ analysis can lead to large numbers of Type Ⅰ errors (incorrectly rejecting a hypothesis of no difference). Various statistical methods were compared in terms of their Type Ⅰ and Type Ⅱ error (incorrectly accepting a hypothesis of no difference) rates. Our conclusions are the same whether these two types of errors are weighted equally or Type Ⅰ errors are weighted more heavily. The most promising method is to combine an aggregate d′ measure with a percentile bootstrap confidence interval, a computer-intensive nonparametric method of statistical inference. Researchers who prefer statistical techniques more commonly used in psychology, such as a repeated measures t test, should use γ (Goodman & Kruskal, 1954), since it performs slightly better than or nearly as well as d′. In general, when repeated measures t tests are used, γ is more conservative than d′: It makes more Type Ⅱ errors, but its Type Ⅰ error rate tends to be much closer to that of the traditional .05 α level. It is somewhat surprising that γ performs as well as it does, given that the simulations that generated the hypothetical data conformed completely to the d′ model. Analyses in which H - FA was used had the highest Type Ⅰ error rates. Detailed simulation results can be downloaded from www.psychonomic.org/archive/Schooler-BRM-2004.zip.
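The three measures named in the abstract (d′, γ, and H - FA) and the percentile bootstrap interval can be illustrated with a minimal Python sketch. This is not the authors' simulation code. All function names are hypothetical, the log-linear correction (adding 0.5 to each cell) is an assumption made here so that d′ stays finite with very few observations, and the resampling scheme (resampling participants, then pooling counts) is one plausible reading of "aggregate d′ with a percentile bootstrap confidence interval".

```python
import math
import random
from statistics import NormalDist


def d_prime(hits, misses, fas, crs):
    """d' = z(H) - z(FA), with a log-linear correction (assumption)
    so that rates of 0 or 1 do not produce infinite z scores."""
    h = (hits + 0.5) / (hits + misses + 1.0)
    fa = (fas + 0.5) / (fas + crs + 1.0)
    z = NormalDist().inv_cdf
    return z(h) - z(fa)


def h_minus_fa(hits, misses, fas, crs):
    """Hit rate minus false-alarm rate, uncorrected."""
    return hits / (hits + misses) - fas / (fas + crs)


def gk_gamma(hits, misses, fas, crs):
    """Goodman-Kruskal gamma for a 2x2 table: (ad - bc) / (ad + bc),
    with a = hits, b = misses, c = false alarms, d = correct rejections."""
    concordant = hits * crs
    discordant = misses * fas
    if concordant + discordant == 0:
        return float("nan")  # gamma is undefined for this table
    return (concordant - discordant) / (concordant + discordant)


def aggregate_d_prime(tables):
    """Pool the counts of all participants, then compute one d'."""
    totals = [sum(column) for column in zip(*tables)]
    return d_prime(*totals)


def percentile_bootstrap_ci(pairs, n_boot=2000, alpha=0.05, seed=0):
    """Percentile bootstrap CI for the difference in aggregate d'
    between two within-subject conditions.

    pairs: one (table_a, table_b) tuple per participant, each table
    being (hits, misses, false alarms, correct rejections).
    """
    rng = random.Random(seed)
    n = len(pairs)
    diffs = []
    for _ in range(n_boot):
        # Resample participants with replacement.
        sample = [pairs[rng.randrange(n)] for _ in range(n)]
        a = aggregate_d_prime([p[0] for p in sample])
        b = aggregate_d_prime([p[1] for p in sample])
        diffs.append(a - b)
    diffs.sort()
    lo_idx = max(0, math.floor((alpha / 2) * n_boot))
    hi_idx = min(n_boot - 1, math.ceil((1 - alpha / 2) * n_boot) - 1)
    return diffs[lo_idx], diffs[hi_idx]


if __name__ == "__main__":
    # Made-up illustrative counts: four trials per cell pair, per condition.
    data = [
        ((3, 1, 1, 3), (2, 2, 2, 2)),
        ((4, 0, 1, 3), (3, 1, 2, 2)),
        ((2, 2, 0, 4), (2, 2, 1, 3)),
        ((3, 1, 2, 2), (1, 3, 1, 3)),
    ]
    print(percentile_bootstrap_ci(data))
```

Under this sketch, a hypothesis of no difference between conditions would be rejected when the interval excludes zero. Pooling counts before computing d′ avoids taking z scores of per-participant rates that are often 0 or 1 when each participant contributes only a few observations.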

Bibliographic record

  • Source
    Behavior Research Methods | 2005, Issue 1 | pp. 3-10 | 8 pages
  • Author affiliation

    Max Planck Institute for Human Development, Center for Adaptive Behavior and Cognition, Lentzeallee 94, Berlin 14195, Germany

  • Original format: PDF
  • Text language: English
  • Chinese Library Classification: Psychology
  • Date added to database: 2022-08-17 13:41:47
