首页> 外文会议>Annual meeting of the Association for Computational Linguistics >The Hitchhiker's Guide to Testing Statistical Significance in Natural Language Processing
【24h】

The Hitchhiker's Guide to Testing Statistical Significance in Natural Language Processing

机译:《 Hitchhiker的自然语言处理统计意义测试指南》

获取原文

摘要

Statistical significance testing is a standard statistical tool designed to ensure that experimental results are not coincidental. In this opin-ion/theoretical paper we discuss the role of statistical significance testing in Natural Language Processing (NLP) research. We establish the fundamental concepts of significance testing and discuss the specific aspects of NLP tasks, experimental setups and evaluation measures that affect the choice of significance tests in NLP research. Based on this discussion, we propose a simple practical protocol for statistical significance test selection in NLP setups and accompany this protocol with a brief survey of the most relevant tests. We then survey recent empirical papers published in ACL and TACL during 2017 and show that while our community assigns great value to experimental results, statistical significance testing is often ignored or misused. We conclude with a brief discussion of open issues that should be properly addressed so that this important tool can be applied in NLP research in a statistically sound manner.
机译:统计显着性检验是一种标准统计工具,旨在确保实验结果不是偶然的。在本观点/理论论文中,我们讨论了统计显着性检验在自然语言处理(NLP)研究中的作用。我们建立重要性测试的基本概念,并讨论影响NLP研究中重要性测试选择的NLP任务,实验设置和评估措施的特定方面。在此讨论的基础上,我们为NLP设置中的统计显着性测试选择提出了一个简单的实用协议,并将此协议与最相关的测试一起进行了简要介绍。然后,我们对2017年在ACL和TACL上发表的最新经验论文进行了调查,结果表明尽管我们的社区为实验结果赋予了巨大价值,但统计显着性检验经常被忽略或滥用。最后,我们简要讨论了应适当解决的未解决问题,以便可以将这种重要工具以统计上合理的方式应用于NLP研究。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号