首页> 外文会议>Starting AI Researchers' Symposium >Empirical Evaluation of Scoring Methods
【24h】

Empirical Evaluation of Scoring Methods

机译:评分方法的实证评价

获取原文

摘要

The automated reasoning research community has grown accustomed tocompetitive events where a pool of systems is run on a pool of problem instances with the purpose of ranking the systems according to their performances. At the heart of such ranking lies the method used to score the systems, i.e., the procedure used to compute a numerical quantity that should summarize the performances of a system with respect to the other systems and to the pool of problem instances. In this paper we evaluate several scoring methods, including methods used in automated reasoning contests, as well as methods based on voting theory, and a new method that we introduce. Our research aims to establish which of the above methods maximizes the effectiveness measures that we devised to quantify desirable properties of the scoring procedures. Our method is empirical, in that we compare the scoring methods by computing the effectiveness measures using the data from the 2005 comparative evaluation of solvers for quantified Boolean formulas. The results of our experiments give useful indications about the relative strengths and weaknesses of the scoring methods, and allow us to infer also some conclusions that are independent of the specific method adopted.
机译:自动推理研究界已经增加了习惯的竞争事件,其中一个系统池在一个问题实例上运行,目的是根据其性能排列系统。在这种排名的核心上,用于将系统的方法,即用于计算数量的过程,用于计算应总结一个关于其他系统的系统的性能以及问题实例池。在本文中,我们评估了几种评分方法,包括自动推理竞赛中使用的方法,以及基于投票理论的方法,以及我们介绍的新方法。我们的研究旨在建立上述哪种方法,最大限度地提高了我们设计的有效性措施,以便量化得分程序的理想性质。我们的方法是经验的,因为我们通过计算2005年对量化布尔公式的求解器的比较评估的数据计算有效性措施来比较评分方法。我们的实验结果为评分方法的相对优势和缺点提供了有用的指示,并允许我们推断出与所采用的具体方法无关的一些结论。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号