Empirical Evaluation of Scoring Methods

机译：评分方法的实证评价

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

The automated reasoning research community has grown accustomed tocompetitive events where a pool of systems is run on a pool of problem instances with the purpose of ranking the systems according to their performances. At the heart of such ranking lies the method used to score the systems, i.e., the procedure used to compute a numerical quantity that should summarize the performances of a system with respect to the other systems and to the pool of problem instances. In this paper we evaluate several scoring methods, including methods used in automated reasoning contests, as well as methods based on voting theory, and a new method that we introduce. Our research aims to establish which of the above methods maximizes the effectiveness measures that we devised to quantify desirable properties of the scoring procedures. Our method is empirical, in that we compare the scoring methods by computing the effectiveness measures using the data from the 2005 comparative evaluation of solvers for quantified Boolean formulas. The results of our experiments give useful indications about the relative strengths and weaknesses of the scoring methods, and allow us to infer also some conclusions that are independent of the specific method adopted.

机译：自动推理研究界已经增加了习惯的竞争事件，其中一个系统池在一个问题实例上运行，目的是根据其性能排列系统。在这种排名的核心上，用于将系统的方法，即用于计算数量的过程，用于计算应总结一个关于其他系统的系统的性能以及问题实例池。在本文中，我们评估了几种评分方法，包括自动推理竞赛中使用的方法，以及基于投票理论的方法，以及我们介绍的新方法。我们的研究旨在建立上述哪种方法，最大限度地提高了我们设计的有效性措施，以便量化得分程序的理想性质。我们的方法是经验的，因为我们通过计算2005年对量化布尔公式的求解器的比较评估的数据计算有效性措施来比较评分方法。我们的实验结果为评分方法的相对优势和缺点提供了有用的指示，并允许我们推断出与所采用的具体方法无关的一些结论。

著录项

来源
《Starting AI Researchers' Symposium》|2006年||共12页
会议地点
作者
Luca Pulina; IOS Press;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类人工智能理论;
关键词
Automated Reasoning; Systems Comparison; Scoring Methods;

机译：自动推理;系统比较;评分方法;

相似文献

外文文献
中文文献
专利

1. Comparison of empirical Bayes and propensity score methods for road safety evaluation: A simulation study [J] . Li Haojie, Graham Daniel J., Ding Hongliang, Accident Analysis & Prevention . 2019,第AUGa期

机译：经验贝叶斯和倾向得分方法在道路安全评估中的比较：模拟研究
2. A Prorating Method for Estimating MMPI-2-RF Scores From MMPI Responses: Examination of Score Fidelity and Illustration of Empirical Utility in the PERSEREC Police Integrity Study Sample [J] . Tarescavage Anthony M., Corey David M., Ben-Porath Yossef S. Assessment . 2016,第2期

机译：一种从MMPI响应中估算MMPI-2-RF分数的按比例分配方法：在PERSEREC警察廉正研究样本中检查分数保真度和经验效用图示
3. Effects of mergers on corporate performance: An empirical evaluation using OLS and the empirical Bayesian methods [J] . Abdul Rashid, Nazia Naeem Borsa Istanbul Review . 2017,第1期

机译：合并对公司绩效的影响：使用OLS和经验贝叶斯方法的经验评估
4. Empirical Evaluation of Scoring Methods [C] . Luca Pulina European Starting AI Researchers Symposium . 2006

机译：评分方法的实证评价
5. Causal Inference in Traffic Safety Research: Comparison of the Empirical Bayes and Propensity Scores-Potential Outcomes Methods [D] . Wood, Jonathan S. 2016

机译：交通安全研究中的因果推理：经验贝叶斯和倾向得分的比较-潜在结果方法
6. Protein Interaction Z Score Assessment (PIZSA): an empirical scoring scheme for evaluation of protein–protein interactions [O] . Ankit A Roy, Abhilesh S Dhawanjewar, Parichit Sharma, 2019

机译：蛋白质相互作用Z评分评估（PIZSA）：用于评估蛋白质与蛋白质相互作用的经验评分方案
7. Comparison of empirical Bayes and propensity score methods for road safety evaluation: A simulation study [O] . Haojie Li, Daniel J. Graham, Hongliang Ding, 2019

机译：经验贝叶斯和道路安全评价倾向评分方法的比较：模拟研究
8. Annotating Animal Mitochondrial tRNAs: A new Scoring Scheme and an Empirical Evaluation of Four Methods [R] . Wyman, S. K., Boore, J. L. 2004

机译：动物线粒体tRNa的注释：一种新的评分方案和四种方法的实证评价

Empirical Evaluation of Scoring Methods

摘要

著录项

相似文献

相关主题

期刊订阅