Reliability Engineering & System Safety

Cross validation for the classical model of structured expert judgment



Abstract

We update the 2008 TU Delft structured expert judgment database with data from 33 professionally contracted Classical Model studies conducted between 2006 and March 2015 to evaluate its performance relative to other expert aggregation models. We briefly review alternative mathematical aggregation schemes, including harmonic weighting, before focusing on linear pooling of expert judgments with equal weights and performance-based weights. Performance weighting outperforms equal weighting in all but 1 of the 33 studies in-sample. True out-of-sample validation is rarely possible for Classical Model studies, and cross validation techniques that split calibration questions into a training and test set are used instead. Performance weighting incurs an "out-of-sample penalty" and its statistical accuracy out-of-sample is lower than that of equal weighting. However, as a function of training set size, the statistical accuracy of performance-based combinations reaches 75% of the equal weight value when the training set includes 80% of calibration variables. At this point the training set is sufficiently powerful to resolve differences in individual expert performance. The information of performance-based combinations is double that of equal weighting when the training set is at least 50% of the set of calibration variables. Previous out-of-sample validation work used a Total Out-of-Sample Validity Index based on all splits of the calibration questions into training and test subsets, which is expensive to compute and includes small training sets of dubious value. As an alternative, we propose an Out-of-Sample Validity Index based on averaging the product of statistical accuracy and information over all training sets sized at 80% of the calibration set. 
Performance weighting outperforms equal weighting on this Out-of-Sample Validity Index in 26 of the 33 post-2006 studies; the probability of 26 or more successes on 33 trials if there were no difference between performance weighting and equal weighting is 0.001.
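The proposed Out-of-Sample Validity Index averages the product of statistical accuracy and information over every training set containing 80% of the calibration questions. A minimal sketch of that enumeration is below; the two scoring functions are illustrative stubs (hypothetical names, constant toy values), not the Classical Model's actual chi-square-based calibration score or Shannon relative information:

```python
from itertools import combinations
from math import comb

def statistical_accuracy(train, test):
    # Stub: the Classical Model uses a chi-square-based p-value for the
    # combined expert's performance on the test questions. Toy constant here.
    return 0.5

def information(train, test):
    # Stub: relative information of the combination w.r.t. a background
    # measure in the Classical Model. Toy constant here.
    return 1.0

def osvi(calibration_questions, frac=0.8):
    """Average statistical accuracy x information over all training sets
    holding `frac` of the calibration questions (test set = remainder)."""
    n = len(calibration_questions)
    k = round(frac * n)
    total, count = 0.0, 0
    for train in combinations(calibration_questions, k):
        test = [q for q in calibration_questions if q not in train]
        total += statistical_accuracy(train, test) * information(train, test)
        count += 1
    assert count == comb(n, k)  # one term per 80% split
    return total / count
```

With 10 calibration questions this enumerates the C(10, 8) = 45 training sets of size 8, which is far cheaper than the Total Out-of-Sample Validity Index's sum over all possible splits.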
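The quoted p-value can be checked directly: under the null hypothesis of no difference, each study is a fair coin flip, so the chance of 26 or more successes in 33 trials is a binomial tail probability:

```python
from math import comb

# P(X >= 26) for X ~ Binomial(n=33, p=1/2): the probability of 26 or more
# successes in 33 trials if performance and equal weighting were equivalent.
p = sum(comb(33, k) for k in range(26, 34)) / 2**33
print(round(p, 3))  # 0.001, matching the figure quoted in the abstract
```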