Assessing the quality of classification models: Performance measures and evaluation procedures

Pawe? Cichosz

首页> 外文期刊>Open Engineering >Assessing the quality of classification models: Performance measures and evaluation procedures

【24h】

Assessing the quality of classification models: Performance measures and evaluation procedures

机译：评估分类模型的质量：绩效指标和评估程序

获取原文

掌桥外文数据库（机构版） >>

开具论文收录证明 >>

文献代查 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

This article systematically reviews techniques used for the evaluation of classification models and provides guidelines for their proper application. This includes performance measures assessing the model’s performance on a particular dataset and evaluation procedures applying the former to appropriately selected data subsets to produce estimates of their expected values on new data. Their common purpose is to assess model generalization capabilities, which are crucial for judging the applicability and usefulness of both classification and any other data mining models. The review presented in this article is expected to be sufficiently in-depth and complete for most practical needs, while remaining clear and easy to follow with little prior knowledge. Issues that receive special attention include incorporating instance weights to performance measures, combining the same set of evaluation procedures with arbitrary performance measures, and avoiding pitfalls related to separating data subsets used for evaluation from those used for model creation. With the classification task unquestionably being one of the central data mining tasks and the vastly increasing number of data mining applications — not only in business, but also in engineering and research — this is expected to be interesting and useful for a wide audience. All presented techniques are accompanied by simple R language implementations and usage examples, which — whereas created to serve the illustration purpose mostly — can be actually used in practice.

机译：本文系统地回顾了用于评估分类模型的技术，并为其正确应用提供了指导。这包括性能指标上的特定数据集评估模型的性能和评价程序将前者适当选择数据子集对新数据的预期值的估计数字。它们的共同目的是评估模型泛化能力，这对于判断分类和任何其他数据挖掘模型的适用性和实用性至关重要。预期本文中的评论将针对大多数实际需求进行足够的深度和完整，同时保持清晰且易于理解，并且无需任何先验知识。需要特别注意的问题包括将实例权重合并到性能指标中，将同一套评估程序与任意性能指标相结合，并避免与将用于评估的数据子集与用于模型创建的子集分离有关的陷阱。毫无疑问，分类任务是中心数据挖掘任务之一，并且数据挖掘应用程序的数量不断增加（不仅在业务方面，而且在工程学和研究领域），这对于广泛的受众来说将是有趣且有用的。所有提出的技术都附带有简单的R语言实现和用法示例，尽管这些语言创建来主要是用于说明目的，但实际上可以在实践中使用。

著录项

来源
《Open Engineering》 |2011年第2期|共页
作者
Pawe? Cichosz;
展开▼
作者单位

展开▼
收录信息
原文格式 PDF
正文语种
中图分类一般工业技术;
关键词

相似文献

外文文献
中文文献
专利

1. Assessing the Quality of Classification Models: Performance Measures and Evaluation Procedures [J] . Pawet Cichosz Central European Journal of Engineering . 2011,第2期

机译：评估分类模型的质量：绩效指标和评估程序
2. Comparing classical performance measures with signature indices derived from flow duration curves to assess model structures as tools for catchment classification [J] . Rita Ley, Hugo Hellebrand, Markus C. Casper, Nordic hydrology . 2016,第1期

机译：将经典性能指标与流量持续时间曲线得出的特征指标进行比较，以评估模型结构作为流域分类的工具
3. Measuring ability to assess claims about treatment effects: a latent trait analysis of items from the ?￠??Claim Evaluation Tools?￠?? database using Rasch modelling,Measuring ability to assess claims about treatment effects: the development of the ?￠??Claim E [J] . Allen Nsangi, Astrid Austvoll-Dahlgren, Daniel Semakula, BMJ Open . 2017,第5期

机译：评估评估有关治疗效果的主张的能力：来自“索赔评估工具”的项目的潜在性状分析使用Rasch建模的数据库，可评估有关治疗效果的声明的能力：Claim E的发展
4. A statistical methodological framework for estimating, assessing, evaluating, monitoring and interpreting road travel risk performance measure indicators: A 'risk analysis and evaluation system model' combining traffic collision and 'exposure torisk' [C] . Delbert E. Stewart International Technical Conference on the Enhanced Safety of Vehicles . 1998

机译：估计，评估，评估，监测和解释道路旅行风险绩效措施指标的统计方法论框架：“风险分析与评估系统模型”组合交通碰撞和“曝光”
5. THE EVALUATION CRITERIA AND PROCEDURES EMPLOYED TO ASSESS THE PERFORMANCE OF SECONDARY PUBLIC SCHOOL PRINCIPALS IN VIRGINIA. [D] . ROUNTREE, JAMES EARL. 1981

机译：评估弗吉尼亚中学公立学校基本绩效的评估标准和程序。
6. Expected loss functions as additional measures to assess performance of multiple testing procedures for combination drug dose finding [O] . Julia N. Soulakova, Allan R. Sampson -1

机译：预期损失函数作为评估组合药物剂量发现的多种测试程序性能的额外措施
7. Why standard modelling and evaluation procedures are inadequate for assessing traffic congestion measures [O] . Lam WHK, Tam ML 1997

机译：为什么标准建模和评估程序不足以评估交通拥堵措施
8. Acoustic Model Evaluation Procedures: A Review. Distance Measures and Statistical Test Procedures for Assessing the Accuracy of Propagation Loss Models [R] . McGirr, R. W. 1979

机译：声学模型评估程序：评论。评估传播损失模型准确性的距离测量和统计测试程序

Assessing the quality of classification models: Performance measures and evaluation procedures

摘要

著录项

相似文献

相关主题

期刊订阅