Optimizing Classifiers for Hypothetical Scenarios

机译：针对假设场景优化分类器

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

The deployment of classification models is an integral component of many modern data mining and machine learning applications. A typical classification model is built with the tacit assumption that the deployment scenario by which it is evaluated is fixed and fully characterized. Yet, in the practical deployment of classification methods, important aspects of the application environment, such as the misclassification costs, may be uncertain during model building. Moreover, a single classification model may be applied in several different deployment scenarios. In this work, we propose a method to optimize a model for uncertain deployment scenarios. We begin by deriving a relationship between two evaluation measures, H measure and cost curves, that may be used to address uncertainty in classifier performance. We show that when uncertainty in classifier performance is modeled as a probabilistic belief that is a function of this underlying relationship, a natural definition of risk emerges for both classifiers and instances. We then leverage this notion of risk to develop a boosting-based algorithm-which we call RiskBoost-that directly mitigates classifier risk, and we demonstrate that it outperforms AdaBoost on a diverse selection of datasets.

机译：分类模型的部署是许多现代数据挖掘和机器学习应用程序不可或缺的组成部分。一个典型的分类模型是在默认假设的基础上构建的，评估该部署模型是固定的，并且具有充分的特征。但是，在分类方法的实际部署中，应用环境的重要方面（例如错误分类成本）在模型构建过程中可能是不确定的。此外，单个分类模型可以应用于几种不同的部署方案。在这项工作中，我们提出了一种针对不确定的部署方案优化模型的方法。我们首先得出H度量和成本曲线这两个评估度量之间的关系，这些关系可用于解决分类器性能的不确定性。我们表明，当分类器性能的不确定性被建模为基于此基础关系的概率性信念时，分类器和实例的风险自然定义就会出现。然后，我们利用这种风险概念开发了一种基于boosting的算法（我们称为RiskBoost），该算法可直接减轻分类器风险，并且在多种数据集上证明它优于AdaBoost。

著录项

来源
《Pacific-Asia conference on knowledge discovery and data mining》|2015年|264-276|共13页
会议地点
作者
Reid A. Johnson; Troy Raeder; Nitesh V. Chawla;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类
关键词

相似文献

外文文献
中文文献
专利

1. Grasshopper optimization algorithm-based approach for the optimization of ensemble classifier and feature selection to classify epileptic EEG signals [J] . Singh Gurwinder, Singh Birmohan, Kaur Manpreet Medical and Biological Engineering and Computing: Journal of the International Federation for Medical and Biological Engineering . 2019,第6期

机译：基于蚱蜢优化算法的合奏分类器优化和特征选择来分类癫痫脑电图信号
2. Integrating Machine Learning With Microsimulation to Classify Hypothetical, Novel Patients for Predicting Pregabalin Treatment Response Based on Observational and Randomized Data in Patients With Painful Diabetic Peripheral Neuropathy [J] . Joe Alexander Jr, Roger A Edwards, Luigi Manca, Pragmatic and Observational Research . 2019,第5期

机译：基于疼痛糖尿病外周神经病变患者的观察和随机数据，整合机器学习以分类假设，新型患者预测普瑞巴林治疗响应预测普瑞巴林治疗响应
3. Risk Assessment of an Electrical Power System Considering the Influence of Traffic Congestion on a Hypothetical Scenario of Electrified Transportation System in New York State [J] . Wang Hongping, Fang Yi-Ping, Zio Enrico IEEE Transactions on Intelligent Transportation Systems . 2021,第1期

机译：考虑到交通拥堵对纽约电气化运输系统的影响力的影响风险评估
4. Optimizing Classifiers for Hypothetical Scenarios [C] . Reid A. Johnson, Troy Raeder, Nitesh V. Chawla Pacific-Asia conference on knowledge discovery and data mining . 2015

机译：优化假设方案的分类器
5. Effects of the Permutation of Reinforcement Magnitude on Measures of Delay Discounting in a Hypothetical Money Scenario [D] . Harman, Michael J. 2018

机译：假设货币情景中加固幅度的排列对延迟贴现措施的影响
6. Optimizing Palbociclib Therapy Across the Age Spectrum: Hypothetical Illustrative Case Scenarios in HR+ HER2–Metastatic Breast Cancer [O] . Kimberly Podsada, Joanne C. Ryan, Kristi Orbaugh 2020

机译：优化患有年龄频谱的Palbociclib疗法：HR +Her2-转移性乳腺癌的假设说明性案例情景
7. Classifying Scenarios in a Product Design Process: a study towards semi-automated scenario generation [O] . Anggreeni I., Voort M.C. van der 2008

机译：在产品设计过程中对场景进行分类：对半自动场景生成的研究
8. Numerical Hydrodynamic Study of Hypothetical Levee Setback Scenarios. [R] . Echevarria-Doyle, W., Dahl, T. A. 2018

机译：假设堤防挫折情景的数值水动力学研究。

Optimizing Classifiers for Hypothetical Scenarios

摘要

著录项

相似文献

相关主题

期刊订阅