On the evaluation and selection of classifier learning algorithms with crowdsourced data

Urkullu A.; Perez A.; Calvo B.

首页> 外文期刊>Applied Soft Computing >On the evaluation and selection of classifier learning algorithms with crowdsourced data

【24h】

On the evaluation and selection of classifier learning algorithms with crowdsourced data

机译：众包数据分类学习算法的评估与选择

获取原文

获取原文并翻译 | 示例

获取外文期刊封面目录资料

开具论文收录证明 >>

文献代查 >>

文献数据库（团队版） >>

页面导航

摘要
著录项
引文网络
相似文献
相关主题

摘要

In many current problems, the actual class of the instances, the ground truth, is unavailable. Instead, with the intention of learning a model, the labels can be crowdsourced by harvesting them from different annotators. In this work, among those problems we focus on those that are binary classification problems. Specifically, our main objective is to explore the evaluation and selection of models through the quantitative assessment of the goodness of evaluation methods capable of dealing with this kind of context. That is a key task for the selection of evaluation methods capable of performing a sensible model selection. Regarding the evaluation and selection of models in such contexts, we identify three general approaches, each one based on a different interpretation of the nature of the underlying ground truth: deterministic, subjectivist or probabilistic. For the analysis of these three approaches, we propose how to estimate the Area Under the Curve (AUC) of the Receiver Operating Characteristic (ROC) curve within each interpretation, thus deriving three evaluation methods. These methods are compared in extensive experimentation whose empirical results show that the probabilistic method generally overcomes the other two, as a result of which we conclude that it is advisable to use that method when performing the evaluation in such contexts. In further studies, it would be interesting to extend our research to multiclass classification problems. (C) 2019 Elsevier B.V. All rights reserved.

机译：在许多当前问题中，实际情况的实际类别，实际真相是不可用的。相反，随着学习模型的意图，通过从不同的注释器收获它们来众所周心。在这项工作中，我们在那些问题中关注那些是二进制分类问题的问题。具体而言，我们的主要目标是通过定量评估能够处理这种背景的评估方法的善良的定量评估来探讨模型的评估和选择。这是选择能够执行合理的模型选择的评估方法的关键任务。关于在这种背景下的模型评估和选择，我们确定三种一般方法，每个方法都是基于对基础事实的性质的不同解释：确定性，主观主义或概率。为了分析这三种方法，我们提出了如何估计每个解释内的接收器操作特征（ROC）曲线的曲线（AUC）下的区域，从而导出三种评估方法。这些方法在广泛的实验中进行了比较，其经验结果表明，概率方法通常克服另外两个，因此我们得出结论，建议在在这种情况下进行评估时使用该方法。在进一步的研究中，将我们的研究扩展到多种多组分类问题是有趣的。（c）2019年Elsevier B.V.保留所有权利。

著录项

来源
《Applied Soft Computing》 |2019年第2019期|共13页
作者
Urkullu A.; Perez A.; Calvo B.;
展开▼
作者单位

Univ Basque Country Dept Comp Sci &

Artificial Intelligence UPV EHU Paseo Manuel Lardizabal 1 San Sebastian 20018 Spain;

Basque Ctr Appl Math Dept Data Sci Alameda Mazarredo 14 Bilbao 48009 Spain;

Univ Basque Country Dept Comp Sci &

Artificial Intelligence UPV EHU Paseo Manuel Lardizabal 1 San Sebastian 20018 Spain;

展开▼
收录信息
原文格式 PDF
正文语种 eng
中图分类计算机软件;
关键词
Model selection; Evaluation; Crowdsourced data; AUC; Kendall-tau;

机译：模型选择;评估;众包数据;AUC;KENDALL-TAU;

相似文献

外文文献
中文文献
专利

1. On the evaluation and selection of classifier learning algorithms with crowdsourced data [J] . Urkullu A., Perez A., Calvo B. Applied Soft Computing . 2019,第期

机译：众包数据分类学习算法的评估与选择
2. A hybrid data mining model of feature selection algorithms and ensemble learning classifiers for credit scoring [J] . Fatemeh Nemati Koutanaei, Hedieh Sajedi, Mohammad Khanbabaei Journal of retailing and consumer services . 2015,第NOVa期

机译：特征选择算法和集成学习分类器的混合数据挖掘模型用于信用评分
3. On Taxonomy and Evaluation of Feature Selection-Based Learning Classifier System Ensemble Approaches for Data Mining Problems [J] . Debie Essam, Shafi Kamran, Merrick Kathryn, Computational Intelligence . 2017,第3期

机译：基于特征选择的学习分类器系统集成方法的数据挖掘问题分类与评价
4. Phishing Hybrid Feature-Based Classifier by Using Recursive Features Subset Selection and Machine Learning Algorithms [C] . Hiba Zuhair, Ali Selamat International Coference of Reliable Information and Communication Technology . 2019

机译：通过使用递归特征子集选择和机器学习算法，通过使用递归特征基于混合特征的分类器
5. Applying Machine Learning with Spatio Temporal Analysis to Classify Crowdsourced Data from the 2010 Haiti Earthquake Relief Efforts. [D] . Jamal, Sarosh. 2017

机译：将机器学习与时空时空分析相结合，对2010年海地地震救济努力中的众包数据进行分类。
6. Supervised Machine Learning Algorithms for Bioelectromagnetics: Prediction Models and Feature Selection Techniques Using Data from Weak Radiofrequency Radiation Effect on Human and Animals Cells [O] . Malka N. Halgamuge 2020

机译：生物电磁学的有监督机器学习算法：使用弱射频辐射对人和动物细胞的数据预测模型和特征选择技术
7. On the evaluation and selection of classifier learning algorithms with crowdsourced data [O] . A. Urkullu, A. Pérez, B. Calvo 2019

机译：论众包数据的分类器学习算法的评估与选择

On the evaluation and selection of classifier learning algorithms with crowdsourced data

摘要

著录项

引文网络

相似文献

相关主题

期刊订阅