首页> 外文期刊>Applied Soft Computing >On the evaluation and selection of classifier learning algorithms with crowdsourced data
【24h】

On the evaluation and selection of classifier learning algorithms with crowdsourced data

机译:众包数据分类学习算法的评估与选择

获取原文
获取原文并翻译 | 示例
获取外文期刊封面目录资料

摘要

In many current problems, the actual class of the instances, the ground truth, is unavailable. Instead, with the intention of learning a model, the labels can be crowdsourced by harvesting them from different annotators. In this work, among those problems we focus on those that are binary classification problems. Specifically, our main objective is to explore the evaluation and selection of models through the quantitative assessment of the goodness of evaluation methods capable of dealing with this kind of context. That is a key task for the selection of evaluation methods capable of performing a sensible model selection. Regarding the evaluation and selection of models in such contexts, we identify three general approaches, each one based on a different interpretation of the nature of the underlying ground truth: deterministic, subjectivist or probabilistic. For the analysis of these three approaches, we propose how to estimate the Area Under the Curve (AUC) of the Receiver Operating Characteristic (ROC) curve within each interpretation, thus deriving three evaluation methods. These methods are compared in extensive experimentation whose empirical results show that the probabilistic method generally overcomes the other two, as a result of which we conclude that it is advisable to use that method when performing the evaluation in such contexts. In further studies, it would be interesting to extend our research to multiclass classification problems. (C) 2019 Elsevier B.V. All rights reserved.
机译:在许多当前问题中,实际情况的实际类别,实际真相是不可用的。相反,随着学习模型的意图,通过从不同的注释器收获它们来众所周心。在这项工作中,我们在那些问题中关注那些是二进制分类问题的问题。具体而言,我们的主要目标是通过定量评估能够处理这种背景的评估方法的善良的定量评估来探讨模型的评估和选择。这是选择能够执行合理的模型选择的评估方法的关键任务。关于在这种背景下的模型评估和选择,我们确定三种一般方法,每个方法都是基于对基础事实的性质的不同解释:确定性,主观主义或概率。为了分析这三种方法,我们提出了如何估计每个解释内的接收器操作特征(ROC)曲线的曲线(AUC)下的区域,从而导出三种评估方法。这些方法在广泛的实验中进行了比较,其经验结果表明,概率方法通常克服另外两个,因此我们得出结论,建议在在这种情况下进行评估时使用该方法。在进一步的研究中,将我们的研究扩展到多种多组分类问题是有趣的。 (c)2019年Elsevier B.V.保留所有权利。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号