...
首页> 外文期刊>Neurocomputing >Multi-label thresholding for cost-sensitive classification
【24h】

Multi-label thresholding for cost-sensitive classification

机译:用于成本敏感分类的多标签阈值

获取原文
获取原文并翻译 | 示例
   

获取外文期刊封面封底 >>

       

摘要

Multi-label classification associates each instance with a set of labels which reflects the nature of a wide range of real-world applications. However, existing approaches assume that all labels have the same misclassification cost, whereas in real-world problems different types of misclassification errors have different costs, which are generally unknown in the training context or might change from one context to another. Thus, there is a demand for cost-sensitive classification methods that minimise the average misclassification cost rather than error rates or counts. In this paper, we adopt a simple yet general method, called thresholding, which applies to most classification algorithms to adapt them to cost-sensitive multi-label classification. This paper investigates current threshold choice approaches for multi-label classification. It explores the choice of single and multiple thresholds and extends some of the current techniques to support multi-label problems. Moreover, it proposes cost curves and scatter diagrams for performance evaluation in the multi-label setting. Experimental evaluation on 13 multi-label datasets demonstrates that there is no significant loss by adjusting a global threshold rather than a per-label threshold considering different misclassification costs across labels. Although tuning multiple thresholds is the obvious solution, the global threshold can also be valid.Multi-label classification associates each instance with a set of labels which reflects the nature of a wide range of real-world applications. However, existing approaches assume that all labels have the same misclassification cost, whereas in real-world problems different types of misclassification errors have different costs, which are generally unknown in the training context or might change from one context to another. Thus, there is a demand for cost-sensitive classification methods that minimise the average misclassification cost rather than error rates or counts. In this paper, we adopt a simple yet general method, called thresholding, which applies to most classification algorithms to adapt them to cost-sensitive multi-label classification. This paper investigates current threshold choice approaches for multi-label classification. It explores the choice of single and multiple thresholds and extends some of the current techniques to support multi-label problems. Moreover, it proposes cost curves and scatter diagrams for performance evaluation in the multi-label setting. Experimental evaluation on 13 multi-label datasets demonstrates that there is no significant loss by adjusting a global threshold rather than a per-label threshold considering different misclassification costs across labels. Although tuning multiple thresholds is the obvious solution, the global threshold can also be valid.(c) 2020 Elsevier B.V. All rights reserved.
机译:多标签分类将每个实例与一组标签相关联,这些标签反映了各种现实应用程序的性质。然而,现有方法假设所有标签都具有相同的错误分类成本,而在实际问题中,不同类型的错误分类错误具有不同的成本,这在培训背景中通常是未知的,或者可能会从一个上下文变为另一个语境。因此,需要对成本敏感的分类方法来最小化平均错误分类成本而不是错误率或计数。在本文中,我们采用一种简单但通用的方法,称为阈值,这适用于大多数分类算法,以使它们适应成本敏感的多标签分类。本文调查了多标签分类的电流阈值选择方法。它探讨了单个和多个阈值的选择,并扩展了一些当前技术来支持多标签问题。此外,它提出了用于多标签设置中的性能评估的成本曲线和散点图。对13个多标签数据集的实验评估表明,通过调整全局阈值而不是考虑到标签不同的错误分类成本,通过调整全局阈值而没有显着损失。虽然调整多个阈值是明显的解决方案,但是全局阈值也可以是有效的.Multi-Label分类将每个实例与一组标签相关联,这些标签反映了各种现实应用程序的性质。然而,现有方法假设所有标签都具有相同的错误分类成本,而在实际问题中,不同类型的错误分类错误具有不同的成本,这在培训背景中通常是未知的,或者可能会从一个上下文变为另一个语境。因此,需要对成本敏感的分类方法来最小化平均错误分类成本而不是错误率或计数。在本文中,我们采用一种简单但通用的方法,称为阈值,这适用于大多数分类算法,以使它们适应成本敏感的多标签分类。本文调查了多标签分类的电流阈值选择方法。它探讨了单个和多个阈值的选择,并扩展了一些当前技术来支持多标签问题。此外,它提出了用于多标签设置中的性能评估的成本曲线和散点图。对13个多标签数据集的实验评估表明,通过调整全局阈值而不是考虑到标签不同的错误分类成本,通过调整全局阈值而没有显着损失。虽然调整多个阈值是明显的解决方案,但全局阈值也可以有效。(c)2020 Elsevier B.v.保留所有权利。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号