...
首页> 外文期刊>Advances in Artificial Intelligence >A New Information Measure Based on Example-Dependent Misclassification Costs and Its Application in Decision Tree Learning
【24h】

A New Information Measure Based on Example-Dependent Misclassification Costs and Its Application in Decision Tree Learning

机译:基于实例错误分类成本的新信息测度及其在决策树学习中的应用

获取原文
获取原文并翻译 | 示例
           

摘要

This article describes how the costs of misclassification given with the individual training objects for classification learning can be used in the construction of decision trees for minimal cost instead of minimal error class decisions. This is demonstrated by defining modified, cost-dependent probabilities, a new, cost-dependent information measure, and using a cost-sensitive extension of the CAL5 algorithm for learning decision trees. The cost-dependent information measure ensures the selection of the (local) next best, that is, cost-minimizing, discriminating attribute in the sequential construction of the classification trees. This is shown to be a cost-dependent generalization of the classical information measure introduced by Shannon, which only depends on classical probabilities. It is therefore of general importance and extends classic information theory, knowledge processing, and cognitive science, since subjective evaluations of decision alternatives can be included in entropy and the transferred information. Decision trees can then be viewed as cost-minimizing decoders for class symbols emitted by a source and coded by feature vectors. Experiments with two artificial datasets and one application example show that this approach is more accurate than a method which uses class dependent costs given by experts a priori.
机译:本文介绍了如何将用于分类学习的单个训练对象给出的错误分类成本用于决策树的构建,以最小的成本而不是最小的错误类别决策。通过定义修改的,与成本相关的概率,新的,与成本相关的信息度量,以及使用CAL5算法的成本敏感扩展来学习决策树,可以证明这一点。成本相关的信息度量可确保选择(本地)次优的分类,即在分类树的顺序构造中最小化成本,区分属性。这被证明是Shannon引入的经典信息度量的一种依赖于成本的概括,它仅取决于经典概率。因此,它具有普遍意义,并扩展了经典的信息论,知识处理和认知科学,因为决策选择的主观评估可以包含在熵和传递的信息中。然后,决策树可被看作是成本最小化的解码器,用于由源发出并由特征向量编码的类符号。使用两个人工数据集和一个应用示例进行的实验表明,该方法比使用专家先验给出的类相关成本的方法更为准确。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号