首页> 外文期刊>Intelligent data analysis >C_CART: An instance confidence-based decision tree algorithm for classification
【24h】

C_CART: An instance confidence-based decision tree algorithm for classification

机译:C_CART:分类的实例基于置信度决策树算法

获取原文
获取原文并翻译 | 示例
           

摘要

In classification, a decision tree is a common model due to its simple structure and easy understanding. Most of decision tree algorithms assume all instances in a dataset have the same degree of confidence, so they use the same generation and pruning strategies for all training instances. In fact, the instances with greater degree of confidence are more useful than the ones with lower degree of confidence in the same dataset. Therefore, the instances should be treated discriminately according to their corresponding confidence degrees when training classifiers. In this paper, we investigate the impact and significance of degree of confidence of instances on the classification performance of decision tree algorithms, taking the classification and regression tree (CART) algorithm as an example. First, the degree of confidence of instances is quantified from a statistical perspective. Then, a developed CART algorithm named C_CART is proposed by introducing the confidence of instances into the generation and pruning processes of CART algorithm. Finally, we conduct experiments to evaluate the performance of C_CART algorithm. The experimental results show that our C_CART algorithm can significantly improve the generalization performance as well as avoiding the over-fitting problem to a certain extend.
机译:在分类中,决策树是由于其简单结构和易于理解的共同模型。大多数决策树算法假设数据集中的所有实例都具有相同程度的置信度,因此它们对所有培训实例使用相同的生成和修剪策略。事实上,具有更高程度的信心的情况比在同一数据集中具有较低置信度的距离更低的情况。因此,当训练分类器时,应根据其相应的置信度判别处理该情况。在本文中,我们调查了实例对决策树算法分类性能的影响和意义,以分类和回归树(推车)算法为例。首先,从统计角度来看,情况的置信度是量化的。然后,通过将实例的置信度引入购物车算法的产生和修剪过程来提出名为C_CART的发达的CAR算法。最后,我们进行实验来评估C_CART算法的性能。实验结果表明,我们的C_CART算法可以显着提高泛化性能,以及避免对某一延伸的过度拟合问题。

著录项

  • 来源
    《Intelligent data analysis》 |2021年第4期|929-948|共20页
  • 作者单位

    Minist Educ Key Lab Symbol Computat & Knowledge Engineer Changchun 130012 Jilin Peoples R China|Jilin Univ Coll Comp Sci & Technol Changchun Jilin Peoples R China|CSIRO Data61 Sydney NSW Australia;

    Minist Educ Key Lab Symbol Computat & Knowledge Engineer Changchun 130012 Jilin Peoples R China|Jilin Univ Coll Comp Sci & Technol Changchun Jilin Peoples R China;

    Nanjing Univ State Key Lab Novel Software Technol Nanjing Jiangsu Peoples R China;

    Minist Educ Key Lab Symbol Computat & Knowledge Engineer Changchun 130012 Jilin Peoples R China|Jilin Univ Coll Comp Sci & Technol Changchun Jilin Peoples R China;

    CSIRO Data61 Sydney NSW Australia;

  • 收录信息 美国《科学引文索引》(SCI);
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类
  • 关键词

    Degree of confidence; CART algorithm; generalization; classification; machine learning;

    机译:信心程度;购物车算法;泛化;分类;机器学习;

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号