...
首页> 外文期刊>Information Systems >Software defect prediction using a cost sensitive decision forest and voting, and a potential solution to the class imbalance problem
【24h】

Software defect prediction using a cost sensitive decision forest and voting, and a potential solution to the class imbalance problem

机译:使用成本敏感的决策林和投票进行软件缺陷预测,以及对类不平衡问题的潜在解决方案

获取原文
获取原文并翻译 | 示例
           

摘要

Software development projects inevitably accumulate defects throughout the development process. Due to the high cost that defects can incur, careful consideration is crucial when predicting which sections of code are likely to contain defects. Classification algorithms used in machine learning can be used to create classifiers which can be used to predict defects. While traditional classification algorithms optimize for accuracy, cost-sensitive classification methods attempt to make predictions which incur the lowest classification cost. In this paper we propose a cost-sensitive classification technique called CSForest which is an ensemble of decision trees. We also propose a cost-sensitive voting technique called CS Voting in order to take advantage of the set of decision trees in minimizing the classification cost. We then investigate a potential solution to class imbalance within our decision forest algorithm. We empirically evaluate the proposed techniques comparing them with six (6) classifier algorithms on six (6) publicly available clean datasets that are commonly used in the research on software defect prediction. Our initial experimental results indicate a clear superiority of the proposed techniques over the existing ones. (C) 2015 Elsevier Ltd. All rights reserved.
机译:软件开发项目不可避免地会在整个开发过程中积累缺陷。由于缺陷可能会导致高昂的成本,因此在预测代码的哪些部分可能包含缺陷时,仔细考虑至关重要。机器学习中使用的分类算法可用于创建可用于预测缺陷的分类器。尽管传统的分类算法会针对准确性进行优化,但成本敏感型分类方法会尝试做出导致最低分类成本的预测。在本文中,我们提出了一种成本敏感的分类技术,称为CSForest,它是决策树的集合。我们还提出了一种成本敏感的投票技术,称为CS投票,以便在最小化分类成本时利用决策树集。然后,我们研究了决策森林算法中类不平衡的潜在解决方案。我们对六(6)个公开可用的干净数据集上的六(6)个分类器算法与它们进行比较,对这些技术进行经验评估,这些算法通常用于软件缺陷预测的研究中。我们的初步实验结果表明,所提出的技术明显优于现有技术。 (C)2015 Elsevier Ltd.保留所有权利。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号