首页> 外文期刊>Software Quality Journal >A transfer cost-sensitive boosting approach for cross-project defect prediction
【24h】

A transfer cost-sensitive boosting approach for cross-project defect prediction

机译:用于跨项目缺陷预测的转移成本敏感提升方法

获取原文
获取原文并翻译 | 示例
           

摘要

Software defect prediction has been regarded as one of the crucial tasks to improve software quality by effectively allocating valuable resources to fault-prone modules. It is necessary to have a sufficient set of historical data for building a predictor. Without a set of sufficient historical data within a company, cross-project defect prediction (CPDP) can be employed where data from other companies are used to build predictors. In such cases, a transfer learning technique, which extracts common knowledge from source projects and transfers it to a target project, can be used to enhance the prediction performance. There exists the class imbalance problem, which causes difficulties for the learner to predict defects. The main impacts of imbalanced data under cross-project settings have not been investigated in depth. We propose a transfer cost-sensitive boosting method that considers both knowledge transfer and class imbalance for CPDP when given a small amount of labeled target data. The proposed approach performs boosting that assigns weights to the training instances with consideration of both distributional characteristics and the class imbalance. Through comparative experiments with the transfer learning and the class imbalance learning techniques, we show that the proposed model provides significantly higher defect detection accuracy while retaining better overall performance. As a result, a combination of transfer learning and class imbalance learning is highly effective for improving the prediction performance under cross-project settings. The proposed approach will help to design an effective prediction model for CPDP. The improved defect prediction performance could help to direct software quality assurance activities and reduce costs. Consequently, the quality of software can be managed effectively.
机译:通过有效地将宝贵的资源分配给易错模块,软件缺陷预测已被视为提高软件质量的关键任务之一。必须有足够的历史数据集来构建预测变量。如果公司内部没有一组足够的历史数据,则可以使用跨项目缺陷预测(CPDP),其中使用来自其他公司的数据来构建预测器。在这种情况下,可以使用从源项目中提取常识并将其转移到目标项目的转移学习技术来增强预测性能。存在班级不平衡问题,给学习者预测缺陷带来困难。跨项目设置下数据不平衡的主要影响尚未得到深入研究。我们提出了一种转移成本敏感的提升方法,当给定少量标记的目标数据时,该方法同时考虑了CPDP的知识转移和类不平衡。所提出的方法进行了增强,将分配权重分配给训练实例,同时考虑了分布特征和班级不平衡。通过使用转移学习和类不平衡学习技术的对比实验,我们表明,提出的模型提供了更高的缺陷检测精度,同时保留了更好的整体性能。结果,转移学习和班级不平衡学习的结合对于提高跨项目设置下的预测性能非常有效。所提出的方法将有助于为CPDP设计有效的预测模型。改进的缺陷预测性能可以帮助指导软件质量保证活动并降低成本。因此,可以有效地管理软件的质量。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号