首页> 外文期刊>Empirical Software Engineering >Value-cognitive boosting with a support vector machine for cross-project defect prediction
【24h】

Value-cognitive boosting with a support vector machine for cross-project defect prediction

机译:支持向量机的价值认知提升,用于跨项目缺陷预测

获取原文
获取原文并翻译 | 示例
       

摘要

It is well-known that software defect prediction is one of the most important tasks for software quality improvement. The use of defect predictors allows test engineers to focus on defective modules. Thereby testing resources can be allocated effectively and the quality assurance costs can be reduced. For within-project defect prediction (WPDP), there should be sufficient data within a company to train any prediction model. Without such local data, cross-project defect prediction (CPDP) is feasible since it uses data collected from similar projects in other companies. Software defect datasets have the class imbalance problem increasing the difficulty for the learner to predict defects. In addition, the impact of imbalanced data on the real performance of models can be hidden by the performance measures chosen. We investigate if the class imbalance learning can be beneficial for CPDP. In our approach, the asymmetric misclassification cost and the similarity weights obtained from distributional characteristics are closely associated to guide the appropriate resampling mechanism. We performed the effect size A-statistics test to evaluate the magnitude of the improvement. For the statistical significant test, we used Wilcoxon rank-sum test. The experimental results show that our approach can provide higher prediction performance than both the existing CPDP technique and the existing class imbalance technique.
机译:众所周知,软件缺陷预测是提高软件质量的最重要任务之一。使用缺陷预测器可以使测试工程师专注于缺陷模块。从而可以有效地分配测试资源,并可以降低质量保证成本。对于项目内缺陷预测(WPDP),公司内部应该有足够的数据来训练任何预测模型。没有此类本地数据,跨项目缺陷预测(CPDP)是可行的,因为它使用从其他公司的类似项目中收集的数据。软件缺陷数据集存在类不平衡问题,这增加了学习者预测缺陷的难度。此外,不平衡的数据对模型实际性能的影响可以通过选择的性能指标来隐藏。我们调查班级失衡学习是否对CPDP有益。在我们的方法中,不对称错误分类成本和从分布特征获得的相似度权重密切相关,以指导适当的重采样机制。我们进行了效果量A统计检验,以评估改善的幅度。对于统计显着性检验,我们使用了Wilcoxon秩和检验。实验结果表明,与现有的CPDP技术和现有的类不平衡技术相比,我们的方法可以提供更高的预测性能。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号