首页> 外文期刊>Journal of Intelligent Systems >A Comparison of Semi-Supervised Classification Approaches for Software Defect Prediction
【24h】

A Comparison of Semi-Supervised Classification Approaches for Software Defect Prediction

机译:用于软件缺陷预测的半监督分类方法的比较

获取原文
获取原文并翻译 | 示例
           

摘要

Predicting the defect-prone modules when the previous defect labels of modules are limited is a challenging problem encountered in the software industry. Supervised classification approaches cannot build high-performance prediction models with few defect data, leading to the need for new methods, techniques, and tools. One solution is to combine labeled data points with unlabeled data points during learning phase. Semi-supervised classification methods use not only labeled data points but also unlabeled ones to improve the generalization capability. In this study, we evaluated four semi-supervised classification methods for semi-supervised defect prediction. Low-density separation (LDS), support vector machine (SVM), expectation-maximization (EM-SEMI), and class mass normalization (CMN) methods have been investigated on NASA data sets, which are CM1, KC1, KC2, and PC1. Experimental results showed that SVM and LDS algorithms outperform CMN and EM-SEMI algorithms. In addition, LDS algorithm performs much better than SVM when the data set is large. In this study, the LDS-based prediction approach is suggested for software defect prediction when there are limited fault data.
机译:当模块的先前缺陷标签受到限制时,预测易发生缺陷的模块是软件行业中遇到的具有挑战性的问题。监督分类方法无法建立具有很少缺陷数据的高性能预测模型,从而需要新的方法,技术和工具。一种解决方案是在学习阶段将标记的数据点与未标记的数据点组合在一起。半监督分类方法不仅使用标记的数据点,而且使用未标记的数据点来提高泛化能力。在本研究中,我们评估了四种用于半监督缺陷预测的半监督分类方法。在NASA数据集CM1,KC1,KC2和PC1上研究了低密度分离(LDS),支持向量机(SVM),期望最大化(EM-SEMI)和类质量归一化(CMN)方法。实验结果表明,SVM和LDS算法优于CMN和EM-SEMI算法。另外,当数据集很大时,LDS算法的性能比SVM好得多。在这项研究中,当故障数据有限时,建议使用基于LDS的预测方法进行软件缺陷预测。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号