首页> 外文期刊>Concurrency and computation: practice and experience >A novel software defect prediction approach usingmodified objective cluster analysis
【24h】

A novel software defect prediction approach usingmodified objective cluster analysis

机译:一种新颖的软件缺陷预测方法,使用副的目标集群分析

获取原文
获取原文并翻译 | 示例

摘要

A novel approach for software defect prediction of unlabeled datasets is proposed using modified objective cluster analysis (OCA). In this approach, the first step is to construct the distance matrix of instances in the datasets by utilizing the automatically determined clusters through the modified OCA. Then the dipoles within different instances are categorized into two different groups. Finally, the clusters of instances are produced, and software defects can be predicted by imposing a modified consistency criterion. Case study and comparative experiments were conducted based on 12 public datasets selected from the databases of Promise and ReLink using multiple different unsupervised algorithms and cross-project approaches. There are two experimental settings: experiments with datasets that contain all metrics and experiments with datasets that contain only module size metrics. The results were evaluated by the index of precision, recall, F-measure, and receiver operating characteristic curve (AUC). Furthermore, a complexity analysis of the tested algorithms was conducted as well. In experiments with datasets with all metrics, the proposed OCA gets the best results in four indexes, and the average values of precision, recall, F-measure, and AUC were improved by a minimum of 1.52%, 2.78%, 19.84%, and 0.93%, respectively. In experiments with datasets with only module size metrics, the proposed OCA performed the best results in four indexes also, and the average values of precision, F-measure, and AUC were improved by a minimum of 8.8%, 2.59%, and 8.36%, respectively. The proposed algorithm is of low complexity and provides a new way to efficiently predict software defects with unlabeled datasets.
机译:使用修改的目标群集分析(OCA)提出了一种用于软件缺陷预测的软件缺陷预测的新方法。在这种方法中,第一步骤是通过利用通过修改的OCA自动确定的群集来构造数据集中的距离矩阵。然后在不同实例中的偶极子分为两个不同的组。最后,产生实例的集群,并且通过强制修改后的一致性标准,可以预测软件缺陷。案例研究和比较实验是基于从承诺数据库中选择的12个公共数据集进行,使用多个不同无监督的算法和交叉项目方法。有两种实验设置:使用仅包含模块大小指标的数据集的数据集进行实验。通过精度,召回,F测量和接收器操作特征曲线(AUC)的指标评估结果。此外,还进行了对测试算法的复杂性分析。在具有所有度量的数据集的实验中,所提出的OCA获得了四个指标的最佳结果,精度,召回,F测量和AUC的平均值提高了最低1.52%,2.78%,19.84%,和分别为0.93%。在仅具有模块大小度量的数据集的实验中,所提出的OCA在四个指标中执行了最佳结果,并且精度,F测量和AUC的平均值提高了至少8.8%,2.59%和8.36% , 分别。该算法的复杂性低,提供了一种有效地预测未标记数据集的软件缺陷的新方法。

著录项

获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号