首页> 外文期刊>Computer standards & interfaces >An empirical analysis of the effectiveness of software metrics and fault prediction model for identifying faulty classes
【24h】

An empirical analysis of the effectiveness of software metrics and fault prediction model for identifying faulty classes

机译:对用于识别故障类别的软件指标和故障预测模型的有效性的实证分析

获取原文
获取原文并翻译 | 示例
           

摘要

Software fault prediction models are used to predict faulty modules at the very early stage of software development life cycle. Predicting fault proneness using source code metrics is an area that has attracted several researchers' attention. The performance of a model to assess fault proneness depends on the source code metrics which are considered as the input for the model. In this work, we have proposed a framework to validate the source code metrics and identify a suitable set of source code metrics with the aim to reduce irrelevant features and improve the performance of the fault prediction model. Initially, we applied a f-test analysis and univariate logistic regression analysis to each source code metric to evaluate their potential for predicting fault proneness. Next, we performed a correlation analysis and multivariate linear regression stepwise forward selection to find the right set of source code metrics for fault prediction. The obtained set of source code metrics are considered as the input to develop a fault prediction model using a neural network with five different training algorithms and three different ensemble methods. The effectiveness of the developed fault prediction models are evaluated using a proposed cost evaluation framework. We performed experiments on fifty six Open Source Java projects. The experimental results reveal that the model developed by considering the selected set of source code metrics using the suggested source code metrics validation framework as the input achieves better results compared to all other metrics. The experimental results also demonstrate that the fault prediction model is best suitable for projects with faulty classes less than the threshold value depending on fault identification efficiency flow - 48.89%, median- 39.26%, and high - 27.86%).
机译:软件故障预测模型用于在软件开发生命周期的早期阶段预测故障模块。使用源代码度量来预测故障倾向性是吸引了许多研究人员关注的领域。用于评估故障倾向性的模型的性能取决于被视为模型输入的源代码指标。在这项工作中,我们提出了一个框架来验证源代码指标并确定一组合适的源代码指标,以减少不相关的功能并提高故障预测模型的性能。最初,我们对每个源代码指标应用了f检验分析和单变量logistic回归分析,以评估其预测故障倾向的潜力。接下来,我们进行了相关分析和多元线性回归逐步正向选择,以找到用于故障预测的正确的源代码指标集。所获得的一组源代码度量标准被视为使用神经网络开发故障预测模型的输入,该神经网络具有五种不同的训练算法和三种不同的集成方法。使用建议的成本评估框架评估开发的故障预测模型的有效性。我们对56个开源Java项目进行了实验。实验结果表明,与建议的所有其他指标相比,使用建议的源代码指标验证框架通过考虑选定的一组源代码指标而开发的模型具有更好的结果。实验结果还表明,故障预测模型最适合于故障类别小于阈值的项目,具体取决于故障识别效率流(48.89%,中位数39.26%和高27.86%)。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号