首页> 外文学位 >Incremental development and cost-based evaluation of software fault prediction models
【24h】

Incremental development and cost-based evaluation of software fault prediction models

机译:软件故障预测模型的增量开发和基于成本的评估

获取原文
获取原文并翻译 | 示例

摘要

It is difficult to build high quality software with limited quality assurance budgets. Software fault prediction models can be used to learn fault predictors from software metrics. Fault prediction prior to software release can guide Verification and Validation (V&V) activity and allocate scarce resources to modules which are predicted to be fault-prone.;One of the most important goals of fault prediction is to detect fault prone modules as early as possible in the software development life cycle. Design and code metrics have been successfully used for predicting fault-prone modules. In this dissertation, we introduce fault prediction from software requirements. Furthermore, we investigate the advantages of the incremental development of software fault prediction models, and we compare the performance of these models as the volume of data and their life cycle origin (design, code, or their combination) evolution during project development. We confirm that increasing the volume of training data improves model performance. And that, models built from code metrics typically outperform those built using design metrics only. However, both types of models prove to be useful as they can be constructed in different phases of the life cycle. We also demonstrate that models that utilize a combination of design and code level metrics outperform models which use either one metric set exclusively.;In evaluation of fault prediction models, misclassification cost has been neglected. Using a graphical measurement, the cost curve, we evaluate software fault prediction models. Cost curves not only allow software quality engineers to introduce project-specific misclassification costs into model evaluation, but also allow them to incorporate module-specific misclassification costs into model evaluation. Classifying a software module as fault-prone implies the application of some verification activities, thus adding to the development cost. Misclassifying a module as fault free carries the risk of system failure, and is also associated with cost implications. Our results, through the analysis of more than ten projects from public repositories, support a recommendation to adopt cost curves as one of the standard methods for software fault prediction model performance evaluation.
机译:用有限的质量保证预算来构建高质量的软件是困难的。软件故障预测模型可用于从软件指标中学习故障预测器。在软件发布之前进行故障预测可以指导验证和确认(V&V)活动,并为预计容易发生故障的模块分配稀缺资源。故障预测的最重要目标之一是尽早检测容易发生故障的模块在软件开发生命周期中。设计和代码指标已成功用于预测容易发生故障的模块。本文从软件需求出发介绍故障预测。此外,我们研究了软件故障预测模型的增量开发的优势,并在项目开发过程中比较了这些模型在数据量及其生命周期起源(设计,代码或其组合)方面的性能。我们确认增加训练数据量可以改善模型性能。而且,根据代码指标构建的模型通常优于仅使用设计指标构建的模型。但是,两种模型都证明是有用的,因为它们可以在生命周期的不同阶段进行构建。我们还证明,结合使用设计和代码级别指标的模型优于仅使用一个指标集的模型。在评估故障预测模型时,忽略了误分类成本。使用图形化度量,成本曲线,我们评估软件故障预测模型。成本曲线不仅允许软件质量工程师将特定于项目的分类错误成本引入模型评估,而且还允许他们将特定于模块的分类错误成本纳入模型评估。将软件模块归类为容易发生故障意味着应用了一些验证活动,从而增加了开发成本。将模块错误地分类为无故障会带来系统故障的风险,并且还会带来成本问题。通过对来自公共存储库的十多个项目的分析,我们的结果支持建议采用成本曲线作为软件故障预测模型性能评估的标准方法之一。

著录项

  • 作者

    Jiang, Yue.;

  • 作者单位

    West Virginia University.;

  • 授予单位 West Virginia University.;
  • 学科 Computer science.
  • 学位 Ph.D.
  • 年度 2009
  • 页码 135 p.
  • 总页数 135
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类
  • 关键词

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号