...
首页> 外文期刊>Empirical Software Engineering >Uncertain Classification of Fault-Prone Software Modules
【24h】

Uncertain Classification of Fault-Prone Software Modules

机译:Fault-Prone软件模块的不确定分类

获取原文
获取原文并翻译 | 示例

摘要

Many development organizations try to minimize faults in software as a means for improving customer satisfaction. Assuring high software quality often entails time-consuming and costly development processes. A software quality model based on software metrics can be used to guide enhancement efforts by predicting which modules are fault-prone. This paper presents statistical techniques to determine which predictions by a classification tree should be considered uncertain. We conducted a case study of a large legacy telecommunications system. One release was the basis for the training dataset, and the subsequent release was the basis for the evaluation dataset. We built a classification tree using the TREEDISC algorithm, which is based on χ~2 tests of contingency tables. The model predicted whether a module was likely to have faults discovered by customers, or not, based on software product, process, and execution metrics. We simulated practical use of the model by classifying the modules in the evaluation dataset. The model achieved useful accuracy, in spite of the very small proportion of fault-prone modules in the system. We assessed whether the classes assigned to the leaves were appropriate by statistical tests, and found sizable subsets of modules with uncertain classification. Discovering which modules have uncertain classifications allows sophisticated enhancement strategies to resolve uncertainties. Moreover, TREEDISC is especially well suited to identifying uncertain classifications.
机译:许多开发组织试图将软件故障最小化,以提高客户满意度。确保高质量的软件通常需要耗时且昂贵的开发过程。基于软件指标的软件质量模型可用于通过预测哪些模块容易出现故障来指导增强工作。本文介绍了统计技术,以确定应该将分类树的哪些预测视为不确定的。我们进行了一个大型传统电信系统的案例研究。一个版本是训练数据集的基础,而后续版本是评估数据集的基础。我们使用TREEDISC算法构建了分类树,该树基于列联表的χ〜2检验。该模型基于软件产品,过程和执行指标来预测模块是否可能被客户发现故障。通过对评估数据集中的模块进行分类,我们模拟了模型的实际使用。尽管系统中容易出错的模块所占比例很小,但该模型仍能达到有用的精度。我们通过统计测试评估了分配给叶子的类别是否合适,并发现了具有不确定分类的模块的相当大的子集。发现哪些模块具有不确定的分类,可以使用复杂的增强策略来解决不确定性。此外,TREEDISC特别适合识别不确定的分类。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号