首页> 外文期刊>Journal of software >Modeling Software Defects as Anomalies: A Case Study on Promise Repository
【24h】

Modeling Software Defects as Anomalies: A Case Study on Promise Repository

机译:将软件缺陷建模为异常:以Promise存储库为例

获取原文
获取外文期刊封面目录资料

摘要

Software defect prediction is a highly studied domain in Software Engineering research due to its importance in software development. In literature, various classification methods with static code attributes have been used to predict defects. However, defected instances are very few compared to non-defected instances and as such lead to imbalanced data. Traditional machine learning techniques give poor results for such data. In this paper an anomaly detection technique for software defect prediction, is proposed which is not affected by imbalanced data. The technique incorporates both univariate and multivariate Gaussian distribution to model non-defected software module. The defected software modules are then predicted based on their deviation from the generated model. To evaluate our approach, we implemented the algorithm and tested it on the NASA datasets from the PROMISE repository. By utilizing this approach, we observed an average balance of 63.36% and 69.06% in univariate model and multivariate model respectively. Without utilizing optimization or filter, this approach yield better result than industry standard of 60%.
机译:由于软件缺陷预测在软件开发中的重要性,因此它在软件工程研究中是一个高度研究的领域。在文献中,具有静态代码属性的各种分类方法已用于预测缺陷。但是,与没有缺陷的实例相比,有缺陷的实例很少,因此会导致数据不平衡。传统的机器学习技术对于此类数据给出的结果很差。提出了一种不受数据不平衡影响的软件缺陷预测异常检测技术。该技术结合了单变量和多变量高斯分布,以对非缺陷软件模块进行建模。然后根据缺陷软件模块与生成的模型的偏差来预测缺陷软件模块。为了评估我们的方法,我们实施了该算法,并在PROMISE存储库中的NASA数据集上对其进行了测试。通过使用这种方法,我们观察到单变量模型和多变量模型的平均余额分别为63.36%和69.06%。在不利用优化或过滤器的情况下,此方法比60%的行业标准产生更好的结果。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号