首页> 外文会议>IEEE International Conference on Software Analysis, Evolution and Reengineering >Enhancing Just-in-Time Defect Prediction Using Change Request-based Metrics
【24h】

Enhancing Just-in-Time Defect Prediction Using Change Request-based Metrics

机译:使用更改基于请求的指标增强立即缺陷预测

获取原文

摘要

Identifying defective software components as early as their commit helps to reduce significant software development and maintenance costs. In recent years, several studies propose to use just-in-time (JIT) defect prediction techniques to identify changes that could introduce defects at check-in time. To predict defect introducing changes, JIT defect prediction approaches use change metrics collected from software repositories. These change metrics, however, capture code and code change related information. Information related to the change requests (e.g., clarity of change request and difficulty to implement the change) that could determine the change’s proneness to introducing new defects are not studied. In this study, we propose to augment the publicly available change metrics dataset with six change request- based metrics collected from issue tracking systems. To build the prediction model, we used five machine learning algorithms: AdaBoost, XGBoost, Deep Neural Network, Random Forest and Logistic Regression. The proposed approach is evaluated using a dataset collected from four open source software systems, i.e., Eclipse platform, Eclipse JDT, Bugzilla and Mozilla. The results show that the augmented dataset improves the performance of JIT defect prediction in 19 out of 20 cases. F1-score of JIT defect prediction in the four systems is improved by an average of 4.8%, 3.4%, 1.7%, 1.1% and 1.1% while using AdaBoost, XGBoost, Deep Neural Network, Random Forest and Logistic Regression, respectively.
机译:早期识别有缺陷的软件组件,因为他们的提交有助于降低显着的软件开发和维护成本。近年来,几项研究建议使用即时(JIT)缺陷预测技术来识别可能在办理入住时间下引入缺陷的变化。为了预测引入变化的缺陷,JIT缺陷预测方法使用从软件存储库收集的变化指标。但是,这些更改指标,捕获代码和代码更改相关信息。无法研究与更改请求相关的信息(例如,改变请求的清晰度和实施变更),可以确定更改对引入新缺陷的更改。在这项研究中,我们建议使用从问题跟踪系统中收集的六个更改基于请求的指标来增加公开的更改度量数据集。为了构建预测模型,我们使用了五种机器学习算法:Adaboost,XGBoost,深神经网络,随机森林和逻辑回归。使用从四个开源软件系统,即Eclipse平台,Eclipse JDT,Bugzilla和Mozilla收集的数据集来评估所提出的方法。结果表明,增强数据集提高了20例中有19例的JIT缺陷预测的性能。 F1评分四种系统中的JIT缺陷预测平均分别提高了4.8%,3.4%,1.7%,1.1%和1.1%,同时使用Adaboost,Xgboost,深神经网络,随机森林和逻辑回归。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号