首页> 外文会议>IEEE International Symposium on Software Reliability Engineering >Cross-Project Aging-Related Bug Prediction Based on Joint Distribution Adaptation and Improved Subclass Discriminant Analysis
【24h】

Cross-Project Aging-Related Bug Prediction Based on Joint Distribution Adaptation and Improved Subclass Discriminant Analysis

机译:基于联合分布自适应和改进子类判别分析的跨项目老化相关错误预测

获取原文

摘要

Software aging, which is caused by Aging-Related Bugs (ARBs), refers to the phenomenon of performance degradation and eventual crash in long running systems. In order to discover and remove ARBs, ARB prediction is proposed. However, due to the low presence and reproducing difficulty of ARBs, it is usually difficult to collect sufficient ARB data within a project. Therefore, cross-project ARB prediction is proposed as a solution to build the target project’s ARB predictor by using the labeled data from the source project. A key point for cross-project ARB prediction is to reduce distribution difference between source and target project. However, existing approaches mainly focus on the marginal distribution difference while somehow overlook the conditional distribution difference, and they mainly use random oversampling to alleviate the class imbalance which may lead to overfitting. To address these problems, we propose a new crossproject ARB prediction approach based on Joint Distribution Adaptation (JDA) and Improved Subclass Discriminant Analysis (ISDA), called JDA-ISDA. The key idea of JDA-ISDA is first to use JDA to reduce the marginal distribution and conditional distribution difference jointly and then apply ISDA to alleviate the severe class imbalance problem. A set of experiments are carried out on two large open-source projects with six different machine learning (ML) classifiers. The experimental results demonstrate that compared with the state-of-the-art Transfer Learning based Aging-related bug Prediction (TLAP) and Supervised Representation Learning Approach (SRLA), JDA-ISDA is much more robust to different ML classifiers than TLAP, and the average improvement in terms of the balance value can be achieved up to 31.8%, and JDA-ISDA also outperforms TLAP and SRLA on average when logistic regression is chosen as the classifier for best performance prediction.
机译:由老化相关的错误(ARB)引起的软件老化是指性能下降和长时间运行的系统最终崩溃的现象。为了发现和去除ARB,提出了ARB预测。但是,由于ARB的存在率低且难以复制,因此通常很难在项目中收集足够的ARB数据。因此,提出了跨项目ARB预测作为解决方案,通过使用源项目中的标记数据来构建目标项目的ARB预测器。跨项目ARB预测的关键是减小源项目与目标项目之间的分布差异。但是,现有方法主要关注边际分布差异,而以某种方式忽略了条件分布差异,并且它们主要使用随机过采样来减轻可能导致过度拟合的类不平衡。为了解决这些问题,我们提出了一种新的跨项目ARB预测方法,该方法基于联合分布适应(JDA)和改进的子类判别分析(ISDA),称为JDA-ISDA。 JDA-ISDA的关键思想是首先使用JDA来减少边际分布和条件分布差异,然后应用ISDA来缓解严重的阶级失衡问题。在具有六个不同机器学习(ML)分类器的两个大型开源项目上进行了一组实验。实验结果表明,与基于最新的基于迁移学习的老化相关错误预测(TLAP)和监督表示学习方法(SRLA)相比,JDA-ISDA对不同的ML分类器的健壮性要比TLAP强得多,并且在平衡值计平均改善,可以实现高达31.8%,而当逻辑回归被选择作为分类器以获得最佳性能预测JDA-ISDA也优于TLAP和SRLA上平均。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号