首页> 外文会议>International Symposium on System and Software Reliability >Empirical evaluation of the active learning strategies on software defects prediction
【24h】

Empirical evaluation of the active learning strategies on software defects prediction

机译:主动学习策略对软件缺陷预测的实证评估

获取原文

摘要

Software defect prediction is a popular technical method in software engineering. In order to reduce the cost of a software defects, problems existing in the software are found by testing software products. Software defect prediction often uses machine learning techniques to improve the performance of software testing but requires enough labeled data when training the model. Because the cost of obtaining data is different from the label, the data is easy to obtain, but the label is cumbersome and expensive. In order to demonstrate software defect prediction, after the data obtained active learning algorithm is introduced to query the data, and the most valuable data is selected for expert annotation and then put into the model for training. However, it is not clear which active learning query strategy to choose the most effective in the software defect prediction model. We use different active learning strategy software defect prediction models for comparison. Experiment on the NASA dataset, using Naive Bayes and SVM, Linear Regression as the classifier. Comprehensive research results show that the Density-weighted strategy has a significant effect on the data set.
机译:软件缺陷预测是软件工程中一种流行的技术方法。为了减少软件缺陷的成本,通过测试软件产品来发现软件中存在的问题。软件缺陷预测通常使用机器学习技术来提高软件测试的性能,但是在训练模型时需要足够的标记数据。由于获取数据的成本与标签不同,因此数据易于获得,但是标签麻烦且昂贵。为了证明软件缺陷预测,在引入获得的数据后,采用主动学习算法对数据进行查询,然后选择最有价值的数据进行专家标注,然后放入模型进行训练。但是,尚不清楚在软件缺陷预测模型中选择哪种最有效的主动学习查询策略。我们使用不同的主动学习策略软件缺陷预测模型进行比较。使用朴素贝叶斯(Naive Bayes)和SVM(线性回归)作为分类器,对NASA数据集进行实验。综合研究结果表明,密度加权策略对数据集具有显着影响。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号