Empirical evaluation of the active learning strategies on software defects prediction

机译：主动学习策略对软件缺陷预测的实证评估

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

Software defect prediction is a popular technical method in software engineering. In order to reduce the cost of a software defects, problems existing in the software are found by testing software products. Software defect prediction often uses machine learning techniques to improve the performance of software testing but requires enough labeled data when training the model. Because the cost of obtaining data is different from the label, the data is easy to obtain, but the label is cumbersome and expensive. In order to demonstrate software defect prediction, after the data obtained active learning algorithm is introduced to query the data, and the most valuable data is selected for expert annotation and then put into the model for training. However, it is not clear which active learning query strategy to choose the most effective in the software defect prediction model. We use different active learning strategy software defect prediction models for comparison. Experiment on the NASA dataset, using Naive Bayes and SVM, Linear Regression as the classifier. Comprehensive research results show that the Density-weighted strategy has a significant effect on the data set.

机译：软件缺陷预测是软件工程中一种流行的技术方法。为了减少软件缺陷的成本，通过测试软件产品来发现软件中存在的问题。软件缺陷预测通常使用机器学习技术来提高软件测试的性能，但是在训练模型时需要足够的标记数据。由于获取数据的成本与标签不同，因此数据易于获得，但是标签麻烦且昂贵。为了证明软件缺陷预测，在引入获得的数据后，采用主动学习算法对数据进行查询，然后选择最有价值的数据进行专家标注，然后放入模型进行训练。但是，尚不清楚在软件缺陷预测模型中选择哪种最有效的主动学习查询策略。我们使用不同的主动学习策略软件缺陷预测模型进行比较。使用朴素贝叶斯（Naive Bayes）和SVM（线性回归）作为分类器，对NASA数据集进行实验。综合研究结果表明，密度加权策略对数据集具有显着影响。

著录项

来源
《International Symposium on System and Software Reliability》|2020年|83-89|共7页
会议地点
作者
Wenbo Mi; Yong Li; Shibo Wang;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类
关键词
Software; Predictive models; Support vector machines; Machine learning; Software testing; Information entropy; Data models;

机译：软件;预测模型;支持向量机;机器学习;软件测试;信息熵;数据模型;

相似文献

外文文献
中文文献
专利

1. Active Learning Empirical Research on Cross-Version Software Defect Prediction Datasets [J] . Fang Li, Yubin Qu, Junxia Ji, International Journal of Performability Engineering . 2020,第4期

机译：积极学习跨版软件缺陷预测数据集的实证研究
2. An empirical framework for defect prediction using machine learning techniques with Android software [J] . Malhotra Ruchika Applied Soft Computing . 2016,第Null期

机译：使用机器学习技术和Android软件进行缺陷预测的经验框架
3. EMPIRICAL ASSESSMENT OF MACHINE LEARNING BASED SOFTWARE DEFECT PREDICTION TECHNIQUES [J] . VENKATA UDAYA B. CHALLAGULLA, FAROKH B. BASTANI, I. LING YEN, International Journal of Artificial Intelligence Tools: Architectures, Languages, Algorithms . 2008,第2期

机译：基于机器学习的软件缺陷预测技术的实证评估
4. Substantiation of Software Defect Prediction using Statistical Learning: An Empirical Study [C] . Shiwang Agarwal, Sajal Gupta, Rishabh Aggarwal, 2019 4th International Conference on Internet of Things: Smart Innovation and Usages . 2019

机译：使用统计学习的软件缺陷预测的实证研究
5. Open source software projects' attractiveness, activeness, and efficiency as a path to software quality: An empirical evaluation of their relationships and causes [D] . Santos, Carlos, Jr. 2009

机译：开源软件项目的吸引力，活跃性和效率是提高软件质量的途径：对它们之间关系和原因的实证评估
6. Software Defect Prediction for Healthcare Big Data: An Empirical Evaluation of Machine Learning Techniques [O] . Bilal Khan, Rashid Naseem, Muhammad Arif Shah, 2021

机译：医疗保健大数据的软件缺陷预测：机器学习技术的实证评价
7. Software Defect Prediction for Healthcare Big Data: An Empirical Evaluation of Machine Learning Techniques [O] . Bilal Khan, Rashid Naseem, Muhammad Arif Shah, 2021

机译：医疗保健大数据的软件缺陷预测：机器学习技术的实证评价

Empirical evaluation of the active learning strategies on software defects prediction

摘要

著录项

相似文献

相关主题

期刊订阅