Mining Static Code Metrics for a Robust Prediction of Software Defect-Proneness

机译：挖掘静态代码度量标准，以可靠地预测软件缺陷的准确性

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

Defect-proneness prediction is affected by multiple aspects including sampling bias, non-metric factors, uncertainty of models etc. These aspects often contribute to prediction uncertainty and result in variance of prediction. This paper proposes two methods of data mining static code metrics to enhance defect-proneness prediction. Given little non-metric or qualitative information extracted from software codes, we first suggest to use a robust unsupervised learning method, shared nearest neighbors (SNN) to extract the similarity patterns of the code metrics. These patterns indicate similar characteristics of the components of the same cluster that may result in introduction of similar defects. Using the similarity patterns with code metrics as predictors, defect-proneness prediction may be improved. The second method uses the Occam's windows and Bayesian model averaging to deal with model uncertainty: first, the datasets are used to train and cross-validate multiple learners and then highly qualified models are selected and integrated into a robust prediction. From a study based on 12 datasets from NASA, we conclude that our proposed solutions can contribute to a better defect-proneness prediction.

机译：缺陷倾向性预测受多个方面的影响，包括采样偏差，非度量因素，模型的不确定性等。这些方面通常会导致预测不确定性并导致预测差异。本文提出了两种数据挖掘静态代码指标的方法，以增强缺陷倾向性预测。给定很少的从软件代码中提取的非度量或定性信息，我们首先建议使用鲁棒的无监督学习方法，共享最近邻（SNN）来提取代码度量的相似性模式。这些模式表明同一簇的组件具有相似的特性，可能导致引入相似的缺陷。使用具有代码量度的相似性模式作为预测变量，可以改善缺陷倾向性预测。第二种方法使用Occam的窗口和贝叶斯模型平均来处理模型不确定性：首先，使用数据集来训练和交叉验证多个学习者，然后选择高质量的模型并将其集成到可靠的预测中。通过基于来自NASA的12个数据集的研究，我们得出结论，我们提出的解决方案可以有助于更好地预测缺陷倾向性。

著录项

来源
《2011 Fifth International Symposium on Empirical Software Engineering and Measurement》|2011年|p.207-214|共8页
会议地点
作者
Li Lianfa; Leung Hareton;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类软件工程;
关键词
data mining; defect-proneness; robust prediction; software quality; uncertainty;

机译：数据挖掘;缺陷倾向性;稳健预测;软件质量;不确定性;

相似文献

外文文献
中文文献
专利

1. Predicting Code Smells and Analysis of Predictions: Using Machine Learning Techniques and Software Metrics [J] . Mohammad Y.Mhawish, Manjari Gupta 计算机科学技术学报（英文版） . 2020,第006期

机译：预测代码气味并进行预测分析：使用机器学习技术和软件指标
2. Static Analysis and Code Complexity Metrics as Early Indicators of Software Defects [J] . Safa Omri, Pascal Montag, Carsten Sinz Journal of Software Engineering and Applications . 2018,第4期

机译：静态分析和代码复杂性度量作为软件缺陷的早期指标
3. A Novel Approach to Determine Software Security Level using Bayes Classifier via Static Code Metrics [J] . Sariman Guncel, Kucuksille Ecir Ugur Elektronika ir Elektrotechnika . 2016,第2期

机译：通过静态代码指标使用贝叶斯分类器确定软件安全级别的新方法
4. Mining Static Code Metrics for a Robust Prediction of Software Defect-Proneness [C] . Li Lianfa, Leung Hareton International Symposium on Empirical Software Engineering and Measurement . 2011

机译：挖掘静态代码指标，用于软件缺陷的强大预测
5. Aspect mining using self-organizing maps with method level dynamic software metrics as input vectors. [D] . Maisikeli, Sayyed Garba. 2009

机译：使用具有方法级别动态软件指标作为输入向量的自组织映射进行方面挖掘。
6. Web service QoS prediction using improved software source code metrics [O] . Sarathkumar Rangarajan, Huai Liu, Hua Wang 2020

机译：使用改进的软件源代码指标预测Web服务QoS预测
7. Mining static code metrics for a robust prediction of software defect-proneness [O] . Li L, Leung H 2011

机译：挖掘静态代码指标，以可靠地预测软件缺陷倾向

Mining Static Code Metrics for a Robust Prediction of Software Defect-Proneness

摘要

著录项

相似文献

相关主题

期刊订阅