首页> 外文会议>International Conference on Future Software Engineering and Multimedia Engineering >Suitability of KNN Regression in the Development of Interaction Based Software Fault Prediction Models
【24h】

Suitability of KNN Regression in the Development of Interaction Based Software Fault Prediction Models

机译:KNN回归在基于交互的软件故障预测模型的开发中的适用性

获取原文
获取外文期刊封面目录资料

摘要

Accurate fault prediction is an indispensable step, to the extent of being a critical activity in software engineering. In fault prediction model development research, combination of metrics significantly improves the prediction capability of the model, but it also gives rise to the issue of handling an increased number of predictors and evolved nonlinearity due to complex interaction among metrics. Ordinary least square (OLS) based parametric regression techniques cannot effectively model such nonlinearity with a large number of predictors because the global parametric function to fit the data is not known beforehand. In our previous studies[1-3], we showed the impact of interaction in the combined metrics approach of fault prediction and statistically established the simultaneous increment in the predictive accuracy of the model with interaction. In this study we use K-Nearest Neighbor (KNN) regression as an example of nonparametric regression technique, otherwise well known for classification tasks in the data mining community. Through the results derived here, we empirically establish and validate the hypothesis that the performance of KNN regression remains ordinarily unaffected with increasing number of interacting predictors and simultaneously provides superior performance over widely used multiple linear regression (MLR).
机译:准确的故障预测是一个不可或缺的步骤,到成为软件工程中的关键活动的程度。在故障预测模型开发研究中,指标的组合显着提高了模型的预测能力,但由于度量之间的复杂相互作用,它也引起了处理增加数量的预测器和进化非线性的问题。基于普通的基于总量(OLS)的参数回归技术不能有效地模拟大量预测器的这种非线性,因为预先知道数据的全局参数函数不知道。在我们以前的研究[1-3]中,我们在故障预测中的组合度量方法中展示了相互作用的影响,并在统计上建立了与交互模型的预测精度同时增加。在这项研究中,我们使用K-最近邻(knn)回归作为非参数回归技术的示例,否则在数据挖掘社区中众所周知的分类任务。通过此处的结果,我们经验地建立和验证了kNN回归性能往往不会因越来越多的相互作用预测因子而不受影响的假设,并同时提供优异的性能,通过广泛使用的多元线性回归(MLR)。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号