首页> 外文会议>International conference on web information systems and applications >Is Bigger Data Better for Defect Prediction: Examining the Impact of Data Size on Supervised and Unsupervised Defect Prediction
【24h】

Is Bigger Data Better for Defect Prediction: Examining the Impact of Data Size on Supervised and Unsupervised Defect Prediction

机译:更大的数据对缺陷预测是否更好:检查数据大小对有监督和无监督缺陷预测的影响

获取原文

摘要

Defect prediction could help software practitioners to predict the future occurrence of bugs in the software code regions. In order to improve the accuracy of defect prediction, dozens of supervised and unsupervised methods have been put forward and achieved good results in this field. One limiting factor of defect prediction is that the data size of defect data is not big, which restricts the scope of application with defect prediction models. In this study, we try to construct bigger defect datasets by merging available datasets with the same measurement dimension and check whether bigger data will bring better defect prediction performance with supervised and unsupervised models or not. The results of our experiment reveal that larger-scale dataset doesn't bring improvements of both supervised and unsupervised classifiers.
机译:缺陷预测可以帮助软件从业人员预测软件代码区域中错误的未来发生。为了提高缺陷预测的准确性,提出了数十种有监督和无监督的方法,并在该领域取得了良好的效果。缺陷预测的一个限制因素是缺陷数据的数据量不大,这限制了缺陷预测模型的应用范围。在这项研究中,我们尝试通过合并具有相同测量维度的可用数据集来构建更大的缺陷数据集,并检查更大的数据是否可以在有监督和无监督的模型下带来更好的缺陷预测性能。我们的实验结果表明,大规模数据集并没有带来监督分类器和非监督分类器的改进。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号