首页> 外文会议>International Conference on Evaluation and Assessment in Software Engineering >Assessing the Quality and Cleaning of a Software Project Dataset: An Experience Report
【24h】

Assessing the Quality and Cleaning of a Software Project Dataset: An Experience Report

机译:评估软件项目数据集的质量和清洁:体验报告

获取原文

摘要

OBJECTIVE - The aim is to report upon an assessment of the impact noise has on the predictive accuracy by comparing noise handling techniques. METHOD - We describe the process of cleaning a large software management dataset comprising initially of more than 10,000 projects. The data quality is mainly assessed through feedback from the data provider and manual inspection of the data. Three methods of noise correction (polishing, noise elimination and robust algorithms) are compared with each other assessing their accuracy. The noise detection was undertaken by using a regression tree model. RESULTS - Three noise correction methods are compared and different results in their accuracy where noted. CONCLUSIONS - The results demonstrated that polishing improves classification accuracy compared to noise elimination and robust algorithms approaches.
机译:目标 - 目的是通过比较噪声处理技术对冲击噪声进行预测准确性的评估报告。方法 - 我们描述了清洁大型软件管理数据集的过程,该数据集包含最初超过10,000个项目。数据质量主要通过来自数据提供商的反馈和数据的反馈来评估。将三种噪声校正方法(抛光,噪声消除和鲁棒算法)进行比较,相互评估其准确性。通过使用回归树模型进行噪声检测。结果 - 比较三种噪声校正方法,并在注明的准确度下不同导致结果。结论 - 结果表明,与噪声消除和强大的算法相比,抛光提高了分类准确性。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号