首页> 外文会议>IEEE/ACM International Conference on Software Engineering >Missing Data Imputation Based on Low-Rank Recovery and Semi-Supervised Regression for Software Effort Estimation
【24h】

Missing Data Imputation Based on Low-Rank Recovery and Semi-Supervised Regression for Software Effort Estimation

机译:基于低排名恢复和半监督回归的软件努力估算的半监督回归缺失数据估算

获取原文

摘要

Software effort estimation (SEE) is a crucial step in software development. Effort data missing usually occurs in real-world data collection. Focusing on the missing data problem, existing SEE methods employ the deletion, ignoring, or imputation strategy to address the problem, where the imputation strategy was found to be more helpful for improving the estimation performance. Current imputation methods in SEE use classical imputation techniques for missing data imputation, yet these imputation techniques have their respective disadvantages and might not be appropriate for effort data. In this paper, we aim to provide an effective solution for the effort data missing problem. Incompletion includes the drive factor missing case and effort label missing case. We introduce the low-rank recovery technique for addressing the drive factor missing case. And we employ the semi-supervised regression technique to perform imputation in the case of effort label missing. We then propose a novel effort data imputation approach, named low-rank recovery and semi-supervised regression imputation (LRSRI). Experiments on 7 widely used software effort datasets indicate that: (1) the proposed approach can obtain better effort data imputation effects than other methods; (2) the imputed data using our approach can apply to multiple estimators well.
机译:软件努力估算(参见)是软件开发的重要步骤。努力丢失的努力通常发生在现实世界数据收集中。关注缺少的数据问题,现有的参见方法采用删除,忽略或归咎策略来解决问题,其中发现估算策略更有助于提高估计性能。目前的估算方法在使用缺失数据归档的使用经典销售技术中,但这些拒绝技术具有各自的缺点,并且可能不适合努力数据。在本文中,我们的目标是为努力提供有效的解决方案缺失问题。不完整包括驱动因子丢失的案例和努力标签丢失的情况。我们介绍了用于解决驱动因子丢失案例的低级恢复技术。我们采用半监督回归技术在缺少努力标签的情况下进行估算。然后,我们提出了一种新颖的努力数据归象方法,命名为低级恢复和半监督回归归属(LRSRI)。 7个广泛使用的软件努力数据集的实验表明:(1)所提出的方法可以获得比其他方法更好的努力数据归咎效果; (2)使用我们的方法的估算数据可以很好地适用于多个估算器。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号