Conference: IEEE International Conference on Software Quality, Reliability and Security

An Empirical Analysis of Three-Stage Data-Preprocessing for Analogy-Based Software Effort Estimation on the ISBSG Data

Abstract

Analogy-based software effort estimation estimates the cost of an unseen project by analogy with previous projects that share selected features. The validity of the selected features depends on many factors, and one of the most crucial is the effectiveness of the data-preprocessing techniques applied to the datasets of the previous projects. In this paper, we report the first controlled experiment that studies the class of three-stage data-preprocessing techniques, with stages of missing-data imputation, data normalization, and feature selection, for analogy-based effort estimation. We conducted our investigation on the ISBSG data. The experimental results show that three-stage data-preprocessing techniques have a significant impact on the resulting effort estimation accuracy. The results also indicate that the combined use of Z-Score normalization, kNN imputation, and mutual-information-based feature weighting can be an effective choice for analogy-based effort estimation.
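To make the three stages concrete, the following is a minimal Python sketch of one such pipeline: kNN imputation, then Z-Score normalization, then mutual-information-based feature weighting, feeding a weighted nearest-neighbour effort estimate. It is an illustration under stated assumptions, not the procedure used in the paper: the function names, the equal-width binning used to compute mutual information, the mean-of-k-analogies estimator, and the toy project data are all hypothetical.

```python
import math
from collections import Counter

def knn_impute(rows, k=2):
    """Stage 1: replace None with the mean of that feature over the k nearest
    rows that have it (distance uses only features both rows observe)."""
    def dist(a, b):
        shared = [(x, y) for x, y in zip(a, b) if x is not None and y is not None]
        if not shared:
            return math.inf
        return math.sqrt(sum((x - y) ** 2 for x, y in shared) / len(shared))
    filled = []
    for i, row in enumerate(rows):
        new = list(row)
        for j, value in enumerate(row):
            if value is None:
                # Assumes at least one other row has feature j observed.
                donors = sorted((dist(row, other), other[j])
                                for m, other in enumerate(rows)
                                if m != i and other[j] is not None)[:k]
                new[j] = sum(v for _, v in donors) / len(donors)
        filled.append(new)
    return filled

def zscore(rows):
    """Stage 2: Z-Score normalization per feature; also returns means and stds."""
    cols = list(zip(*rows))
    means = [sum(c) / len(c) for c in cols]
    stds = [math.sqrt(sum((x - m) ** 2 for x in c) / len(c)) or 1.0
            for c, m in zip(cols, means)]
    scaled = [[(x - m) / s for x, m, s in zip(r, means, stds)] for r in rows]
    return scaled, means, stds

def bin_values(values, bins=3):
    """Equal-width discretization (an assumption made here to estimate MI)."""
    lo, hi = min(values), max(values)
    width = (hi - lo) / bins or 1.0
    return [min(int((v - lo) / width), bins - 1) for v in values]

def mutual_information(xs, ys):
    """Mutual information (in nats) between two discrete sequences."""
    n = len(xs)
    px, py, pxy = Counter(xs), Counter(ys), Counter(zip(xs, ys))
    return sum((c / n) * math.log(c * n / (px[x] * py[y]))
               for (x, y), c in pxy.items())

def mi_weights(rows, efforts, bins=3):
    """Stage 3: weight each feature by its MI with effort, normalized to sum to 1."""
    y = bin_values(efforts, bins)
    mi = [mutual_information(bin_values(list(col), bins), y) for col in zip(*rows)]
    total = sum(mi) or 1.0
    return [m / total for m in mi]

def estimate(query, rows, efforts, weights, k=2):
    """Analogy step: mean effort of the k closest past projects under a
    weighted Euclidean distance."""
    ranked = sorted(
        (math.sqrt(sum(w * (q - x) ** 2 for w, q, x in zip(weights, query, row))), e)
        for row, e in zip(rows, efforts))[:k]
    return sum(e for _, e in ranked) / len(ranked)

# Hypothetical past projects: [size, team size]; None marks a missing value.
history = [[100.0, 5.0], [120.0, None], [300.0, 12.0], [320.0, 14.0]]
efforts = [400.0, 520.0, 1500.0, 1700.0]

scaled, means, stds = zscore(knn_impute(history, k=2))
weights = mi_weights(scaled, efforts)
new_project = [(v - m) / s for v, m, s in zip([110.0, 6.0], means, stds)]
print(estimate(new_project, scaled, efforts, weights, k=2))
```

The new project is scaled with the means and standard deviations of the historical data, so that query and analogies live in the same normalized space. The stage order follows the abstract (imputation, normalization, feature weighting); other orderings and other techniques per stage are equally possible instances of the three-stage class.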
