...
首页> 外文期刊>Journal of software maintenance and evolution rsearch and practice >Selecting best predictors from large software repositories for highly accurate software effort estimation
【24h】

Selecting best predictors from large software repositories for highly accurate software effort estimation

机译:从大型软件存储库中选择最佳预测器,以获得高精度的软件努力估算

获取原文
获取原文并翻译 | 示例
   

获取外文期刊封面封底 >>

       

摘要

Accurate prediction of software effort is important for planning, scheduling, and allocating resources. However, software effort estimation has been a challenging task. Although numerous estimation models have been proposed, few achieve anything close to accurate prediction of software development effort. To achieve optimal results, machine learning techniques have recently been employed for predicting software development effort using relatively large software repositories. However, some issues remain unresolved, and this paper aims to address the following issues. First, feature selection methods often neglected the information rich variables present in the dataset. Second, selection of important features was done through statistical methods, which lack domain knowledge. Third, missing values in the data that significantly influence the prediction outcome was not efficiently handled. Fourth, majority of the literature neglected advanced evaluation measures, which thoroughly evaluate the ability of learning models to produce accurate results. To address the above issues, a machine learning-based model has been proposed in this paper, which not only allows effective preprocessing of data but also provides highly accurate prediction results with minimum error rate. The purpose is to best identify attributes (predictors) from large software repositories that are most influential in the estimation of effort. In addition, we apply MMRE for better performance analysis.
机译:精确预测软件工作对于规划,调度和分配资源非常重要。但是,软件努力估算是一个具有挑战性的任务。虽然已经提出了许多估计模型,但很少有几点靠近对软件开发工作的准确预测。为了实现最佳结果,最近使用机器学习技术来使用相对大的软件存储库预测软件开发工作。但是,一些问题仍未解决,本文旨在解决以下问题。首先,特征选择方法通常忽略了数据集中存在的富有变量的信息。其次,通过统计方法选择重要特征,缺乏域知识。第三,没有有效处理显着影响预测结果的数据中的缺失值。第四,大多数文献忽略了先进的评价措施,这彻底评估了学习模型产生准确结果的能力。为了解决上述问题,本文提出了一种基于机器学习的模型,这不仅允许有效预处理数据,而且还提供了最小误差率的高精度预测结果。目的是最好地识别来自大型软件存储库的属性(预测因子),这些存储库最具影响力在努力估算中。此外,我们应用MMRE以获得更好的性能分析。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号