首页> 外文会议>Asia-Pacific Software Engineering Conference >Filter-INC: Handling Effort-Inconsistency in Software Effort Estimation Datasets
【24h】

Filter-INC: Handling Effort-Inconsistency in Software Effort Estimation Datasets

机译:Filter-Inc:处理软件工作估算数据集的努力 - 不一致

获取原文

摘要

Effort-inconsistency is a situation where historical software project data used for software effort estimation (SEE) are contaminated by many project cases with similar characteristics but are completed with significantly different amount of effort. Using these data for SEE generally produces inaccurate results; however, an effective technique for its handling is yet made to be available. This study approaches the problem differently from common solutions, where available techniques typically attempt to remove every project case they have detected as outliers. Instead, we hypothesize that data inconsistency is caused by only a few deviant project cases and any attempt to remove those other cases will result in reduced accuracy, largely due to loss of useful information and data diversity. Filter-INC (short for Filtering technique for handling effort-INConsistency in SEE datasets) implements the hypothesis to decide whether a project case being detected by any existing technique should be subject to removal. The evaluation is carried out by comparing the performance of 2 filtering techniques between before and after having Filter-INC applied. The results produced from 8 real-world datasets together with 3 machine-learning models, and evaluated by 4 performance measures show a significant accuracy improvement at the confident interval of 95%. Based on the results, we recommend our proposed hypothesis as an important instrument to design a data preprocessing technique for handling effort-inconsistency in SEE datasets, definitely an important step forward in preprocessing data for a more accurate SEE model.
机译:努力不一致是用于软件努力估算(参见)的历史软件项目数据的情况被许多具有相似特征的许多项目案例污染,但是以明显不同的努力完成。使用这些数据通常会产生不准确的结果;然而,尚未提供处理的有效技术。本研究与共同解决方案不同地接近问题,其中可用技术通常尝试删除它们检测为异常值的每个项目案例。相反,我们假设数据不一致是由少数异常项目案例引起的,并且任何删除其他情况的尝试都会导致准确性降低,主要是由于有用信息和数据分集的丢失。 Filter-Inc(用于处理努力的滤波技术简短 - 在SEE DataSets中处理努力 - 不一致)实现了假设以决定是否应将任何现有技术检测到的项目案例应进行删除。通过比较在施用过滤器-Inc施用之前和之后的2个过滤技术的性能来进行评价。从8个现实世界数据集一起生产的结果与3个机器学习模型,并在4个性能措施评估,以95%的自信间隔显示出显着的准确性改善。根据结果​​,我们建议我们提出的假设作为设计数据预处理技术的重要仪器,以便在SEE DataSets中处理努力 - 不一致,绝对是预处理数据前进的重要步骤,以便更准确地看到模型。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号