Filter-INC: Handling Effort-Inconsistency in Software Effort Estimation Datasets

机译：Filter-Inc：处理软件工作估算数据集的努力 - 不一致

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

Effort-inconsistency is a situation where historical software project data used for software effort estimation (SEE) are contaminated by many project cases with similar characteristics but are completed with significantly different amount of effort. Using these data for SEE generally produces inaccurate results; however, an effective technique for its handling is yet made to be available. This study approaches the problem differently from common solutions, where available techniques typically attempt to remove every project case they have detected as outliers. Instead, we hypothesize that data inconsistency is caused by only a few deviant project cases and any attempt to remove those other cases will result in reduced accuracy, largely due to loss of useful information and data diversity. Filter-INC (short for Filtering technique for handling effort-INConsistency in SEE datasets) implements the hypothesis to decide whether a project case being detected by any existing technique should be subject to removal. The evaluation is carried out by comparing the performance of 2 filtering techniques between before and after having Filter-INC applied. The results produced from 8 real-world datasets together with 3 machine-learning models, and evaluated by 4 performance measures show a significant accuracy improvement at the confident interval of 95%. Based on the results, we recommend our proposed hypothesis as an important instrument to design a data preprocessing technique for handling effort-inconsistency in SEE datasets, definitely an important step forward in preprocessing data for a more accurate SEE model.

机译：努力不一致是用于软件努力估算（参见）的历史软件项目数据的情况被许多具有相似特征的许多项目案例污染，但是以明显不同的努力完成。使用这些数据通常会产生不准确的结果;然而，尚未提供处理的有效技术。本研究与共同解决方案不同地接近问题，其中可用技术通常尝试删除它们检测为异常值的每个项目案例。相反，我们假设数据不一致是由少数异常项目案例引起的，并且任何删除其他情况的尝试都会导致准确性降低，主要是由于有用信息和数据分集的丢失。 Filter-Inc（用于处理努力的滤波技术简短 - 在SEE DataSets中处理努力 - 不一致）实现了假设以决定是否应将任何现有技术检测到的项目案例应进行删除。通过比较在施用过滤器-Inc施用之前和之后的2个过滤技术的性能来进行评价。从8个现实世界数据集一起生产的结果与3个机器学习模型，并在4个性能措施评估，以95％的自信间隔显示出显着的准确性改善。根据结果，我们建议我们提出的假设作为设计数据预处理技术的重要仪器，以便在SEE DataSets中处理努力 - 不一致，绝对是预处理数据前进的重要步骤，以便更准确地看到模型。

著录项

来源
《Asia-Pacific Software Engineering Conference》|2016年|403p|共8页
会议地点
作者
Passakorn Phannachitta; Jacky Keung; Kwabena Ebo Bennin; Akito Monden; Kenichi Matsumoto;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类 TP311.5-53;
关键词
Software; Estimation; Software engineering; Information filters; Data models;

机译：软件;估计;软件工程;信息过滤器;数据模型;

相似文献

外文文献
中文文献
专利

1. Analyzing the Stationarity Process in Software Effort Estimation Datasets [J] . Michael Franklin Bosu, Stephen G. MacDonell, Peter A. Whigham International journal of software engineering and knowledge engineering . 2020,第11a12期

机译：分析软件工作估算数据集的实体性过程
2. Experience: Quality Benchmarking of Datasets Used in Software Effort Estimation [J] . MICHAEL F. BOSU, STEPHEN G. MACDONELL ACM journal of data and information quality . 2019,第4期

机译：体验：软件工作中使用的数据集质量基准测试
3. Active Learning and Effort Estimation: Finding the Essential Content of Software Effort Estimation Data [J] . Kocaguneli, Ekrem, Menzies, IEEE Transactions on Software Engineering . 2013,第8期

机译：主动学习和工作量估算：查找软件工作量估算数据的基本内容
4. Filter-INC: Handling Effort-Inconsistency in Software Effort Estimation Datasets [C] . Passakorn Phannachitta, Jacky Keung, Kwabena Ebo Bennin, Asia-Pacific Software Engineering Conference . 2016

机译：Filter-INC：处理软件工作量估算数据集中的工作量不一致
5. Software effort estimation accuracy: A comparative study of estimations based on software sizing and development methods. [D] . Lafferty, Mark T. 2010

机译：软件工作量估计准确性：基于软件大小和开发方法的估计的比较研究。
6. Software Development Effort Estimation Using Regression Fuzzy Models [O] . Ali Bou Nassif, Mohammad Azzeh, Ali Idri, 2019

机译：基于回归模糊模型的软件开发工作量估算
7. KOMPARASI METODE KOMBINASI SELEKSI FITUR DAN MACHINE LEARNING K-NEAREST NEIGHBOR PADA DATASET LABEL HOURS SOFTWARE EFFORT ESTIMATION [O] . Indra Kurniawan, Ahmad Faiq Abror 2019

机译：比较组合方法选择特征和机器学习K-Collect邻居在Label Label Houth软件工作估算数据集

Filter-INC: Handling Effort-Inconsistency in Software Effort Estimation Datasets

摘要

著录项

相似文献

相关主题

期刊订阅