首页> 外文会议>IEEE International Conference on Software Engineering and Service Science >Utilizing cluster quality in hierarchical clustering for analogy-based software effort estimation
【24h】

Utilizing cluster quality in hierarchical clustering for analogy-based software effort estimation

机译:利用基于类比的软件工作估算的分层聚类中的群集质量

获取原文

摘要

Analogy-based software effort estimation is one of the most popular estimation methods. It is built upon the principle of case-based reasoning (CBR) based on the k-th similar projects completed in the past. Therefore the determination of the k value is crucial to the prediction performance. Various research have been carried out to use a single and fixed k value for experiments, and it is known that dynamically allocated k values in an experiment will produce the optimized performance. This paper proposes an interesting technique based on hierarchical clustering to produce a range for k through various cluster quality criteria. We find that complete linkage clustering is more suitable for large datasets while single linkage clustering is suitable for small datasets. The method searches for optimized k values based on the proposed heuristic optimization technique, which have the advantages of easy computation and optimized for the dataset being investigated. Datasets from the PROMISE repository have been used to evaluate the proposed technique. The results of the experiments show that the proposed method is able to determine an optimized set of k values for analogy-based prediction, and to give estimates that outperformed traditional models based on a fixed k value. The implication is significant in that the analogy-based model will be optimized according the dataset being used, without the need to ask an expert to determining a single, fixed k value.
机译:基于类比​​的软件努力估计是最受欢迎的估计方法之一。基于案例的推理原理(CBR)基于过去完成的K-TH类似项目而建立在基础上。因此,k值的确定对预测性能至关重要。已经进行了各种研究以使用单一和固定的k值进行实验,并且已知在实验中动态分配的k值将产生优化的性能。本文提出了一种基于分层聚类的有趣技术,通过各种群集质量标准生成K的范围。我们发现完整的链接群集更适合大型数据集,而单个链接群集适用于小型数据集。该方法根据所提出的启发式优化技术搜索优化的k值,这具有易于计算的优点,并针对所研究的数据集进行优化。来自Promise存储库的数据集已被用于评估所提出的技术。实验结果表明,该方法能够确定基于类比的预测的优化的K值集,并给出基于固定的k值表现优于传统模型的估计。这种含义在显着的是,基于类比的模型将根据正在使用的数据集进行优化,而无需要求专家确定单个固定的k值。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号