...
首页> 外文期刊>Software Quality Journal >An empirical study on predictability of software maintainability using imbalanced data
【24h】

An empirical study on predictability of software maintainability using imbalanced data

机译:使用不平衡数据的软件可维护性可预测性的实证研究

获取原文
获取原文并翻译 | 示例
           

摘要

In software engineering predictive modeling, early prediction of software modules or classes that possess high maintainability effort is a challenging task. Many prediction models are constructed to predict the maintainability of software classes or modules by applying various machine learning (ML) techniques. If the software modules or classes need high maintainability, effort would be reduced in a dataset, and there would be imbalanced data to train the model. The imbalanced datasets make ML techniques bias their predictions towards low maintainability effort or majority classes, and minority class instances get discarded as noise by the machine learning (ML) techniques. In this direction, this paper presents empirical work to improve the performance of software maintainability prediction (SMP) models developed with ML techniques using imbalanced data. For developing the models, the imbalanced data is pre-processed by applying data resampling methods. Fourteen data resampling methods, including oversampling, undersampling, and hybrid resampling, are used in the study. The study results recommend that the safe-level synthetic minority oversampling technique (Safe-Level-SMOTE) is a useful method to deal with the imbalanced datasets and to develop competent prediction models to forecast software maintainability.
机译:在软件工程预测建模中,软件模块的早期预测具有高易于培养性的软件模块或课程是一个具有挑战性的任务。构造许多预测模型以通过应用各种机器学习(ML)技术来预测软件类或模块的可维护性。如果软件模块或类需要高的可维护性,则在数据集中将减少努力,并且将有不平衡的数据培训模型。不平衡的数据集使ML技术偏向其对低维护力量或多数类的预测,并且通过机器学习(ML)技术被丢弃为噪音。朝着这个方向,本文提出了提高使用ML技术开发的软件可维护性预测(SMP)模型的实证工作,使用ML技术使用不平衡数据。为了开发模型,通过应用数据重采样方法预处理不平衡数据。在研究中使用了十四个数据重采样方法,包括过采样,欠采样和混合重采样。该研究结果建议,安全级合成少数群体过采样技术(安全级别 - SMOTE)是处理不平衡数据集的有用方法,并开发能力预测模型以预测软件可维护性。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号