首页> 外文期刊>Technical Gazette >Sampling imbalance dataset for software defect prediction using hybrid neuro-fuzzy systems with Naive Bayes classifier
【24h】

Sampling imbalance dataset for software defect prediction using hybrid neuro-fuzzy systems with Naive Bayes classifier

机译:使用带有朴素贝叶斯分类器的混合神经模糊系统进行软件缺陷预测的采样不平衡数据集

获取原文
           

摘要

Software defect prediction (SDP) is a process with difficult tasks in the case of software projects. The SDP process is useful for the identification and location of defects from the modules. This task will tend to become more costly with the addition of complex testing and evaluation mechanisms, when the software project modules size increases. Further measurement of software in a consistent and disciplined manner offers several advantages like accuracy in the estimation of project costs and schedules, and improving product and process qualities. Detailed analysis of software metric data also gives significant clues about the locations of possible defects in a programming code. The main goal of this proposed work is to introduce software defects detection and prevention methods for identifying defects from software using machine learning approaches. This proposed work used imbalanced datasets from NASA’s Metrics Data Program (MDP) and software metrics of datasets are selected by using Genetic algorithm with Ant Colony Optimization (GACO) method. The sampling process with semi supervised learning Modified Co Forest method generates the balanced labelled using imbalanced datasets, which is used for efficient software defect detection process with machine learning Hybrid Neuro-Fuzzy Systems with Naive Bayes methods. The experimental results of this proposed method proves that this defect detecting machine learning method yields more efficiency and better performance in defect prediction result of software in comparison with the other available methods.
机译:对于软件项目,软件缺陷预测(SDP)是一个任务艰巨的过程。 SDP过程对于从模块识别和定位缺陷很有用。当软件项目模块的大小增加时,通过添加复杂的测试和评估机制,该任务的成本往往会更高。以一致且有纪律的方式对软件进行进一步的测量具有多个优势,例如估算项目成本和进度的准确性以及改进产品和过程的质量。对软件度量数据的详细分析还提供了有关编程代码中可能的缺陷位置的重要线索。这项拟议工作的主要目的是介绍使用机器学习方法从软件中识别缺陷的软件缺陷检测和预防方法。这项拟议的工作使用了来自NASA度量数据程序(MDP)的不平衡数据集,并使用遗传算法和蚁群优化(GACO)方法选择了数据集的软件度量。使用半监督学习的改进Co Forest方法进行的采样过程使用不平衡的数据集生成平衡的标签,该数据用于通过机器学习的混合神经模糊系统和朴素贝叶斯方法进行有效的软件缺陷检测过程。该方法的实验结果证明,与其他现有方法相比,该缺陷检测机器学习方法在软件的缺陷预测结果中具有更高的效率和更好的性能。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号