首页> 外文期刊>Journal of Information & Optimization Sciences >ELM and KELM based software defect prediction using feature selection techniques
【24h】

ELM and KELM based software defect prediction using feature selection techniques

机译:使用特征选择技术的基于ELM和KELM的软件缺陷预测

获取原文
获取原文并翻译 | 示例
       

摘要

Context: Software defect prediction (SDP) models help in delivering a dependable and a genuine product to the clients. However, the performance of these models is affected by the presence of irrelevant features in the datasets. This problem is addressed by feature selection techniques. Objectives : (1) To determine the performance of feature selection based classification models in the context of software defect prediction, and (2) To determine if the removal of insignificant features makes a significant difference in the performance of the SDP models. Method: SDP models are built using two classifiers - Extreme learning machine (ELM) and Kernel based extreme learning machine (KELM) based on five wrapper and seven filter based feature selection techniques. Experiments are performed using seven datasets from the PROMISE repository. Testing accuracy is used for performance comparison of the feature selection based ELM and KELM defect classification models. Results : (1) ELM based classifiers achieved a higher testing accuracy with wrapper based feature selection methods while KELM classifiers performed better with filter based methods. (2) It is also found that even after eliminating over 85 percent of the attributes from the original software project data, the classification performance of the models is comparable before and after removing the insignificant features in most of the cases and it improved in very few experiments. Conclusion : With respect to the feature selection based defect classification, the performance of ELM and KELM based models is better with wrapper and filter based methods, respectively. Overall, a dimensionally reduced space does not significantly affect the prediction performance of the SDP models. In a way, it is indicated that the feature subsets obtained after removing the insignificant software metrics provide more significance to the output class.
机译:背景信息:软件缺陷预测(SDP)模型有助于为客户提供可靠且真实的产品。但是,这些模型的性能受到数据集中不相关特征的影响。通过特征选择技术解决了该问题。目标:(1)在软件缺陷预测的背景下确定基于特征选择的分类模型的性能,以及(2)确定删除无关紧要的功能是否对SDP模型的性能产生重大影响。方法:使用两个分类器建立SDP模型-极限学习机(ELM)和基于内核的极限学习机(KELM)基于五个包装器和七个基于过滤器的特征选择技术。使用PROMISE储存库中的七个数据集进行实验。测试精度用于基于特征选择的ELM和KELM缺陷分类模型的性能比较。结果:(1)使用基于包装器的特征选择方法,基于ELM的分类器获得了更高的测试准确性,而使用基于过滤器的方法,KELM分类器的性能更好。 (2)还发现,即使在从原始软件项目数据中删除了超过85%的属性之后,在大多数情况下,模型的分类性能在去除无关紧要的特征之前和之后都是可比的,并且在极少数情况下得到了改善实验。结论:对于基于特征选择的缺陷分类,基于ELM和KELM的模型的性能分别优于基于包装器和过滤器的方法。总体而言,缩小的空间不会显着影响SDP模型的预测性能。以某种方式表明,在删除无关紧要的软件度量之后获得的特征子集为输出类提供了更大的意义。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号