首页> 外文期刊>Continental Shelf Research: A Companion Journal to Deep-Sea Research and Progress in Oceanography >Software defect prediction based on correlation weighted class association rule mining
【24h】

Software defect prediction based on correlation weighted class association rule mining

机译:基于相关加权类关联规则挖掘的软件缺陷预测

获取原文
获取原文并翻译 | 示例
获取外文期刊封面目录资料

摘要

Software defect prediction based on supervised learning plays a crucial role in guiding software testing for resource allocation. In particular, it is worth noticing that using associative classification with high accuracy and comprehensibility can predict defects. But owing to the imbalance data distribution inherent, it is easy to generate a large number of non-defective class association rules, but the defective class association rules are easily ignored. Furthermore, classical associative classification algorithms mainly measure the interestingness of rules by the occurrence frequency, such as support and confidence, without considering the importance of features, resulting in combinations of the insignificant frequent itemset. This promotes the generation of weighted associative classification. However, the feature weighting based on domain knowledge is subjective and unsuitable for a high dimensional dataset. Hence, we present a novel software defect prediction model based on correlation weighted class association rule mining (CWCAR). It leverages a multi-weighted supports-based framework rather than the traditional support-confidence approach to handle class imbalance and utilizes the correlation-based heuristic approach to assign feature weight. Besides, we also optimize the ranking, pruning and prediction stages based on weighted support. Results show that CWCAR is significantly superior to state-of-the-art classifiers in terms of Balance, MCC, and Gmean. (C) 2020 Elsevier B.V. All rights reserved.
机译:基于监督学习的软件缺陷预测在引导资源分配的软件测试中起着至关重要的作用。特别是值得注意的是,利用具有高精度和可理解性的关联分类可以预测缺陷。但由于不平衡的数据分布固有,很容易生成大量的非缺陷类关联规则,但缺陷类关联规则很容易忽略。此外,经典关联分类算法主要通过发生频率来测量规则的兴趣,例如支持和置信度,而不考虑特征的重要性,导致微不足道的频繁项目集的组合。这促进了加权联想分类的产生。然而,基于域知识的特征加权是主观的并且不适合高维数据集。因此,我们提出了一种基于相关权加权类关联规则挖掘(CWCAR)的新软件缺陷预测模型。它利用了基于多重支持的基于支持的框架,而不是传统的支持 - 置信方法来处理类别不平衡,并利用基于相关的启发式方法来分配特征权重。此外,我们还基于加权支持优化排名,修剪和预测阶段。结果表明,CWCAR在余额,MCC和Gmean方面明显优于最先进的分类器。 (c)2020 Elsevier B.V.保留所有权利。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号