首页> 外文期刊>International Journal of Performability Engineering >An Information Flow-based Feature Selection Method for Cross-Project Defect Prediction
【24h】

An Information Flow-based Feature Selection Method for Cross-Project Defect Prediction

机译:基于信息流的跨项目缺陷预测特征选择方法

获取原文
获取原文并翻译 | 示例
       

摘要

Software defect prediction (SDP) plays a significant part in identifying the most defect-prone modules before software testing and allocating limited testing resources. One of the most commonly used scenarios in SDP is classification. To guarantee the prediction accuracy, the classification models should first be trained appropriately. The training data could be obtained from historical software repositories, which may affect the performance of classification to a large extent. In order to improve the data quality, we propose a novel software feature selection method, which innovatively utilizes the information flows to perform causality analysis in the features of training datasets. More specifically, we conduct causality analysis between each feature metric and the labeled metric bug; then, based on the obtained feature ranking list, we select the top-k features to control redundancy. Finally, we choose the most suitable feature subset based on the F-measure. To demonstrate the effectiveness and practicability of the feature selection method, we select the Nearest Neighbor approach to construct a homogeneous training dataset, and utilize three commonly used classification models to implement comparison experiments. The final experimental results have verified the availability and validity of the feature selection method.
机译:软件缺陷预测(SDP)在软件测试和分配有限的测试资源之前识别最缺陷的易于模块的重要组成部分。 SDP中最常用的场景之一是分类。为了保证预测准确性,首先应适当培训分类模型。可以从历史软件存储库获得培训数据,这可能会在很大程度上影响分类的性能。为了提高数据质量,我们提出了一种新颖的软件特征选择方法,它创新利用信息流在训练数据集的特征中执行因果关系分析。更具体地,我们在每个特征度量和标记的度量标准错误之间进行因果关系分析;然后,基于所获得的特征排名列表,我们选择Top-K功能以控制冗余。最后,我们基于F测量选择最合适的特征子集。为了证明特征选择方法的有效性和实用性,我们选择最接近的邻近方法来构建一个均匀的训练数据集,并利用三种常用的分类模型来实现比较实验。最终的实验结果已经验证了特征选择方法的可用性和有效性。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号