WRAPPER-BASED FEATURE RANKING TECHNIQUES FOR DETERMINING RELEVANCE OF SOFTWARE ENGINEERING METRICS

WILKER ALTIDOR; TAGHI M. KHOSHGOFTAARt; KEHAN GAO

首页> 外文期刊>International Journal of Reliability, Quality and Safety Engineering >WRAPPER-BASED FEATURE RANKING TECHNIQUES FOR DETERMINING RELEVANCE OF SOFTWARE ENGINEERING METRICS

【24h】

WRAPPER-BASED FEATURE RANKING TECHNIQUES FOR DETERMINING RELEVANCE OF SOFTWARE ENGINEERING METRICS

机译：用于确定软件工程指标相关性的基于包装器的特征排序技术

获取原文

获取原文并翻译 | 示例

获取外文期刊封面封底 >>

开具论文收录证明 >>

文献代查 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

Classification, an important data mining function that assigns class label to items in a collection, is of practical applications in various domains. In software engineering, for instance, a common classification problem is to determine the quality of a software item. In such a problem, software metrics represent the independent features while the fault proneness represents the class label. With many classification problems, one must often deal with the presence of irrelevant features in the feature space. That, coupled with class imbalance, renders the task of discriminating one class from another rather difficult. In this study, we empirically evaluate our proposed wrapper-based feature ranking where nine performance metrics aided by a particular learner and a methodology are considered. We examine five learners and take three different approaches, each in conjunction with one of three different methodologies: 3-fold Cross-Validation, 3-fold Cross-Validation Risk Impact, and a combination of the two. In this study, we consider two sets of software engineering datasets. To evaluate the classifier performance after feature selection has been applied, we use Area Under Receiver Operating Characteristic curve as the performance evaluator. We investigate the performance of feature selection as we vary the three factors that form the foundation of the wrapper-based feature ranking. We show that the performance is conditioned by not only the choice of methodology but also the learner. We also evaluate the effect of sampling on wrapper-based feature ranking. Finally, we provide guidance as to which software metrics are relevant in software defect prediction problems and how the number of software metrics can be selected when using wrapper-based feature ranking.

机译：分类是一种重要的数据挖掘功能，可将类别标签分配给集合中的项目，在各个领域都有实际应用。例如，在软件工程中，常见的分类问题是确定软件项目的质量。在这样的问题中，软件指标代表独立的功能，而故障倾向代表类标签。对于许多分类问题，必须经常处理特征空间中不相关特征的存在。这加上阶级的不平衡，使区分一个阶级与另一个阶级的任务变得相当困难。在这项研究中，我们根据经验评估了我们提出的基于包装的特征排名，其中考虑了由特定学习者和方法论辅助的九种性能指标。我们检查了五个学习者，并采取三种不同的方法，每种方法都与三种不同的方法之一结合：3倍交叉验证，3倍交叉验证风险影响以及两者的结合。在这项研究中，我们考虑了两组软件工程数据集。为了在应用特征选择之后评估分类器的性能，我们使用“接收器工作区域下的特征曲线”作为性能评估器。当我们改变构成基于包装器的特征排名基础的三个因素时，我们将研究特征选择的性能。我们表明，绩效不仅取决于方法论的选择，而且还取决于学习者。我们还评估了采样对基于包装的特征排名的影响。最后，我们提供有关哪些软件指标与软件缺陷预测问题相关的指南，以及在使用基于包装的特征排名时如何选择软件指标的数量的指南。

著录项

来源
《International Journal of Reliability, Quality and Safety Engineering》 |2010年第5期|p.425-464|共40页
作者
WILKER ALTIDOR; TAGHI M. KHOSHGOFTAARt; KEHAN GAO;
展开▼
作者单位

Department of Computer Science and Engineering Florida Atlantic University, 777 Glades Road Boca Raton, Florida 33431, USA;

rnDepartment of Computer Science and Engineering Florida Atlantic University, 777 Glades Road Boca Raton, Florida 33431, USA;

rnDepartment of Mathematics and Computer Science Eastern Connecticut State University, 83 Windham Street Willimantic, Connecticut 06226, USA;

展开▼
收录信息
原文格式 PDF
正文语种 eng
中图分类
关键词
feature selection; wrapper-based feature ranking; ranker aid; performance measures; sampling techniques; software metrics;

机译：特征选择;基于包装的功能排名;等级援助;绩效指标;抽样技术;软件指标;

相似文献

外文文献
中文文献
专利

1. A COMPARATIVE STUDY OF FILTER-BASED AND WRAPPER-BASED FEATURE RANKING TECHNIQUES FOR SOFTWARE QUALITY MODELING [J] . TAGHI M. KHOSHGOFTAAR, KEHAN GAO, LOFTON A. BULLARD International Journal of Reliability, Quality and Safety Engineering . 2011,第4期

机译：软件质量建模的基于过滤器和基于包装器的特征排序技术的比较研究
2. Ranking of software engineering metrics by fuzzy-based matrix methodology [J] . R. K. Garg, H Kapil Sharma, C. K. Nagpal, Software Testing, Verification and Reliability . 2013,第2期

机译：基于模糊矩阵方法的软件工程指标排名
3. AN EMPIRICAL STUDY OF FEATURE RANKING TECHNIQUES FOR SOFTWARE QUALITY PREDICTION [J] . TAGHI M. KHOSHGOFTAAR, KEHAN GAO, AMRI NAPOLITANO International journal of software engineering and knowledge engineering . 2012,第2期

机译：软件质量预测的特征排名技术的实证研究
4. Wrapper-Based Feature Ranking for Software Engineering Metrics [C] . Altidor Wilker, Khoshgoftaar Taghi M., Napolitano Amri Machine Learning and Applications, 2009. ICMLA '09 . 2009

机译：基于包装器的软件工程指标功能排名
5. Features Ranking Techniques for Single Nucleotide Polymorphism Data [D] . Abounada, Mohanad Feisal M. H. 2017

机译：具有单核苷酸多态性数据的排名技术
6. CNFE-SE: a novel approach combining complex network-based feature engineering and stacked ensemble to predict the success of intrauterine insemination and ranking the features [O] . Sima Ranjbari, Toktam Khatibi, Ahmad Vosough Dizaji, 2021

机译：CNFE-SE：一种组合复杂的基于网络的特征工程和堆叠合奏的新方法以预测宫内授精和排名的成功
7. Exploring Software Quality Classification with a Wrapper-Based Feature Ranking Technique [O] . Kehan Gao, Taghi Khoshgoftaar, Amri Napolitano 2015

机译：利用基于包装器的特征排序技术探索软件质量分类

WRAPPER-BASED FEATURE RANKING TECHNIQUES FOR DETERMINING RELEVANCE OF SOFTWARE ENGINEERING METRICS

摘要

著录项

相似文献

相关主题

期刊订阅