Code smell detection using feature selection and stacking ensemble: An empirical investigation

Alazba Amal; Aljamaan Hamoud




Abstract

Context: Code smell detection is the process of identifying pieces of code that are poorly designed and implemented. Recently, more research has been directed towards machine learning-based approaches to code smell detection. Many classifiers have been explored in the literature, yet an effective model for detecting different code smell types has not been found.

Objective: The main objective of this paper is to empirically investigate the capabilities of a stacking heterogeneous ensemble model in code smell detection.

Methods: The Gain feature selection technique was applied to select relevant features for code smell detection. The detection performance of 14 individual classifiers was investigated in the context of two class-level and four method-level code smells. Then, three stacking ensembles were built using all individual classifiers as base classifiers and three different meta-classifiers (LR, SVM and DT).

Results: The GP, MLP, DT and SVM(Lin) classifiers were among the best performing classifiers in detecting most of the code smells. On the other hand, SVM(Sig), NB(B), NB(M), and SGD were among the least accurate classifiers for most smell types. The stacking ensembles with LR and SVM meta-classifiers achieved consistently high detection performance on class-level and method-level code smells compared to all individual models.

Conclusion: This paper concludes that the detection performance of the majority of individual classifiers varied from one code smell type to another. However, the detection performance of the stacking ensembles with LR and SVM meta-classifiers was consistently superior to that of all individual classifiers in detecting different code smell types.
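The pipeline described under Methods (feature selection, then heterogeneous base classifiers combined by a meta-classifier) can be sketched roughly as follows with scikit-learn. This is only an illustration under stated assumptions, not the authors' implementation: the data is synthetic, mutual information stands in for the gain-based feature selection, only three base classifiers are used instead of the paper's 14, and LR, SVM and DT are taken here to mean logistic regression, a support vector machine and a decision tree. The helper name `build_stacking_pipeline` and the parameter `k_features` are hypothetical.

```python
from functools import partial

from sklearn.datasets import make_classification
from sklearn.ensemble import StackingClassifier
from sklearn.feature_selection import SelectKBest, mutual_info_classif
from sklearn.linear_model import LogisticRegression
from sklearn.model_selection import cross_val_score
from sklearn.pipeline import make_pipeline
from sklearn.preprocessing import StandardScaler
from sklearn.naive_bayes import GaussianNB
from sklearn.svm import SVC
from sklearn.tree import DecisionTreeClassifier

# Synthetic stand-in for a table of code metrics (one row per class or method,
# label 1 = smelly, 0 = not smelly). This is NOT the paper's dataset.
X, y = make_classification(
    n_samples=300, n_features=30, n_informative=8, random_state=0
)


def build_stacking_pipeline(meta_classifier, k_features=10):
    """Scale, keep the k best-scoring features, then stack base classifiers."""
    # Assumption: a small subset of base learners; the paper uses 14.
    base_classifiers = [
        ("dt", DecisionTreeClassifier(random_state=0)),
        ("nb", GaussianNB()),
        ("svm_lin", SVC(kernel="linear", random_state=0)),
    ]
    stack = StackingClassifier(
        estimators=base_classifiers,
        final_estimator=meta_classifier,
        cv=5,  # out-of-fold base predictions are used to train the meta-classifier
    )
    # Assumption: mutual information as a proxy for gain-based feature ranking.
    selector = SelectKBest(
        score_func=partial(mutual_info_classif, random_state=0), k=k_features
    )
    return make_pipeline(StandardScaler(), selector, stack)


# One stacking ensemble per meta-classifier, mirroring the three variants above.
meta_classifiers = {
    "LR": LogisticRegression(max_iter=1000),
    "SVM": SVC(kernel="linear", random_state=0),
    "DT": DecisionTreeClassifier(random_state=0),
}

scores = {}
for name, meta in meta_classifiers.items():
    pipeline = build_stacking_pipeline(meta)
    scores[name] = cross_val_score(pipeline, X, y, cv=5, scoring="accuracy").mean()

for name, score in scores.items():
    print(f"stacking with {name} meta-classifier: mean accuracy = {score:.3f}")
```

Placing the selector inside the pipeline means features are re-selected on each training fold, which avoids leaking test-fold information into the feature ranking. The accuracies printed on synthetic data say nothing about the results reported in the paper.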


Bibliographic details

Source
Information and Software Technology | 2021, Issue 10 | 106648.1-106648.14 | 14 pages
Authors
Alazba Amal; Aljamaan Hamoud
Author affiliations

King Fahd Univ Petr & Minerals Informat & Comp Sci Dept Dhahran Saudi Arabia|King Saud Univ Dept Informat Syst Riyadh Saudi Arabia;

King Fahd Univ Petr & Minerals Informat & Comp Sci Dept Dhahran Saudi Arabia;

Original format: PDF
Language: English
Keywords
Code smell detection; Classification; Ensemble learning; Stacking; Feature selection;


