Computer Science Journal of Moldova

The impact of parameter optimization of ensemble learning on defect prediction



Abstract

Machine learning algorithms have configurable parameters that practitioners generally leave at their default settings. Modifying the parameters of a machine learning algorithm is called hyperparameter optimization (HO), and it is performed to find the most suitable parameter setting in classification experiments. Existing studies propose using either the default classification model or an optimal parameter configuration. This work investigates the effect of applying HO to ensemble learning algorithms in terms of defect prediction performance. Further, this paper presents a new ensemble learning algorithm for defect prediction data sets, called novelEnsemble. The method has been tested on 27 data sets and compared with three alternatives. Welch's heteroscedastic F test is used to examine the difference between performance parameters, and Cliff's delta is applied to the results of the compared algorithms to control the magnitude of the difference. According to the results of the experiment: 1) ensemble methods featuring HO perform better than a single predictor; 2) although the error of triTraining decreases linearly, it still produces errors at an unacceptable level; 3) novelEnsemble yields promising results, especially in terms of the area under the curve (AUC) and the Matthews correlation coefficient (MCC); 4) the effect of HO does not stagnate with the scale of the data set; 5) not every ensemble learning approach creates a favorable effect on HO. To demonstrate the importance of the hyperparameter selection process, the experiment is validated with suitable statistical analyses. The study reveals that the success of HO, contrary to expectations, depends not on the type of the classifiers but rather on the design of the ensemble learners.
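
The abstract does not disclose the paper's tuning setup or the internals of novelEnsemble, so the sketch below only illustrates the general idea of HO on an ensemble learner: a grid search over a random forest's parameters, scored with the AUC and MCC metrics named above. The data set, the parameter grid, and the choice of scikit-learn are assumptions for illustration, not the authors' method.

import numpy as np
from sklearn.datasets import make_classification
from sklearn.ensemble import RandomForestClassifier
from sklearn.model_selection import GridSearchCV, cross_val_predict
from sklearn.metrics import roc_auc_score, matthews_corrcoef

# Stand-in for a defect prediction data set (features -> defective yes/no);
# the class imbalance mimics the rarity of defective modules.
X, y = make_classification(n_samples=500, n_features=20, weights=[0.8],
                           random_state=42)

# Hyperparameter optimization: search the ensemble's parameter space
# instead of accepting the library defaults (a hypothetical grid).
grid = {
    "n_estimators": [50, 100, 200],
    "max_depth": [None, 5, 10],
    "max_features": ["sqrt", "log2"],
}
search = GridSearchCV(RandomForestClassifier(random_state=42),
                      grid, scoring="roc_auc", cv=5)
search.fit(X, y)

# Evaluate the tuned ensemble with the two metrics highlighted above.
best = search.best_estimator_
proba = cross_val_predict(best, X, y, cv=5, method="predict_proba")[:, 1]
print("best params:", search.best_params_)
print("AUC:", roc_auc_score(y, proba))
print("MCC:", matthews_corrcoef(y, (proba >= 0.5).astype(int)))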
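Likewise, a minimal sketch of the two statistics used in the comparison, Welch's heteroscedastic F test and Cliff's delta, assuming vectors of per-data-set performance scores. The score values below are synthetic placeholders, not the paper's results.

import numpy as np
from scipy.stats import f as f_dist

def welch_anova(*groups):
    """Welch's heteroscedastic F test for k independent groups."""
    k = len(groups)
    n = np.array([len(g) for g in groups], dtype=float)
    means = np.array([np.mean(g) for g in groups])
    var = np.array([np.var(g, ddof=1) for g in groups])
    w = n / var                            # precision weights
    mw = np.sum(w * means) / np.sum(w)     # weighted grand mean
    a = np.sum(w * (means - mw) ** 2) / (k - 1)
    tmp = np.sum((1 - w / np.sum(w)) ** 2 / (n - 1))
    b = 1 + 2 * (k - 2) / (k ** 2 - 1) * tmp
    f_stat = a / b
    df1, df2 = k - 1, (k ** 2 - 1) / (3 * tmp)
    return f_stat, f_dist.sf(f_stat, df1, df2)  # statistic, p-value

def cliffs_delta(x, y):
    """Cliff's delta: effect size in [-1, 1] from all pairwise comparisons."""
    x, y = np.asarray(x), np.asarray(y)
    gt = np.sum(x[:, None] > y[None, :])
    lt = np.sum(x[:, None] < y[None, :])
    return (gt - lt) / (len(x) * len(y))

# Hypothetical AUC scores of two methods over 27 data sets.
rng = np.random.default_rng(0)
a = rng.normal(0.78, 0.05, 27)
b = rng.normal(0.74, 0.08, 27)
print(welch_anova(a, b))   # is the performance difference significant?
print(cliffs_delta(a, b))  # how large is the difference?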
