Combining Imbalance Learning Strategy and Multiclassifier Estimator for Bug Report Classification

Shikai Guo; Siwen Wang; Miaomiao Wei; Rong Chen; Chen Guo; Hui Li

首页> 外文期刊>Mathematical Problems in Engineering: Theory, Methods and Applications >Combining Imbalance Learning Strategy and Multiclassifier Estimator for Bug Report Classification

【24h】

Combining Imbalance Learning Strategy and Multiclassifier Estimator for Bug Report Classification

机译：组合不平衡学习策略和MultiClassifier估算器进行错误报告分类

获取原文

掌桥外文数据库（机构版） >>

开具论文收录证明 >>

文献代查 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

Since a large number of bug reports are submitted to the bug repository every day, efficiently assigning bug reports to the correct developer is a considerable challenge. Because of the large differences between the different components of different projects, the current bug classification mainly relies on the components of the bug report to dispatch bug reports to the designated developer or developer community. Unfortunately, the component information of the bug report is filled in by default according to the bug submitter and the result is often incorrect. Thus, an automatic technology that can identify high-impact bug reports can help developers to be aware of them early, rectify them quickly, and minimize the damages they cause. In this paper, we propose a method based on the combination of imbalanced learning strategies such as random undersampling (RUS), random oversampling (ROS), synthetic minority oversampling technique (SMOTE), and AdaCost algorithms with multiclass classification methods, OVO and OVA, to solve bug reports component classification problem. We investigate the effectiveness of different combinations, i.e., variants, each of which includes a specific imbalance learning strategy and a specific classification algorithm. We mainly perform an analytical study on five open bug repositories (Eclipse, Mozilla, GCC, OpenOffice, and NetBeans). The results show that different variants have different performance for bug reports component identification and the best performance variants are combined with the imbalanced learning strategy RUS and the OVA method based on the SVM classifier.

机译：由于每天将大量错误报告提交到Bug存储库，因此有效地将错误报告分配给正确的开发人员是一个相当大的挑战。由于不同项目的不同组件之间的巨大差异，目前的错误分类主要依赖于错误报告的组件，以向指定的开发人员或开发人员社区调度错误报告。遗憾的是，根据错误提交者默认情况下，错误报告的组件信息填写，结果通常不正确。因此，可以识别高影响力报告的自动技术可以帮助开发人员早期意识到它们，快速纠正它们，并最大限度地减少它们的损坏。在本文中，我们提出了一种基于非衡度学习策略的组合的方法，例如随机欠采样（RUS），随机过采样（ROS），合成少数群体过采样技术（SMITE），以及具有多款分类方法，OVO和OVA的adacost算法，解决错误报告组件分类问题。我们研究了不同组合，即变体的有效性，其中每个组合包括特定的不平衡学习策略和特定的分类算法。我们主要对五个开放式错误存储库（Eclipse，Mozilla，GCC，OpenOffice和NetBeans）进行分析研究。结果表明，不同的变体对错误报告组件识别具有不同的性能，并且最佳性能变体与基于SVM分类器的不平衡学习策略RUS和OVA方法相结合。

著录项

来源
《Mathematical Problems in Engineering: Theory, Methods and Applications》 |2020年第1期|共16页
作者
Shikai Guo; Siwen Wang; Miaomiao Wei; Rong Chen; Chen Guo; Hui Li;
展开▼
作者单位

展开▼
收录信息
原文格式 PDF
正文语种
中图分类
关键词

相似文献

外文文献
中文文献
专利

1. Surprise Bug Report Prediction Utilizing Optimized Integration with Imbalanced Learning Strategy [J] . Hui Li, Yang Qu, Shikai Guo, Complexity . 2020,第1期

机译：令人惊讶的错误报告预测利用了与不平衡学习策略的优化集成
2. High-Impact Bug Report Identification with Imbalanced Learning Strategies [J] . Xin-Li Yang, David Lo, Xin Xia, 计算机科学技术学报（英文版） . 2017,第001期

机译：具有不平衡学习策略的高影响力错误报告识别
3. LEARNING TO RANK AND CLASSIFICATION OF BUG REPORTS USING SVM AND FEATURE EVALUATION [J] . S.Rajeswari, S. Sharavanan, R.Vijai, International Journal on Smart Sensing and Intelligent Systems . 2017,第SPECIALaISSUE期

机译：使用SVM和特征评估学习对错误报告的排名和分类
4. Combining Deep Learning with Information Retrieval to Localize Buggy Files for Bug Reports [C] . An Ngoc Lam, Anh Tuan Nguyen, Hoan Anh Nguyen, IEEE/ACM International Conference on Automated Software Engineering . 2015

机译：将深度学习与信息检索结合起来，以本地化错误报告的错误文件
5. Learning to Rank Relevant Files for Bug Reports Using Domain knowledge, Replication and Extension of a Learning-to-Rank Approach [D] . Safdari, Nasir. 2018

机译：使用领域知识学习，对错误报告的相关文件进行排名，从学习到排名方法的复制和扩展
6. An Impartial Semi-Supervised Learning Strategy for Imbalanced Classification on VHR Images [O] . Fei Sun, Fang Fang, Run Wang, 2020

机译：VHR图像上不平衡分类的公正半监督学习策略
7. Combining Imbalance Learning Strategy and Multiclassifier Estimator for Bug Report Classification [O] . Shikai Guo, Siwen Wang, Miaomiao Wei, 2020

机译：组合不平衡学习策略和MultiClassifier估算器进行错误报告分类

Combining Imbalance Learning Strategy and Multiclassifier Estimator for Bug Report Classification

摘要

著录项

相似文献

相关主题

期刊订阅