Cost-Aware Clustering of Bug Reports by Using a Genetic Algorithm

Lee Jaekwon; Kim Dongsun; Jung Woosung

首页> 外文期刊>Journal of Information Recording >Cost-Aware Clustering of Bug Reports by Using a Genetic Algorithm

【24h】

Cost-Aware Clustering of Bug Reports by Using a Genetic Algorithm

机译：使用遗传算法的错误报告的成本感知聚类

获取原文

获取原文并翻译 | 示例

掌桥外文数据库（机构版） >>

开具论文收录证明 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

The inefficient distribution of bugs to developers is increasing the cost of software development and maintenance. In efforts to tackle this issue, various studies have been carried out to recommend suitable developers for specific bugs. These studies often leverage similarity between bug reports; for example, if a developer addressed a bug report similar to a newly incoming report, that developer can be suitable to fix the bug described in the new report. However, the existing studies have resulted in imbalanced distribution - a large number of bugs can be concentrated in a small number of developers. In this paper, we propose a novel approach to achieve a cost-aware distribution of bug reports to support workload balancing. Our approach is composed of two phases. First, a set of similar report groups composed of strongly related bugs is generated based on their similarity and dependency. Clusters are then created by grouping the similar report groups so that each cluster can have similar cost (i.e., minimizing its standard deviation). Our approach leverages a genetic algorithm to find a near-optimal distribution of bug reports because it is an NP-hard problem. The experiments with 1,047 bug reports collected from Mozilla's Firefox were conducted to evaluate our approach. The results showed that our approach effectively provides an appropriate solution to achieve a cost-balanced distribution of bug reports. In addition, we carried out a user study targeting 30 developers from 15 companies to figure out the usefulness and effectiveness of our approach. Among the participants, 67% answered that our approach is useful for triaging their bugs to developers. This shows the possibility for use in cases of managing or triaging bugs from the project manager's perspective.

机译：错误地将错误分发给开发人员会增加软件开发和维护的成本。为了解决这个问题，已经进行了各种研究以针对特定的错误推荐合适的开发人员。这些研究经常利用错误报告之间的相似性。例如，如果开发人员处理的错误报告类似于新收到的报告，则该开发人员可能适合修复新报告中描述的错误。但是，现有研究导致分布不平衡-大量错误可能集中在少数开发人员中。在本文中，我们提出了一种新颖的方法来实现错误报告的成本意识分布，以支持工作负载平衡。我们的方法包括两个阶段。首先，根据它们的相似性和依赖性生成一组由高度相关的错误组成的相似报告组。然后，通过对相似的报告组进行分组来创建聚类，以便每个聚类可以具有相似的成本（即，最小化其标准偏差）。我们的方法利用遗传算法来查找错误报告的最佳分布，因为它是NP难题。进行了从Mozilla Firefox收集的1,047个错误报告的实验，以评估我们的方法。结果表明，我们的方法有效地提供了适当的解决方案，以实现错误报告的成本平衡分发。此外，我们针对15家公司的30名开发人员进行了一项用户研究，以了解该方法的有用性和有效性。在参与者中，有67％的人回答说我们的方法有助于将他们的错误分类给开发人员。从项目经理的角度来看，这显示了在管理或分类错误的情况下使用的可能性。

著录项

来源
《Journal of Information Recording》 |2019年第1期|175-200|共26页
作者
Lee Jaekwon; Kim Dongsun; Jung Woosung;
展开▼
作者单位

Chungbuk Natl Univ Dept Comp Engn Cheongju 28644 South Korea;

Univ Luxembourg Interdisciplinary Ctr Secur Reliabil & Trust Kirchberg 4365 Luxembourg;

Seoul Natl Univ Educ Grad Sch Educ Seoul 06639 South Korea;

展开▼
收录信息
原文格式 PDF
正文语种 eng
中图分类
关键词
bug report; mining software repositories; bug triage; genetic algorithm; assignment optimization;

机译：错误报告;挖掘软件仓库;错误分类;遗传算法作业优化;
入库时间 2022-08-18 04:33:22

相似文献

外文文献
中文文献
专利

1. Cost-aware triage ranking algorithms for bug reporting systems [J] . Park Jin-woo, Lee Mu-Woong, Kim Jinhan, Knowledge and information systems . 2016,第3期

机译：错误报告系统的成本感知分类分类算法
2. Proposal of a cannibalism bug-based search strategy using genetic algorithms (C-BUGS) and its application to multiobjective optimization problems [J] . Keiichiro Yasuda, Osamu Yamazaki, Takao Watanabe Electrical engineering in Japan . 2002,第1期

机译：基于食人错误的基于遗传算法（C-BUGS）的搜索策略的建议及其在多目标优化问题中的应用
3. Unsupervised Bug Report Categorization Using Clustering and Labeling Algorithm [J] . Nachai Limsettho, Hideaki Hata, Akito Monden, International journal of software engineering and knowledge engineering . 2016,第7期

机译：使用聚类和标记算法的无监督错误报告分类
4. CosTriage: A Cost-Aware Triage Algorithm for Bug Reporting Systems [C] . Jin-woo Park, Mu-Woong Lee, Jinhan Kim, Innovative applications of artificial intelligence conference;AAAI conference on artificial intelligence;IAAI-11;Symposium on educational advances in artificial intelligence;AAAI-11;EAAI-11 . 2011

机译：CosTriage：错误报告系统的成本感知分类算法
5. Improving the standard ant clustering algorithm using genetic algorithms. [D] . AlFraihi, Mohammed Hamad. 2014

机译：使用遗传算法改进标准蚂蚁聚类算法。
6. Does Determination of Initial Cluster Centroids Improve the Performance of K-Means Clustering Algorithm? Comparison of Three Hybrid Methods by Genetic Algorithm Minimum Spanning Tree and Hierarchical Clustering in an Applied Study [O] . Saeedeh Pourahmad, Atefeh Basirat, Amir Rahimi, 2020

机译：初始簇质心的确定是否提高了K-Means聚类算法的性能？应用研究中遗传算法最小生成树和分层聚类的三种混合方法的比较
7. Proposal of Cannibalism Bug Based Search Strategy using Genetic Algorithms (C-BUGS) and Its Application to Multi-Objective Optimization Problem [O] . Keiichiro Yasuda, Osamu Yamazaki, Takao Watanabe 2000

机译：基于同类的搜索策略使用遗传算法（C-BUG）的基于同类的搜索策略的提议及其在多目标优化问题中的应用

Cost-Aware Clustering of Bug Reports by Using a Genetic Algorithm

摘要

著录项

相似文献

相关主题

期刊订阅