首页> 外文期刊>Journal of Information Recording >Cost-Aware Clustering of Bug Reports by Using a Genetic Algorithm
【24h】

Cost-Aware Clustering of Bug Reports by Using a Genetic Algorithm

机译:使用遗传算法的错误报告的成本感知聚类

获取原文
获取原文并翻译 | 示例
       

摘要

The inefficient distribution of bugs to developers is increasing the cost of software development and maintenance. In efforts to tackle this issue, various studies have been carried out to recommend suitable developers for specific bugs. These studies often leverage similarity between bug reports; for example, if a developer addressed a bug report similar to a newly incoming report, that developer can be suitable to fix the bug described in the new report. However, the existing studies have resulted in imbalanced distribution - a large number of bugs can be concentrated in a small number of developers. In this paper, we propose a novel approach to achieve a cost-aware distribution of bug reports to support workload balancing. Our approach is composed of two phases. First, a set of similar report groups composed of strongly related bugs is generated based on their similarity and dependency. Clusters are then created by grouping the similar report groups so that each cluster can have similar cost (i.e., minimizing its standard deviation). Our approach leverages a genetic algorithm to find a near-optimal distribution of bug reports because it is an NP-hard problem. The experiments with 1,047 bug reports collected from Mozilla's Firefox were conducted to evaluate our approach. The results showed that our approach effectively provides an appropriate solution to achieve a cost-balanced distribution of bug reports. In addition, we carried out a user study targeting 30 developers from 15 companies to figure out the usefulness and effectiveness of our approach. Among the participants, 67% answered that our approach is useful for triaging their bugs to developers. This shows the possibility for use in cases of managing or triaging bugs from the project manager's perspective.
机译:错误地将错误分发给开发人员会增加软件开发和维护的成本。为了解决这个问题,已经进行了各种研究以针对特定的错误推荐合适的开发人员。这些研究经常利用错误报告之间的相似性。例如,如果开发人员处理的错误报告类似于新收到的报告,则该开发人员可能适合修复新报告中描述的错误。但是,现有研究导致分布不平衡-大量错误可能集中在少数开发人员中。在本文中,我们提出了一种新颖的方法来实现错误报告的成本意识分布,以支持工作负载平衡。我们的方法包括两个阶段。首先,根据它们的相似性和依赖性生成一组由高度相关的错误组成的相似报告组。然后,通过对相似的报告组进行分组来创建聚类,以便每个聚类可以具有相似的成本(即,最小化其标准偏差)。我们的方法利用遗传算法来查找错误报告的最佳分布,因为它是NP难题。进行了从Mozilla Firefox收集的1,047个错误报告的实验,以评估我们的方法。结果表明,我们的方法有效地提供了适当的解决方案,以实现错误报告的成本平衡分发。此外,我们针对15家公司的30名开发人员进行了一项用户研究,以了解该方法的有用性和有效性。在参与者中,有67%的人回答说我们的方法有助于将他们的错误分类给开发人员。从项目经理的角度来看,这显示了在管理或分类错误的情况下使用的可能性。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号