...
首页> 外文期刊>Empirical Software Engineering >Preventing duplicate bug reports by continuously querying bug reports
【24h】

Preventing duplicate bug reports by continuously querying bug reports

机译:通过不断查询错误报告来防止重复的错误报告

获取原文
获取原文并翻译 | 示例
   

获取外文期刊封面封底 >>

       

摘要

Bug deduplication or duplicate bug report detection is a hot topic in software engineering information retrieval research, but it is often not deployed. Typically to de-duplicate bug reports developers rely upon the search capabilities of the bug report software they employ, such as Bugzilla, Jira, or Github Issues. These search capabilities range from simple SQL string search to IR-based word indexing methods employed by search engines. Yet too often these searches do very little to stop the creation of duplicate bug reports. Some bug trackers have more than 10% of their bug reports marked as duplicate. Perhaps these bug tracker search engines are not enough? In this paper we propose a method of attempting to prevent duplicate bug reports before they start: continuously querying. That is as the bug reporter types in their bug report their text is used to query the bug database to find duplicate or related bug reports. This continuously querying bug reports allows the reporter to be alerted to duplicate bug reports as they report the bug, rather than formulating queries to find the duplicate bug report. Thus this work ushers in a new way of evaluating bug report deduplication techniques, as well as a new kind of bug deduplication task. We show that simple IR measures can address this problem but also that further research is needed to refine this novel process that is integrate-able into modern bug report systems.
机译:错误重复数据删除或重复错误报告检测是软件工程信息检索研究中的热门话题,但通常并未部署。通常,要删除重复的错误报告,开发人员将依赖于他们使用的错误报告软件的搜索功能,例如Bugzilla,Jira或Github Issues。这些搜索功能范围从简单的SQL字符串搜索到搜索引擎采用的基于IR的单词索引方法。但是,这些搜索通常很少能阻止重复的错误报告的创建。一些错误跟踪器将超过10%的错误报告标记为重复。也许这些Bug跟踪器搜索引擎还不够?在本文中,我们提出了一种尝试在重复的bug报告开始之前进行预防的方法:连续查询。也就是说,当错误报告者在其错误报告中键入内容时,其文本用于查询错误数据库以查找重复的或相关的错误报告。这种不断查询错误报告的方式可以使报告程序在报告错误时被提醒报告者重复错误报告,而不是通过查询来查找重复的错误报告。因此,这项工作带来了评估错误报告重复数据删除技术的新方法,以及一种新型的错误重复数据删除任务。我们表明,简单的IR措施可以解决此问题,但还需要进一步研究以完善可集成到现代错误报告系统中的新颖过程。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号