...
首页> 外文期刊>Empirical Software Engineering >On the relationship between bug reports and queries for text retrieval-based bug localization
【24h】

On the relationship between bug reports and queries for text retrieval-based bug localization

机译:关于基于文本检索的错误本地化的错误报告与查询的关系

获取原文
获取原文并翻译 | 示例
           

摘要

As societal dependence on software continues to grow, bugs are becoming increasingly costly in terms of financial resources as well as human safety. Bug localization is the process by which a developer identifies buggy code that needs to be fixed to make a system safer and more reliable. Unfortunately, manually attempting to locate bugs solely from the information in a bug report requires advanced knowledge of how a system is constructed and the way its constituent pieces interact. Therefore, previous work has investigated numerous techniques for reducing the human effort spent in bug localization. One of the most common approaches is Text Retrieval (TR) in which a system's source code is indexed into a search space that is then queried for code relevant to a given bug report. In the last decade, dozens of papers have proposed improvements to bug localization using TR with largely positive results. However, several other studies have called the technique into question. According to these studies, evaluations of TR-based approaches often lack sufficient controls on biases that artificially inflate the results, namely: misclassified bugs, tangled commits, and localization hints. Here we argue that contemporary evaluations of TR approaches also include a negative bias that outweighs the previously identified positive biases: while TR approaches expect a natural language query, most evaluations simply formulate this query as the full text of a bug report. In this study we show that highly performing queries can be extracted from the bug report text, in order to make TR effective even without the aforementioned positive biases. Further, we analyze the provenance of terms in these highly performing queries to drive future work in automatic query extraction from bug reports.
机译:随着社会依赖的软件继续增长,由于财政资源和人类安全方面的错误变得越来越昂贵。 BUG本地化是开发人员识别需要修复的错误代码的过程,以使系统更安全,更可靠。不幸的是,手动尝试仅从错误报告中的信息查找错误,需要高级了解如何构建系统以及其成分件交互的方式。因此,以前的工作已经调查了减少在错误本地化中花费的人力努力的许多技术。最常见的方法之一是文本检索(TR),其中系统的源代码被索引到搜索空间中,然后查询与给定错误报告相关的代码。在过去的十年中,数十篇论文提出了使用TR的危害本地化的改进,主要是积极的结果。然而,其他几项研究已经称为该技术。根据这些研究,基于TR的方法的评估通常缺乏对人为膨胀结果的偏差的充分控制,即:错误分类,纠结的犯罪和本地化提示。在这里,我们认为TR方法的当代评估还包括负面偏见,超过先前识别的正面偏差:虽然TR方法期望自然语言查询,但大多数评估只需将此查询作为错误报告的全文制定。在本研究中,我们表明可以从错误报告文本中提取高度执行的查询,以便即使没有上述正偏差,也可以使TR生效。此外,我们分析了这些高度表现疑问中的术语的出处,以推动来自错误报告的自动查询提取中的未来工作。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号