On the relationship between bug reports and queries for text retrieval-based bug localization

Chris Mills; Esteban Parra; Jevgenija Pantiuchina; Gabriele Bavota; Sonia Haiduc

Abstract

As societal dependence on software continues to grow, bugs are becoming increasingly costly in terms of financial resources as well as human safety. Bug localization is the process by which a developer identifies buggy code that needs to be fixed to make a system safer and more reliable. Unfortunately, manually attempting to locate bugs solely from the information in a bug report requires advanced knowledge of how a system is constructed and the way its constituent pieces interact. Therefore, previous work has investigated numerous techniques for reducing the human effort spent in bug localization. One of the most common approaches is Text Retrieval (TR) in which a system's source code is indexed into a search space that is then queried for code relevant to a given bug report. In the last decade, dozens of papers have proposed improvements to bug localization using TR with largely positive results. However, several other studies have called the technique into question. According to these studies, evaluations of TR-based approaches often lack sufficient controls on biases that artificially inflate the results, namely: misclassified bugs, tangled commits, and localization hints. Here we argue that contemporary evaluations of TR approaches also include a negative bias that outweighs the previously identified positive biases: while TR approaches expect a natural language query, most evaluations simply formulate this query as the full text of a bug report. In this study we show that highly performing queries can be extracted from the bug report text, in order to make TR effective even without the aforementioned positive biases. Further, we analyze the provenance of terms in these highly performing queries to drive future work in automatic query extraction from bug reports.
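The TR setup described in the abstract (index the source code, then query the index with text from a bug report) can be illustrated with a minimal sketch. Everything here is an assumption made for illustration: the three-file corpus, the bug report text, the hand-picked reduced query, and the choice of TF-IDF with cosine similarity as the retrieval model. It is not the authors' tooling, data, or query extraction method.

```python
import math
import re
from collections import Counter


def tokenize(text):
    """Split camelCase identifiers, lowercase, and keep alphabetic tokens."""
    text = re.sub(r"([a-z])([A-Z])", r"\1 \2", text)
    return re.findall(r"[a-z]+", text.lower())


def build_index(corpus):
    """Build TF-IDF vectors for a {file_name: text} corpus; return (vectors, idf)."""
    counts = {name: Counter(tokenize(text)) for name, text in corpus.items()}
    doc_freq = Counter()
    for term_counts in counts.values():
        doc_freq.update(term_counts.keys())
    n_docs = len(corpus)
    idf = {term: math.log(n_docs / df) + 1.0 for term, df in doc_freq.items()}
    vectors = {
        name: {term: tf * idf[term] for term, tf in term_counts.items()}
        for name, term_counts in counts.items()
    }
    return vectors, idf


def cosine(a, b):
    """Cosine similarity between two sparse vectors stored as dicts."""
    dot = sum(weight * b.get(term, 0.0) for term, weight in a.items())
    norm_a = math.sqrt(sum(w * w for w in a.values()))
    norm_b = math.sqrt(sum(w * w for w in b.values()))
    if norm_a == 0.0 or norm_b == 0.0:
        return 0.0
    return dot / (norm_a * norm_b)


def rank(query, vectors, idf):
    """Return file names ordered from most to least similar to the query."""
    # Query terms that never occur in the corpus carry no signal and are dropped.
    query_counts = Counter(t for t in tokenize(query) if t in idf)
    query_vec = {term: tf * idf[term] for term, tf in query_counts.items()}
    return sorted(vectors, key=lambda name: cosine(query_vec, vectors[name]), reverse=True)


# Hypothetical toy corpus: each "file" is reduced to a bag of identifier words.
corpus = {
    "ConfigParser.java": "class ConfigParser parse config file read key value "
                         "missing key throws null pointer",
    "LoginDialog.java": "class LoginDialog show dialog user password button "
                        "click open window",
    "ReportExporter.java": "class ReportExporter export report file write pdf save",
}
buggy_file = "ConfigParser.java"  # assumed ground truth for this toy example

# Hypothetical bug report: mostly reproduction steps, with one useful sentence.
full_report = (
    "After I log in through the login dialog, click the button to open the "
    "window and then click save, the application crashes. "
    "I think the config file has a missing key."
)
# A shorter query built only from terms that appear in the report (chosen by hand).
reduced_query = "config missing key"

vectors, idf = build_index(corpus)
full_ranking = rank(full_report, vectors, idf)
reduced_ranking = rank(reduced_query, vectors, idf)

print("full report  :", full_ranking)
print("reduced query:", reduced_ranking)
```

In this toy setting, the reproduction steps in the full report pull a user-interface file above the buggy file, while the three-term query taken from the same report ranks the buggy file first. This only illustrates the kind of negative bias the abstract describes and says nothing about its size on real systems.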


Bibliographic details

Source: Empirical Software Engineering, 2020, Issue 5, pp. 3086-3127 (42 pages)
Authors: Chris Mills; Esteban Parra; Jevgenija Pantiuchina; Gabriele Bavota; Sonia Haiduc
Author affiliations:

Florida State University, 600 W College Ave., Tallahassee, FL 32306, USA;

Florida State University, 600 W College Ave., Tallahassee, FL 32306, USA;

Università della Svizzera italiana, Via Giuseppe Buffi 13, 6900 Lugano, Switzerland;

Università della Svizzera italiana, Via Giuseppe Buffi 13, 6900 Lugano, Switzerland;

Florida State University, 600 W College Ave., Tallahassee, FL 32306, USA

Original format: PDF
Language: English
Keywords: Bug localization; Query formulation; Text retrieval
