...
首页> 外文期刊>The Journal of Systems and Software >Bug severity prediction using question-and-answer pairs from Stack Overflow
【24h】

Bug severity prediction using question-and-answer pairs from Stack Overflow

机译:使用来自堆栈溢出的问题和答案对的错误严重性预测

获取原文
获取原文并翻译 | 示例
           

摘要

Nowadays, bugs have been common in most software systems. For large-scale software projects, developers usually conduct software maintenance tasks by utilizing software artifacts (e.g., bug reports). The severity of bug reports describes the impact of the bugs and determines how quickly it needs to be fixed. Bug triagers often pay close attention to some features such as severity to determine the importance of bug reports and assign them to the correct developers. However, a large number of bug reports submitted every day increase the workload of developers who have to spend more time on fixing bugs. In this paper, we collect question-and-answer pairs from Stack Overflow and use logical regression to predict the severity of bug reports. In detail, we extract all the posts related to bug repositories from Stack Overflow and combine them with bug reports to obtain enhanced versions of bug reports. We achieve severity prediction on three popular open source projects (e,g., Mozilla, Ecplise, and GCC) with Naieve Bayesian, k-Nearest Neighbor algorithm (KNN), and Long Short-Term Memory (LSTM). The results of our experiments show that our model is more accurate than the previous studies for predicting the severity. Our approach improves by 23.03%, 21.86%, and 20.59% of the average F-measure for Mozilla, Eclipse, and GCC by comparing with the Naieve Bayesian based approach which performs the best among all baseline approaches.
机译:如今,Bug在大多数软件系统中都很常见。对于大型软件项目,开发人员通常通过利用软件工件(例如,错误报告)进行软件维护任务。错误报告的严重性描述了错误的影响,并确定需要修复的速度。 BUG Trijers经常要密切关注某些功能,例如严重性,以确定错误报告的重要性并将它们分配给正确的开发人员。但是,每天提交的大量错误报告增加了开发人员的工作量,他们必须花更多时间在修复错误上。在本文中,我们从堆栈溢出中收集问题和答案对,并使用逻辑回归来预测错误报告的严重性。详细地,我们从堆栈溢出中提取与错误存储库相关的所有帖子,并将它们与错误报告组合以获取增强版本的错误报告。我们通过明智的贝叶斯,K最近邻算法(KNN)和长短期内存(LSTM)实现了对三个流行的开源项目(E,G.,Mozilla,Foplise和GCC)的严重性预测。我们的实验结果表明,我们的模型比以前的预测严重程度的研究更准确。我们的方法通过比较所有基于基于基线方法的恶劣贝叶斯夜种方法,提高了23.03%,21.86%和20.59%的莫扎拉,日食和GCC的平均F措施。

著录项

  • 来源
    《The Journal of Systems and Software》 |2020年第7期|110567.1-110567.14|共14页
  • 作者单位

    College of Software Harbin Engineering University Harbin 150001 China;

    College of Software Harbin Engineering University Harbin 150001 China;

    School of Computer Science and Technology Harbin Institute of Technology Harbin 150001 China;

    Faculty of Information Technology Macau University of Science and Technology Macao China;

    School of Big Data and Software Engineering Chongqing University Chongqing China;

    Department of Computing Hong Kong Polytechnic University Hong Kong China;

  • 收录信息
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类
  • 关键词

    Stack overflow; Severity prediction; Logistic regression; Bug reports;

    机译:堆栈溢出;严重程度预测;物流回归;错误报告;

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号