首页> 外文会议>Asia-Pacific Software Engineering Conference >Classifying Bug Reports to Bugs and Other Requests Using Topic Modeling
【24h】

Classifying Bug Reports to Bugs and Other Requests Using Topic Modeling

机译:使用主题建模对错误报告错误报告错误和其他请求

获取原文

摘要

Bug reports are widely used in several research areas such as bug prediction, bug triaging, and etc. The performance of these studies relies on the information from bug reports. Previous study showed that a significant number of bug reports are actually misclassified between bugs and nonbugs. However, classifying bug reports is a time-consuming task. In the previous study, researchers spent 90 days to classify manually more than 7,000 bug reports. To tackle this problem, we propose automatic bug report classification techniques. We apply topic modeling to the corpora of preprocessed bug reports of three open-source software projects with decision tree, naive Bayes classifier, and logistic regression. The performance in classification, measured in F-measure score, varies between 0.66-0.76, 0.65-0.77, and 0.71-0.82 for HTTPClient, Jackrabbit, and Lucene project respectively.
机译:错误报告广泛应用于诸如Bug预测,Bug Trijing等几个研究领域,这些研究的性能依赖于来自错误报告的信息。以前的研究表明,大量错误报告实际上在错误和非营收之间错误分类。但是,对错误报告进行分类是耗时的任务。在以前的研究中,研究人员花了90天,以便手动分类超过7,000个错误报告。为了解决这个问题,我们提出了自动错误报告分类技术。我们将主题建模到具有决策树,天真贝叶斯分类器和Logistic回归的三个开源软件项目的预处理错误报告的Corpora。在F测量评分中测量的分类中的性能分别在0.66-0.76,0.65-0.77和0.71-0.82之间分别变化为Httpclient,Jackrabbit和Lucene项目。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号