首页> 外文会议>International Conference on Computing and Information Technology >Feature Comparison for Automatic Bug Report Classification
【24h】

Feature Comparison for Automatic Bug Report Classification

机译:自动错误报告分类的功能比较

获取原文

摘要

Nowadays, various bug tracking systems (BTS) such as Jira, Trace, and Bugzilla have been developed and proposed to gather the issues from users worldwide. This is because those issues, called bug reports, contain a significant information for software quality maintenance and improvement. However, many bug reports with poor quality might have been submitted to the BTS. In general, the reported bugs in the BTS are firstly analyzed and filtered out by bug triagers. However, with the increasing amount of bug reports in the BTS, manually classifying bug reports is a time-consuming task. To address this problem, automatically distinguishing of bugs and non-bugs is necessary. To the best of our knowledge, this task is never easy for bug reports classification because the problem of bug reports misclassification still occurs to date. The background of this problem may be arise from using inappropriate or confusing features. Therefore, this work aims to study and discover the most proper features for binary bug report classification. This study compares seven features such as unigram, bigram, camel case, unigram+bigram, unigram+camel case, bigram+ camel case, and all features together. The experimental results show that the unigram+camel case should be the most proper features for binary bug report classification, especially when using with the logistic regression algorithm. Consequently, the unigram+camel case should be the proper feature to distinguish bug reports from the non-bugs ones.
机译:如今,已经开发了各种错误跟踪系统(BTS),如Jira,Trace和Bugzilla,并提出从全球用户收集问题。这是因为这些问题称为错误报告,包含软件质量维护和改进的重要信息。但是,许多质量差的错误报告可能已提交给BTS。一般而言,首先分析BTS中的报告错误并被Bug交换机滤除。但是,随着BTS中的越来越多的错误报告,手动对错误报告是耗时的任务。为了解决这个问题,需要自动区分错误和非错误。据我们所知,这项任务永远不会容易出现错误报告分类,因为错误报告错误分类仍然发生在日期。可以从使用不适当或令人困惑的功能来产生此问题的背景。因此,这项工作旨在研究和发现二进制错误报告分类的最适当的功能。本研究比较了七种功能,如unigram,Bigram,骆驼盒,Unigram + Bigram,Unigram +骆驼盒,Bigram +骆驼盒,以及所有功能。实验结果表明,Unigram +骆驼盒应该是二进制错误报告分类的最适当的功能,尤其是在使用Logistic回归算法时。因此,UNIGRAM +骆驼盒应该是区分从非漏洞的错误报告的适当特征。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号