Feature Comparison for Automatic Bug Report Classification

机译：自动错误报告分类的功能比较

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

Nowadays, various bug tracking systems (BTS) such as Jira, Trace, and Bugzilla have been developed and proposed to gather the issues from users worldwide. This is because those issues, called bug reports, contain a significant information for software quality maintenance and improvement. However, many bug reports with poor quality might have been submitted to the BTS. In general, the reported bugs in the BTS are firstly analyzed and filtered out by bug triagers. However, with the increasing amount of bug reports in the BTS, manually classifying bug reports is a time-consuming task. To address this problem, automatically distinguishing of bugs and non-bugs is necessary. To the best of our knowledge, this task is never easy for bug reports classification because the problem of bug reports misclassification still occurs to date. The background of this problem may be arise from using inappropriate or confusing features. Therefore, this work aims to study and discover the most proper features for binary bug report classification. This study compares seven features such as unigram, bigram, camel case, unigram+bigram, unigram+camel case, bigram+ camel case, and all features together. The experimental results show that the unigram+camel case should be the most proper features for binary bug report classification, especially when using with the logistic regression algorithm. Consequently, the unigram+camel case should be the proper feature to distinguish bug reports from the non-bugs ones.

机译：如今，已经开发了各种错误跟踪系统（BTS），如Jira，Trace和Bugzilla，并提出从全球用户收集问题。这是因为这些问题称为错误报告，包含软件质量维护和改进的重要信息。但是，许多质量差的错误报告可能已提交给BTS。一般而言，首先分析BTS中的报告错误并被Bug交换机滤除。但是，随着BTS中的越来越多的错误报告，手动对错误报告是耗时的任务。为了解决这个问题，需要自动区分错误和非错误。据我们所知，这项任务永远不会容易出现错误报告分类，因为错误报告错误分类仍然发生在日期。可以从使用不适当或令人困惑的功能来产生此问题的背景。因此，这项工作旨在研究和发现二进制错误报告分类的最适当的功能。本研究比较了七种功能，如unigram，Bigram，骆驼盒，Unigram + Bigram，Unigram +骆驼盒，Bigram +骆驼盒，以及所有功能。实验结果表明，Unigram +骆驼盒应该是二进制错误报告分类的最适当的功能，尤其是在使用Logistic回归算法时。因此，UNIGRAM +骆驼盒应该是区分从非漏洞的错误报告的适当特征。

著录项

来源
《International Conference on Computing and Information Technology》|2020年|xi 270 pages :|共10页
会议地点
作者
Bancha Luaphol; Boonchoo Srikudkao; Tontrakant Kachai; Natthakit Srikanjanapert; Jantima Polpinij; Poramin Bheganan;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类 TP3-532;
关键词
Bug report features; Bug report classification; Misclassification; Bug and non-bug; Unigram; Bigram; Camel case; Na?ve Bayes; Logistic regression; Support vector machines;

机译：错误报告特征;错误报告分类;错误分类;虫子和非虫子;unigram;骆驼盒;na？ve贝叶斯;物流回归;支持矢量机器;

相似文献

外文文献
中文文献
专利

1. An HMM-based approach for automatic detection and classification of duplicate bug reports [J] . Ebrahimi Neda, Trabelsi Abdelaziz, Islam Md Shariful, Information and software technology . 2019,第SEPa期

机译：基于HMM的方法，用于自动检测和分类重复的错误报告
2. An HMM-based approach for automatic detection and classification of duplicate bug reports [J] . Ebrahimi Neda, Trabelsi Abdelaziz, Islam Md Shariful, Information and software technology . 2019,第Sepa期

机译：基于HMM的自动检测方法和分类重复错误报告
3. LEARNING TO RANK AND CLASSIFICATION OF BUG REPORTS USING SVM AND FEATURE EVALUATION [J] . S.Rajeswari, S. Sharavanan, R.Vijai, International Journal on Smart Sensing and Intelligent Systems . 2017,第SPECIALaISSUE期

机译：使用SVM和特征评估学习对错误报告的排名和分类
4. Feature Comparison for Automatic Bug Report Classification [C] . Bancha Luaphol, Boonchoo Srikudkao, Tontrakant Kachai, International Conference on Computing and Information Technology . 2020

机译：自动错误报告分类的功能比较
5. Automatic Sentence Annotation for More Useful Bug Report Summarization [D] . Galappaththi, Akalanka. 2020

机译：自动句子注释以获取更多有用的错误报告摘要
6. Automatic ICD-10 multi-class classification of cause of death from plaintext autopsy reports through expert-driven feature selection [O] . Ghulam Mujtaba, Liyana Shuib, Ram Gopal Raj, -1

机译：通过专家驱动的特征选择从纯文本尸检报告中自动对死亡原因进行ICD-10多类分类
7. An HMM-based approach for automatic detection and classification of duplicate bug reports [O] . Neda Ebrahimi, Abdelaziz Trabelsi, Md. Shariful Islam, 2019

机译：基于HMM的自动检测方法和分类重复错误报告

Feature Comparison for Automatic Bug Report Classification

摘要

著录项

相似文献

相关主题

期刊订阅