首页> 外文期刊>The Journal of Systems and Software >An automatically created novel bug dataset and its validation in bug prediction
【24h】

An automatically created novel bug dataset and its validation in bug prediction

机译:一个自动创建的新型错误数据集及其在Bug预测中的验证

获取原文
获取原文并翻译 | 示例
           

摘要

Bugs are inescapable during software development due to frequent code changes, tight deadlines, etc.; therefore, it is important to have tools to find these errors. One way of performing bug identification is to analyze the characteristics of buggy source code elements from the past and predict the present ones based on the same characteristics, using e.g. machine learning models. To support model building tasks, code elements and their characteristics are collected in so-called bug datasets which serve as the input for learning. We present the BugHunter Dataset; a novel kind of automatically constructed and freely available bug dataset containing code elements (files, classes, methods) with a wide set of code metrics and bug information. Other available bug datasets follow the traditional approach of gathering the characteristics of all source code elements (buggy and non-buggy) at only one or more pre-selected release versions of the code. Our approach, on the other hand, captures the buggy and the fixed states of the same source code elements from the narrowest timeframe we can identify for a bug's presence, regardless of release versions. To show the usefulness of the new dataset, we built and evaluated bug prediction models and achieved F-measure values over 0.74.
机译:由于频繁的代码变化,紧张截止日期等,在软件开发期间错误是不可避免的。因此,有工具可以找到这些错误很重要。执行错误识别的一种方法是通过使用例如使用例如相同的特征来分析错误源代码元素的特征,并通过例如使用例如相同的特征来预测本发明的特征。机器学习模型。为了支持模型构建任务,代码元素及其特征在所谓的Bug数据集中收集,该数据集用作学习的输入。我们介绍了Bughunter DataSet;一种新颖的自动构造和自由的Bug数据集,包含具有广泛代码指标和错误信息的代码元素(文件,类,方法)。其他可用的BUG数据集遵循传统的方法,即在代码的一个或多个预先选择的发布版本中收集所有源代码元素(越野车和非丢弃)的特征。另一方面,我们的方法从最窄的时间帧中捕获相同源代码元素的错误和固定状态,我们可以识别错误的存在,而不管发布版本如何。为了显示新数据集的有用性,我们构建和评估了Bug预测模型,并实现了超过0.74的F测量值。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号