首页> 外文会议>The 2nd International Conference on Software Engineering and Data Mining >A data mining based method: Detecting software defects in source code
【24h】

A data mining based method: Detecting software defects in source code

机译:一种基于数据挖掘的方法:检测源代码中的软件缺陷

获取原文

摘要

With the expansion of software size and complexity, how to detect defects becomes a challenging problem. This paper proposes a defect detection method which applies data mining techniques in source code to detect two types of defects in one process. The two types of defects are rule-violating defects and copy-paste related defects which may include semantic defects. During the process, this method can also extract implicit programming rules without prior knowledge of the software and detect copy-paste segments with different granularities. The method is evaluated with the Linux kernel that contains more than 4 million lines of C code. The result shows that the resulting system can quickly detect many programming rules and violations to the rules. After using the novel pruning techniques, it will greatly reduce the effort of manually checking violations so as a large number of false positives are effectively eliminated. As an illustrative example of its effectiveness, a case study shows that among the top 50 violations reported by the proposed model, 11 defects can be confirmed after examining the source code.
机译:随着软件规模和复杂性的扩展,如何检测缺陷已成为一个具有挑战性的问题。本文提出了一种缺陷检测方法,该方法将源代码中的数据挖掘技术应用于在一个过程中检测两种类型的缺陷。两种类型的缺陷是违反规则的缺陷和与复制粘贴相关的缺陷,其中可能包括语义缺陷。在此过程中,此方法还可以在不事先了解软件的情况下提取隐式编程规则,并检测具有不同粒度的复制粘贴段。该方法是使用包含超过400万行C代码的Linux内核进行评估的。结果表明,生成的系统可以快速检测许多编程规则和违反规则的情况。使用新颖的修剪技术后,它将大大减少手动检查违规情况的工作量,从而有效地消除了许多误报。作为其有效性的说明性示例,案例研究表明,在提出的模型报告的前50个违规中,通过检查源代码可以确认11个缺陷。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号