首页> 外文会议>IEEE International Conference on Software Analysis, Evolution and Reengineering >Representation vs. Model: What Matters Most for Source Code Vulnerability Detection
【24h】

Representation vs. Model: What Matters Most for Source Code Vulnerability Detection

机译:表示与模型:源代码漏洞检测最重要的是什么

获取原文

摘要

Vulnerabilities in the source code of software are critical issues in the realm of software engineering. Coping with vulnerabilities in software source code is becoming more challenging due to several aspects of complexity and volume. Deep learning has gained popularity throughout the years as a means of addressing such issues. In this paper, we propose an evaluation of vulnerability detection performance on source code representations and evaluate how Machine Learning (ML) strategies can improve them. The structure of our experiment consists of 3 Deep Neural Networks (DNNs) in conjunction with five different source code representations; Abstract Syntax Trees (ASTs), Code Gadgets (CGs), Semantics-based Vulnerability Candidates (SeVCs), Lexed Code Representations (LCRs), and Composite Code Representations (CCRs). Experimental results show that employing different ML strategies in conjunction with the base model structure influences the performance results to a varying degree. However, ML-based techniques suffer from poor performance on class imbalance handling when used in conjunction with source code representations for software vulnerability detection.
机译:软件源代码中的漏洞是软件工程领域的关键问题。由于复杂性和卷的几个方面,应对软件源代码中的漏洞正在变得更具挑战性。深入学习越来越受欢迎,作为解决此类问题的手段。在本文中,我们提出了对源代码表示的漏洞检测性能的评估,评估机器学习(ML)策略可以改善它们。我们的实验结构由3个深度神经网络(DNN)组成,与五种不同的源代码表示;摘要语法树(AST),代码小工具(CGS),基于语义的漏洞候选候选(SEVC),LEXED代码表示(LCR)和综合代码表示(CCR)。实验结果表明,与基础模型结构结合使用不同的ML策略影响性能结果的变化程度。然而,当与软件漏洞检测的源代码表示结合使用时,ML的基于ML的技术在类别不平衡处理中遭受了差的性能。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号