Representation vs. Model: What Matters Most for Source Code Vulnerability Detection

机译：表示与模型：源代码漏洞检测最重要的是什么

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

Vulnerabilities in the source code of software are critical issues in the realm of software engineering. Coping with vulnerabilities in software source code is becoming more challenging due to several aspects of complexity and volume. Deep learning has gained popularity throughout the years as a means of addressing such issues. In this paper, we propose an evaluation of vulnerability detection performance on source code representations and evaluate how Machine Learning (ML) strategies can improve them. The structure of our experiment consists of 3 Deep Neural Networks (DNNs) in conjunction with five different source code representations; Abstract Syntax Trees (ASTs), Code Gadgets (CGs), Semantics-based Vulnerability Candidates (SeVCs), Lexed Code Representations (LCRs), and Composite Code Representations (CCRs). Experimental results show that employing different ML strategies in conjunction with the base model structure influences the performance results to a varying degree. However, ML-based techniques suffer from poor performance on class imbalance handling when used in conjunction with source code representations for software vulnerability detection.

机译：软件源代码中的漏洞是软件工程领域的关键问题。由于复杂性和卷的几个方面，应对软件源代码中的漏洞正在变得更具挑战性。深入学习越来越受欢迎，作为解决此类问题的手段。在本文中，我们提出了对源代码表示的漏洞检测性能的评估，评估机器学习（ML）策略可以改善它们。我们的实验结构由3个深度神经网络（DNN）组成，与五种不同的源代码表示;摘要语法树（AST），代码小工具（CGS），基于语义的漏洞候选候选（SEVC），LEXED代码表示（LCR）和综合代码表示（CCR）。实验结果表明，与基础模型结构结合使用不同的ML策略影响性能结果的变化程度。然而，当与软件漏洞检测的源代码表示结合使用时，ML的基于ML的技术在类别不平衡处理中遭受了差的性能。

著录项

来源
《IEEE International Conference on Software Analysis, Evolution and Reengineering》|2021年|647-653|共7页
会议地点
作者
Wei Zheng; Abubakar Omari Abdallah Semasaba; Xiaoxue Wu; Samuel Akwasi Agyemang; Tao Liu; Yuan Ge;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类
关键词
Deep learning; Measurement; Analytical models; Conferences; Transfer learning; Neural networks; Syntactics;

机译：深入学习;测量;分析模型;会议;转移学习;神经网络;句法;

相似文献

外文文献
中文文献
专利

1. A Modified Maximal Divergence Sequential Auto-Encoder and Time Delay Neural Network Models for Vulnerable Binary Codes Detection [J] . Albahar Marwan Ali Quality Control, Transactions . 2020,第期

机译：修改的最大分流顺序自动编码器和易受攻击二进制代码检测的时间延迟神经网络模型
2. SQVDT: A scalable quantitative vulnerability detection technique for source code security assessment [J] . Akram Junaid, Luo Ping Software . 2021,第2期

机译：SQVDT：源代码安全评估的可扩展定量漏洞检测技术
3. Source code plagiarism detection with low-level structural representation and information retrieval [J] . Oscar Karnalim International Journal of Computers & Applications . 2021,第4a6期

机译：源代码抄袭检测低水平结构表示和信息检索
4. Vulnerability Detection in C/C++ Source Code With Graph Representation Learning [C] . Yuelong Wu, Jintian Lu, Yunyi Zhang, Annual Computing and Communication Workshop and Conference . 2021

机译：C / C ++源代码中的漏洞检测与图形表示学习
5. Towards the automation of vulnerability detection in source code. [D] . Ling, Hai Zhou. 2010

机译：旨在实现源代码中漏洞检测的自动化。
6. Modeling Deep Brain Stimulation: Point Source Approximation vs. Realistic Representation of the Electrode [O] . Tianhe C Zhang, Warren M Grill -1

机译：建模深部脑刺激：点源近似与电极的逼真再现
7. Automated Vulnerability Detection in Source Code Using Minimum Intermediate Representation Learning [O] . Xin Li, Lu Wang, Yang Xin, 2020

机译：使用最小中间表示学习的源代码中自动漏洞检测
8. Methane Modeling: Predicting the Inflow of Methane Gas into Coal Mines. Phase 2 - Small-Scale in-Mine Tests and Development of Two-Dimensional Models. Phase 3 - Final Report and Computer Source Codes. Volume 2. Computer Source Codes [R] . Schwerer, F. C. , Bollinger, E. R. , Pavone, A. M. , 1984

机译：甲烷模拟：预测甲烷气体进入煤矿。第2阶段 - 小规模矿井试验和二维模型的开发。第3阶段 - 最终报告和计算机源代码。第2卷。计算机源代码

Representation vs. Model: What Matters Most for Source Code Vulnerability Detection

摘要

著录项

相似文献

相关主题

期刊订阅