Getting to the root of the problem: A detailed comparison of kernel and user level data for dynamic malware analysis

Nunes Matthew; Burnap Pete; Rana Omer; Reinecke Philipp; Lloyd Kaelon

首页> 外文期刊>Information Security Technical Report >Getting to the root of the problem: A detailed comparison of kernel and user level data for dynamic malware analysis

【24h】

Getting to the root of the problem: A detailed comparison of kernel and user level data for dynamic malware analysis

机译：找出问题的根源：对内核和用户级别数据进行详细比较以进行动态恶意软件分析

获取原文

获取原文并翻译 | 示例

掌桥外文数据库（机构版） >>

开具论文收录证明 >>

文献代查 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

Dynamic malware analysis is fast gaining popularity over static analysis since it is not easily defeated by evasion tactics such as obfuscation and polymorphism. During dynamic analysis it is common practice to capture the system calls that are made to better understand the behaviour of malware. There are several techniques to capture system calls, the most popular of which is a user-level hook. To study the effects of collecting system calls at different privilege levels and viewpoints, we collected data at a process-specific user-level using a virtualised sandbox environment and a system-wide kernel-level using a custom-built kernel driver. We then tested the performance of several state-of-the-art machine learning classifiers on the data. Random Forest was the best performing classifier with an accuracy of 95.2% for the kernel driver and 94.0% at a user-level. The combination of user and kernel level data gave the best classification results with an accuracy of 96.0% for Random Forest. This may seem intuitive but was hitherto not empirically demonstrated. Additionally, we observed that machine learning algorithms trained on data from the user-level tended to use the anti-debug/ anti-vm features in malware to distinguish it from benignware. Whereas, when trained on data from our kernel driver, machine learning algorithms seemed to use the differences in the general behaviour of the system to make their prediction, which explains why they complement each other so well. Our results show that capturing data at different privilege levels will affect the classifier's ability to detect malware, with kernel-level providing more utility than user-level for malware classification. Despite this, there exist more established user-level tools than kernel-level tools, suggesting more research effort should be directed at kernel-level. In short, this paper provides the first objective, evidence-based comparison of user and kernel level data for the purposes of malware classification. (C) 2019 The Authors. Published by Elsevier Ltd.

机译：动态恶意软件分析比静态分析迅速普及，因为动态化恶意软件分析不容易被混淆和多态性等规避策略所击败。在动态分析过程中，通常的做法是捕获为更好地了解恶意软件行为而进行的系统调用。有几种捕获系统调用的技术，其中最流行的是用户级挂钩。为了研究在不同特权级别和观点下收集系统调用的效果，我们使用虚拟化的沙箱环境在特定于进程的用户级别收集数据，并使用定制的内核驱动程序在系统范围的内核级别收集数据。然后，我们在数据上测试了几个最新的机器学习分类器的性能。随机森林是性能最好的分类器，内核驱动程序的准确度为95.2％，用户级别的准确度为94.0％。用户和内核级数据的组合给出了最佳分类结果，对于随机森林，其准确度为96.0％。这看起来似乎很直观，但是到目前为止还没有经验证明。此外，我们观察到在用户级别的数据上训练的机器学习算法倾向于使用恶意软件中的反调试/反虚拟机功能来将其与良性软件区分开。而当对来自我们的内核驱动程序的数据进行训练时，机器学习算法似乎利用系统一般行为的差异来进行预测，这解释了它们为什么能很好地互补。我们的结果表明，以不同的特权级别捕获数据将影响分类器检测恶意软件的能力，内核级别的恶意程序分类功能比用户级别的实用程序更多。尽管如此，与内核级工具相比，存在更多的已建立的用户级工具，这表明应该将更多的研究精力用于内核级。简而言之，本文提供了第一个客观的，基于证据的用户和内核级数据比较，以进行恶意软件分类。（C）2019作者。由Elsevier Ltd.发布

著录项

来源
《Information Security Technical Report》 |2019年第10期|102365.1-102365.18|共18页
作者
Nunes Matthew; Burnap Pete; Rana Omer; Reinecke Philipp; Lloyd Kaelon;
展开▼
作者单位

Cardiff Univ Sch Comp Sci & Informat Queens Bldg 5 Parade Cardiff CF24 3AA S Glam Wales;

展开▼
收录信息美国《工程索引》(EI);
原文格式 PDF
正文语种 eng
中图分类
关键词
Dynamic malware analysis; Behavioural malware analysis; API-calls; Machine learning;

机译：动态恶意软件分析;行为恶意软件分析;API调用;机器学习;

相似文献

外文文献
中文文献
专利

1. Microscopic Dynamics and Topology of Polymer Rings Immersed in a Host Matrix of Longer Linear Polymers: Results from a Detailed Molecular Dynamics Simulation Study and Comparison with Experimental Data [J] . George D. Papadopoulos, Dimitrios G. Tsalikis, Vlasis G. Mavrantzas Polymers . 2016,第8期

机译：浸入较长线性聚合物主体基质中的聚合物环的微观动力学和拓扑：详细的分子动力学模拟研究与实验数据的比较结果
2. The Roots of Controllers: Some Users Reorganize or Replace RTOS and Kernels in Their Operating Systems to Gain New Performance Capabilities [J] . Jim Montague Control Design: For Machine Builders . 2014,第3期

机译：控制器的根源：某些用户在其操作系统中重组或替换RTOS和内核以获取新的性能
3. Characteristics of finasteride users in comparison with nonusers: A Nordic nationwide study based on individual-level data from Denmark, Finland, and Sweden [J] . Pharmacoepidemiology and drug safety . 2020,第4期

机译：与非用户相比，利用者的特征：基于来自丹麦，芬兰和瑞典的个人级别数据的北欧全国性研究
4. Detection of Malware and Kernel-level Rootkits in Cloud Computing Environments [C] . Thu Yein Win, Huaglory Tianfield, Quentin Mair IEEE International Conference on Cyber Security and Cloud Computing . 2015

机译：在云计算环境中检测恶意软件和内核级rootkits
5. Data-centric approaches to kernel malware defense. [D] . Rhee, Junghwan. 2011

机译：以数据为中心的内核恶意软件防御方法。
6. Microscopic Dynamics and Topology of Polymer Rings Immersed in a Host Matrix of Longer Linear Polymers: Results from a Detailed Molecular Dynamics Simulation Study and Comparison with Experimental Data [O] . George D. Papadopoulos, Dimitrios G. Tsalikis, Vlasis G. Mavrantzas 2016

机译：浸入较长线性聚合物主体基质中的聚合物环的微观动力学和拓扑：详细的分子动力学模拟研究结果与实验数据进行比较得出结果
7. Getting to the root of the problem: A detailed comparison of kernel and user level data for dynamic malware analysis [O] . Matthew Nunes, Pete Burnap, Omer Rana, 2019

机译：获取问题的根源：用于动态恶意软件分析的内核和用户级数据的详细比较

Getting to the root of the problem: A detailed comparison of kernel and user level data for dynamic malware analysis

摘要

著录项

相似文献

相关主题

期刊订阅