首页> 外文期刊>Information and software technology >A statistical pattern based feature extraction method on system call traces for anomaly detection
【24h】

A statistical pattern based feature extraction method on system call traces for anomaly detection

机译:基于统计模式的系统呼叫迹线对异常检测的特征提取方法

获取原文
获取原文并翻译 | 示例
           

摘要

Context: In host-based anomaly detection, feature extraction on the system call traces is important to build an effective anomaly detection model. Different kinds of feature extraction methods are recently proposed and most of them aim at preserving the positional information of the system calls within a trace. These extracted features are generally named from system calls, therefore, cannot be used directly in the case of cross platform applications. In addition, some of these feature extraction methods are very costly to implement.Objective: This paper presents a new feature extraction method. It aims at extracting features that are irrelevant to the names of system calls. The samples represented by the extracted features can be directly used in the case of cross platform applications. In addition, this method is lightweight in that the feature values are not expensive to compute.Method: The proposed method firstly transforms the system calls in a trace into frequency sequences of n-grams and then explores a fixed number of statistical features on the frequency sequences. The extracted features are irrelevant to the names/indexes of system calls on a platform. The calculation of feature values works on the frequency sequences rather than on system call sequences. These feature vectors built on the training set with only normal data are then used to train a one class classification model for anomaly detection.Results: We compared our method with four previously proposed feature extraction methods on system call traces. When used on the same platform, even though our method does not always obtain the highest AUC, overall, it performs better than all the compared methods. When testing on cross platform, it performs the best among all compared methods.Conclusion: The features extracted by our method are platform-independent and are suitable for anomaly detection across platforms.
机译:背景信息:在基于主机的异常检测中,系统呼叫迹线上的功能提取对于构建有效的异常检测模型非常重要。最近提出了不同种类的特征提取方法,其中大多数旨在保留在迹线中系统呼叫的位置信息。因此,这些提取的功能通常从系统呼叫命名,因此不能直接使用跨平台应用程序的情况。此外,这些特征提取方法中的一些是非常昂贵的。目的:本文提出了一种新的特征提取方法。它旨在提取与系统调用的名称无关的功能。由提取的特征表示的样本可以直接用于跨平台应用的情况。此外,该方法是重量轻,因为特征值与计算成本不昂贵。方法:所提出的方法首先将系统调用转换为n克的频率序列,然后探讨频率上的固定数量的统计功能序列。提取的功能与平台上系统呼叫的名称/索引无关。特征值的计算适用于频率序列而不是系统呼叫序列。然后,这些特征向量基于仅具有正常数据的培训集,用于培训一个类别检测的一个类分类模型。结果:我们将我们的方法与系统调用迹线上的四个先前提出的特征提取方法进行了比较。当在同一平台上使用时,即使我们的方法并不总是获得最高的AUC,总的来说,它比所有比较方法更好。当在跨平台上测试时,它在所有比较方法中表现最佳。结论:我们的方法提取的功能是平台无关的,适用于平台的异常检测。

著录项

  • 来源
    《Information and software technology》 |2020年第10期|106348.1-106348.13|共13页
  • 作者单位

    Guangdong Pharmaceut Univ Sch Med Informat Engn Guangzhou 510006 Peoples R China|Amer Univ Dept Comp Sci Washington DC 20016 USA|Guangdong Prov Precise Med & Big Data Engn Techno Guangzhou 510006 Peoples R China;

    Amer Univ Dept Comp Sci Washington DC 20016 USA;

    South China Univ Technol Informat & Network Engn & Res Ctr Guangzhou 510041 Peoples R China|Commun & Comp Network Lab Guangdong Guangzhou 510041 Peoples R China;

    Guangdong Pharmaceut Univ Sch Med Informat Engn Guangzhou 510006 Peoples R China|Guangdong Prov Precise Med & Big Data Engn Techno Guangzhou 510006 Peoples R China;

    Guangdong Pharmaceut Univ Sch Med Informat Engn Guangzhou 510006 Peoples R China|Guangdong Prov Precise Med & Big Data Engn Techno Guangzhou 510006 Peoples R China;

    Guangdong Pharmaceut Univ Sch Med Informat Engn Guangzhou 510006 Peoples R China;

  • 收录信息
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类
  • 关键词

    Feature extraction; Statistical pattern; System calls; Platform-independent; One-class learning; Anomaly detection;

    机译:特征提取;统计模式;系统呼叫;平台无关;一流的学习;异常检测;

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号