首页> 外文会议>International Conference on Computing, Communication and Networking Technologies >Machine Learning based Malware Detection in Cloud Environment using Clustering Approach
【24h】

Machine Learning based Malware Detection in Cloud Environment using Clustering Approach

机译:使用聚类方法的云环境中基于机器学习的恶意软件检测

获取原文

摘要

Enforcing security and resilience in a cloud platform is an essential but challenging problem due to the presence of a large number of heterogeneous applications running on shared resources. A security analysis system that can detect threats or malware must exist inside the cloud infrastructure. Much research has been done on machine learning-driven malware analysis, but it is limited in computational complexity and detection accuracy. To overcome these drawbacks, we proposed a new malware detection system based on the concept of clustering and trend micro locality sensitive hashing (TLSH). We used Cuckoo sandbox, which provides dynamic analysis reports of files by executing them in an isolated environment. We used a novel feature extraction algorithm to extract essential features from the malware reports obtained from the Cuckoo sandbox. Further, the most important features are selected using principal component analysis (PCA), random forest, and Chi-square feature selection methods. Subsequently, the experimental results are obtained for clustering and non-clustering approaches on three classifiers, including Decision Tree, Random Forest, and Logistic Regression. The model performance shows better classification accuracy and false positive rate (FPR) as compared to the state-of-the-art works and non-clustering approach at significantly lesser computation cost.
机译:由于存在大量在共享资源上运行的异构应用程序,因此在云平台上加强安全性和弹性是一个必不可少但具有挑战性的问题。云基础架构内部必须存在可以检测到威胁或恶意软件的安全分析系统。关于机器学习驱动的恶意软件分析的研究很多,但是在计算复杂性和检测准确性方面受到限制。为了克服这些缺点,我们提出了一种基于聚类和趋势微区域敏感哈希(TLSH)概念的新型恶意软件检测系统。我们使用了Cuckoo沙箱,该沙箱通过在隔离的环境中执行文件来提供文件的动态分析报告。我们使用了一种新颖的特征提取算法来从Cuckoo沙箱获得的恶意软件报告中提取基本特征。此外,使用主成分分析(PCA),随机森林和卡方特征选择方法可以选择最重要的特征。随后,获得了在决策树,随机森林和逻辑回归三个分类器上进行聚类和非聚类方法的实验结果。与最新技术和非聚类方法相比,模型性能显示出更好的分类准确性和误报率(FPR),而计算成本却大大降低。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号