首页> 外文期刊>Journal of intelligent & fuzzy systems: Applications in Engineering and Technology >Malware classification using dynamic features and Hidden Markov Model
【24h】

Malware classification using dynamic features and Hidden Markov Model

机译:使用动态功能和隐马尔可夫模型对恶意软件进行分类

获取原文
获取原文并翻译 | 示例
           

摘要

In recent years the number of new malware threats has increased significantly, causing a damage of billions of dollars globally. To counter this aggressive malware attack, the anti-malware industry needs to be able to correctly classify malware in order to provide defense against them. Consequently, malware classification has been an active area of research, and a multitude of malware classification approaches have been proposed in the literature. This paper evaluates two methods of sequence classification based on Hidden Markov Model, namely the maximum likelihood and similarity-based methods, for classification of malware using a large and comprehensive dataset. System calls generated by known malware during execution are used as observation sequences to train the Hidden Markov Models. Malware samples are evaluated against the trained models to produce similarity vectors, which are used in the maximum likelihood and similarity-based classification schemes to predict the family for an unknown malware sample. Comparison of the two schemes shows that combining the powerful statistical pattern analysis capability of Hidden Markov Models and discriminative classifiers in the similarity based method results in a significantly better classification performance as compared to the maximum likelihood approach. Furthermore, evaluation of different classifiers in the similarity-based method demonstrates that Random Forest classifier performs better than other classifiers on malware similarity vectors.
机译:近年来,新的恶意软件威胁的数量已大大增加,在全球范围内造成数十亿美元的损失。为了应对这种激进的恶意软件攻击,反恶意软件行业需要能够正确分类恶意软件,以便对它们进行防御。因此,恶意软件分类一直是研究的活跃领域,并且文献中已经提出了多种恶意软件分类方法。本文评估了基于隐马尔可夫模型的两种序列分类方法,即最大似然法和基于相似度的方法,用于使用大型而全面的数据集对恶意软件进行分类。执行期间由已知恶意软件生成的系统调用用作观察序列,以训练隐马尔可夫模型。针对训练有素的模型对恶意软件样本进行评估,以产生相似性向量,将其用于最大似然和基于相似度的分类方案中,以预测未知恶意软件样本的族。两种方案的比较表明,在基于相似度的方法中,结合了隐马尔可夫模型和判别式分类器的强大统计模式分析功能,与最大似然法相比,分类性能显着提高。此外,在基于相似度的方法中对不同分类器的评估表明,随机森林分类器在恶意软件相似性向量上的性能优于其他分类器。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号