【24h】

Malicious Code Detection Using Active Learning

机译:使用主动学习的恶意代码检测

获取原文

摘要

The recent growth in network usage has motivated the creation of new malicious code for various purposes, including economic and other malicious purposes. Currently, dozens of new malicious codes are created every day and this number is expected to increase in the coming years. Today's signature-based anti-viruses and heuristic-based methods are accurate, but cannot detect new malicious code. Recently, classification algorithms were used successfully for the detection of malicious code. We present a complete methodology for the detection of unknown malicious code, inspired by text categorization concepts. However, this approach can be exploited further to achieve a more accurate and efficient acquisition method of unknown malicious files. We use an Active-Learning framework that enables the selection of the unknown files for fast acquisition. We performed an extensive evaluation of a test collection consisting of more than 30,000 files. We present a rigorous evaluation setup, consisting of real-life scenarios, in which the malicious file content is expected to be low, at about 10% of the files in the stream. We define specific evaluation measures based on the known precision and recall measures, which show the accuracy of the acquisition process and the improvement in the classifier resulting from the efficient acquisition process.
机译:最近的网络使用增长是为了为各种目的创造新的恶意代码,包括经济和其他恶意目的。目前,每天都会创建数十个新的恶意代码,预计未来几年的数字将增加。今天的基于签名的反病毒和基于启发式的方法是准确的,但无法检测到新的恶意代码。最近,分类算法成功用于检测恶意代码。我们提出了一种通过文本分类概念的启发的未知恶意代码的完整方法。然而,这种方法可以进一步利用以实现更准确和有效的未知恶意文件的采集方法。我们使用一个主动学习框架,使能选择未知文件以进行快速采集。我们对由超过30,000个文件组成的测试收集进行了广泛的评估。我们介绍了一个严谨的评估设置,由现实生活方案组成,其中恶意文件内容预计将低,在流中的文件的约10%。我们根据已知的精度和召回措施来定义特定的评估措施,该测量措施显示采集过程的准确性以及由有效采集过程产生的分类器的改进。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号