...
首页> 外文期刊>Journal in computer virology >Unknown malcode detection and the imbalance problem
【24h】

Unknown malcode detection and the imbalance problem

机译:未知的恶意代码检测和不平衡问题

获取原文
获取原文并翻译 | 示例
   

获取外文期刊封面封底 >>

       

摘要

The recent growth in network usage has motivated the creation of new malicious code for various purposes. Today's signature-based antiviruses are very accurate for known malicious code, but can not detect new malicious code. Recently, classification algorithms were used successfully for the detection of unknown malicious code. But, these studies involved a test collection with a limited size and the same malicious: benign file ratio in both the training and test sets, a situation which does not reflect real-life conditions. We present a methodology for the detection of unknown malicious code, which examines concepts from text categorization, based on n-grams extraction from the binary code and feature selection. We performed an extensive evaluation, consisting of a test collection of more than 30,000 files, in which we investigated the class imbalance problem. In real-life scenarios, the malicious file content is expected to be low, about 10% of the total files. For practical purposes, it is unclear as to what the corresponding percentage in the training setrnshould be. Our results indicate that greater than 95% accuracy can be achieved through the use of a training set that has a malicious file content of less than 33.3%.
机译:网络使用率的最近增长促使出于各种目的而创建新的恶意代码。当今基于签名的防病毒软件对于已知的恶意代码非常准确,但无法检测到新的恶意代码。最近,分类算法已成功用于检测未知恶意代码。但是,这些研究涉及的测试集规模有限,并且在训练和测试集中都具有相同的恶意:良性文件比率,这种情况无法反映实际情况。我们提供了一种用于检测未知恶意代码的方法,该方法基于从二进制代码中提取的n-gram和特征选择来检查文本分类中的概念。我们进行了广泛的评估,包括超过30,000个文件的测试集合,在其中我们调查了类不平衡问题。在现实生活中,恶意文件的内容可能很少,大约占文件总数的10%。出于实际目的,尚不清楚训练集中的相应百分比应为多少。我们的结果表明,通过使用恶意文件内容少于33.3%的训练集,可以达到95%以上的准确性。

著录项

  • 来源
    《Journal in computer virology》 |2009年第4期|295-308|共14页
  • 作者单位

    Deutsche Telekom Laboratories, Department of Information Systems Engineering, Ben Gurion University, 84105 Be'er Sheva, Israel;

    Deutsche Telekom Laboratories, Department of Information Systems Engineering, Ben Gurion University, 84105 Be'er Sheva, Israel;

    Deutsche Telekom Laboratories, Department of Information Systems Engineering, Ben Gurion University, 84105 Be'er Sheva, Israel;

    Deutsche Telekom Laboratories, Department of Information Systems Engineering, Ben Gurion University, 84105 Be'er Sheva, Israel;

    School of Information Technology and Engineering, University of Ottawa, Ottawa, ON KIN 6N5, Canada;

    Deutsche Telekom Laboratories, Department of Information Systems Engineering, Ben Gurion University, 84105 Be'er Sheva, Israel;

  • 收录信息
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类
  • 关键词

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号