首页> 外文期刊>Journal of Computer Virology and Hacking Techniques >HDM-Analyser: a hybrid analysis approach based on data mining techniques for malware detection
【24h】

HDM-Analyser: a hybrid analysis approach based on data mining techniques for malware detection

机译:HDM-Analyser:基于数据挖掘技术的混合分析方法,用于恶意软件检测

获取原文
获取原文并翻译 | 示例

摘要

Today’s security threats like malware are more sophisticated and targeted than ever, and they are growing at an unprecedented rate. To deal with them, various approaches are introduced. One of them is Signature-based detection, which is an effective method and widely used to detect malware; however, there is a substantial problem in detecting new instances. In other words, it is solely useful for the second malware attack. Due to the rapid proliferation of malware and the desperate need for human effort to extract some kinds of signature, this approach is a tedious solution; thus, an intelligent malware detection system is required to deal with new malware threats. Most of intelligent detection systems utilise some data mining techniques in order to distinguish malware from sane programs. One of the pivotal phases of these systems is extracting features from malware samples and benign ones in order to make at least a learning model. This phase is called “Malware Analysis” which plays a significant role in these systems. Since API call sequence is an effective feature for realising unknown malware, this paper is focused on extracting this feature from executable files. There are two major kinds of approach to analyse an executable file. The first type of analysis is “Static Analysis” which analyses a program in source code level. The second one is “Dynamic Analysis” that extracts features by observing program’s activities such as system requests during its execution time. Static analysis has to traverse the program’s execution path in order to find called APIs. Because it does not have sufficient information about decision making points in the given executable file, it is not able to extract the real sequence of called APIs. Although dynamic analysis does not have this drawback, it suffers from execution overhead. Thus, the feature extraction phase takes noticeable time. In this paper, a novel hybrid approach, HDM-Analyser, is presented which takes advantages of dynamic and static analysis methods for rising speed while preserving the accuracy in a reasonable level. HDM-Analyser is able to predict the majority of decision making points by utilising the statistical information which is gathered by dynamic analysis; therefore, there is no execution overhead. The main contribution of this paper is taking accuracy advantage of the dynamic analysis and incorporating it into static analysis in order to augment the accuracy of static analysis. In fact, the execution overhead has been tolerated in learning phase; thus, it does not impose on feature extraction phase which is performed in scanning operation. The experimental results demonstrate that HDM-Analyser attains better overall accuracy and time complexity than static and dynamic analysis methods.
机译:如今,诸如恶意软件之类的安全威胁比以往任何时候都更加复杂,更有针对性,并且以前所未有的速度增长。为了处理它们,引入了各种方法。其中之一是基于签名的检测,这是一种有效的方法,广泛用于检测恶意软件。但是,在检测新实例方面存在很大的问题。换句话说,它仅对第二次恶意软件攻击有用。由于恶意软件的迅速扩散以及迫切需要人工来提取某些特征的需求,这种方法是一种乏味的解决方案。因此,需要一种智能的恶意软件检测系统来应对新的恶意软件威胁。大多数智能检测系统都利用某些数据挖掘技术,以区分恶意软件与健全程序。这些系统的关键阶段之一是从恶意软件样本和良性样本中提取特征,以便至少建立学习模型。此阶段称为“恶意软件分析”,在这些系统中扮演重要角色。由于API调用序列是实现未知恶意软件的有效功能,因此本文着重于从可执行文件中提取此功能。分析可执行文件有两种主要方法。第一类分析是“静态分析”,它以源代码级别分析程序。第二个是“动态分析”,它通过在程序执行期间观察程序的活动(例如系统请求)来提取特征。静态分析必须遍历程序的执行路径,才能找到所谓的API。由于在给定的可执行文件中没有足够的决策点信息,因此它无法提取被调用API的真实序列。尽管动态分析没有此缺点,但它具有执行开销。因此,特征提取阶段需要花费大量时间。在本文中,提出了一种新颖的混合方法HDM-Analyser,该方法利用动态和静态分析方法来提高速度,同时将精度保持在合理的水平。 HDM-Analyser能够利用动态分析收集的统计信息来预测大多数决策点;因此,没有执行开销。本文的主要贡献是利用动态分析的准确性优势,并将其合并到静态分析中,以提高静态分析的准确性。实际上,在学习阶段就可以容忍执行开销。因此,它不强加在扫描操作中执行的特征提取阶段。实验结果表明,与静态和动态分析方法相比,HDM-Analyser具有更好的总体准确性和时间复杂性。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号