首页> 外文期刊>Computers & Security >Detection method of domain names generated by DGAs based on semantic representation and deep neural network
【24h】

Detection method of domain names generated by DGAs based on semantic representation and deep neural network

机译:基于语义表示和深度神经网络的DGA生成的域名检测方法

获取原文
获取原文并翻译 | 示例

摘要

Botnets have become one of the main threats to cyberspace security currently. More and more bots utilize the domain generation algorithm (DGA) to generate malicious domain names to communicate with Command & Control (C&C) servers. A well-designed DGA can bypass the traditional detection methods such as sinkhole and rule filtering, which raises new challenges to cyberspace security. In the field of machine learning, the n-gram is a semantic model that characterizes the relationship among neighboring morphemes while deep convolutional neural networks have a robust capability in processing information with translation-invariant properties. In this paper, we combined n-gram and a deep convolutional neural network and then proposed a novel n-gram combined character based domain classification (n-CBDC) model. The n-CBDC model runs in an end-to-end way that doesn't require hand-extracted features or domain name system (DNS) contextual information; it only needs to input the domain name itself and can automatically estimate the probability that the domain name was generated by DGAs. Experiments on real-world data show that the proposed method can effectively detect domain names generated by DGAs with 98.69% average detection rate and 0.9829 average F-measure, and significantly outperformed the state-of-art methods in detecting pronounceable and wordlist-based DGA domain names with more than 93.89% detection rate. Therefore, the proposed detection method is robust and has a wide range of adaptability in detecting various types of domain names generated by DGAs. (C) 2019 Elsevier Ltd. All rights reserved.
机译:僵尸网络已成为当前对网络空间安全的主要威胁之一。越来越多的漫游器利用域生成算法(DGA)生成恶意域名,以与Command&Control(C&C)服务器进行通信。精心设计的DGA可以绕过传统的检测方法,例如污水坑和规则过滤,这对网络空间安全提出了新的挑战。在机器学习领域,n-gram是一个语义模型,用于描述相邻语素之间的关系,而深度卷积神经网络在处理具有平移不变属性的信息时具有强大的能力。在本文中,我们将n-gram和深层卷积神经网络相结合,然后提出了一种新的基于n-gram组合字符的域分类(n-CBDC)模型。 n-CBDC模型以端到端的方式运行,不需要手工提取的功能或域名系统(DNS)上下文信息。它只需要输入域名本身,就可以自动估计DGA生成域名的可能性。实际数据实验表明,该方法能够有效检测DGA生成的域名,平均检测率达98.69%,平均F测度为0.9829,在检测可发音的基于词表的DGA方面远胜过现有技术域名,检出率超过93.89%。因此,提出的检测方法是鲁棒的,并且在检测由DGA生成的各种类型的域名方面具有广泛的适应性。 (C)2019 Elsevier Ltd.保留所有权利。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号