首页> 外文会议>International Joint Conference on Neural Networks >A Feature Ensemble-based Approach to Malicious Domain Name Identification from Valid DNS Responses
【24h】

A Feature Ensemble-based Approach to Malicious Domain Name Identification from Valid DNS Responses

机译:基于特征集合的有效DNS响应识别恶意域名的方法

获取原文

摘要

Identifying malicious domain names in Internet activities has become an effective method to protect Internet users. Previous works have achieved great identification results, but they highly rely on historical Domain Name System (DNS) responses and external intelligence sources. Thus, they may fail to identify unknown domain name without any prior knowledge. In this paper, we propose Glacier, a feature ensemble-based approach to identifying malicious domain names from valid DNS responses. Glacier addresses the aforementioned problem by utilizing two types of features in domain name strings: the linguistical features and the statistical features. (1) Linguistical features are vector representations generated from the character sequences of domain names by a bidirectional long short-term memory (BiLSTM) neural network. It is worthy to notice that we modify the last BiLSTM layer to enhance the expressiveness of the linguistical features. (2) Statistical features are six manually designed statistics that represent the structural information of a domain name. Structural information can hardly be learnt by a BiLSTM neural network directly. Thus, combining statistical features with linguistical features can improve the effectiveness of malicious domain name identification. We evaluate the identification ability of Glacier on a real-world domain name data set. The best metrics of Glacier are an average accuracy of 90.86% and an average F1-score of 84.37%. Our experimental results show that Glacier can accurately identify resolvable malicious domain names without any DNS traffic data or prior knowledge about unknown domain names.
机译:在Internet活动中识别恶意域名已成为保护Internet用户的有效方法。先前的工作已经取得了很好的识别结果,但是它们高度依赖于历史域名系统(DNS)响应和外部情报源。因此,在没有任何先验知识的情况下,他们可能无法识别未知域名。在本文中,我们提出Glacier,这是一种基于功能集成的方法,可以从有效的DNS响应中识别恶意域名。 Glacier通过利用域名字符串中的两种类型的功能解决了上述问题:语言功能和统计功能。 (1)语言特征是通过双向长短期记忆(BiLSTM)神经网络从域名的字符序列生成的矢量表示。值得注意的是,我们修改了最后一个BiLSTM层以增强语言功能的表现力。 (2)统计功能是六个手动设计的统计信息,它们代表域名的结构信息。 BiLSTM神经网络很难直接学习结构信息。因此,将统计特征与语言特征相结合可以提高恶意域名识别的有效性。我们评估现实世界域名数据集上Glacier的识别能力。 Glacier的最佳指标是90.86%的平均准确度和84.37%的平均F1得分。我们的实验结果表明,Glacier可以准确地识别可解决的恶意域名,而无需任何DNS流量数据或有关未知域名的先验知识。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号