【24h】

Learning Invariant Representation for Malicious Network Traffic Detection

机译:学习恶意网络流量检测的不变表示

获取原文

摘要

Statistical learning theory relies on an assumption that the joint distributions of observations and labels are the same in training and testing data. However, this assumption is violated in many real world problems, such as training a detector of malicious network traffic that can change over time as a result of attacker's detection evasion efforts. We propose to address this problem by creating an optimized representation, which significantly increases the robustness of detectors or classifiers trained under this distributional shift. The representation is created from bags of samples (e.g. network traffic logs) and is designed to be invariant under shifting and scaling of the feature values extracted from the logs and under permutation and size changes of the bags. The invariance is achieved by combining feature histograms with feature self-similarity matrices computed for each bag and significantly reduces the difference between the training and testing data. The parameters of the representation, such as histogram bin boundaries, are learned jointly with the classifier. We show that the representation is effective for training a detector of malicious traffic, achieving 90% precision and 67% recall on samples of previously unseen malware variants.
机译:统计学习理论依赖于假设观察和标签的联合分布在训练和测试数据中是相同的。然而,这种假设在许多现实世界问题中违反,例如培训可能因攻击者的检测逃避努力而随时间变化的恶意网络流量的探测器。我们建议通过创建优化的表示来解决这个问题,这显着增加了在该分布班次训练的探测器或分类器的稳健性。该表示是从样本袋(例如网络流量日志)创建的,并且旨在在从日志中提取的特征值的转换和缩放下不变,并且在袋子的置换和尺寸变化下。通过组合具有为每个袋子计算的特征自相似矩阵的特征直方图来实现不变性,并且显着降低训练和测试数据之间的差异。与分类器共同学习的表示的参数,例如直方图BIN边界。我们表明,该代表有效培训恶意流量的检测器,实现90%的精度和67%的预先考虑以前看不见的恶意软件变体的样本。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号