Data preprocessing for distance-based unsupervised Intrusion Detection

Abstract

Because Intrusion Detection Systems (IDSs) operate in real time, they should be lightweight so that intrusions are detected as quickly as possible. Distance-based Outlier Detection (DBOD) is one of the most widely used techniques for detecting outliers because of its simplicity and efficiency. Additionally, DBOD is an unsupervised approach, which overcomes the lack of training datasets with known intrusions. However, since IDSs usually work with high-dimensional datasets, DBOD becomes subject to the curse of dimensionality. Furthermore, intrusion datasets should be normalized before the pairwise distances between observations are calculated. The purpose of this research is to conduct a comparative study of different normalization methods in conjunction with a well-known feature extraction technique, Principal Component Analysis (PCA), so that the efficiency of these methods as data preprocessing techniques can be investigated when DBOD is applied to detect intrusions. Experiments were performed using two distance metrics: Euclidean distance and Mahalanobis distance. We further examined PCA using 7 threshold values that determine the number of principal components to retain according to their cumulative contribution to the variability of the features. These approaches were evaluated using the KDD Cup 1999 intrusion detection (KDD-Cup) dataset. The main purpose of this study is to find the best attribute normalization method, along with the correct threshold value for PCA, so that a fast unsupervised IDS can discover intrusions effectively. The results recommend using the Log normalization method combined with the Euclidean distance when performing PCA.
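
The pipeline the abstract describes (normalize, reduce with PCA up to a variance threshold, then score observations by distance) can be illustrated with a minimal sketch. Everything concrete here is an assumption made for illustration and is not taken from the paper: the function names, the use of log(1 + x) as the log normalization, the 0.95 threshold, the k-th-nearest-neighbour distance as the outlier score (one common DBOD variant), and the synthetic data that stands in for KDD Cup 1999.

```python
import numpy as np


def log_normalize(X):
    """Compress heavy-tailed, non-negative features with log(1 + x)."""
    return np.log1p(X)


def pca_reduce(X, variance_threshold=0.95):
    """Keep the fewest principal components whose cumulative share of
    the total variance reaches variance_threshold."""
    Xc = X - X.mean(axis=0)
    # Squared singular values of the centred data are proportional
    # to the variance captured by each principal component.
    _, s, vt = np.linalg.svd(Xc, full_matrices=False)
    share = s**2 / np.sum(s**2)
    n_keep = int(np.searchsorted(np.cumsum(share), variance_threshold)) + 1
    n_keep = min(n_keep, len(s))  # guard against rounding at threshold 1.0
    return Xc @ vt[:n_keep].T, n_keep


def kth_neighbour_scores(Z, k=5):
    """Outlier score = Euclidean distance to the k-th nearest neighbour."""
    diff = Z[:, None, :] - Z[None, :, :]
    dist = np.sqrt((diff**2).sum(axis=2))
    # After sorting each row, column 0 is the point's zero distance to itself.
    return np.sort(dist, axis=1)[:, k]


# Synthetic stand-in data (NOT KDD Cup 1999): 200 ordinary rows followed
# by 5 rows with much larger feature values that play the role of intrusions.
rng = np.random.default_rng(0)
normal = rng.exponential(scale=10.0, size=(200, 6))
unusual = 1e4 + rng.exponential(scale=10.0, size=(5, 6))
X = np.vstack([normal, unusual])

Z, n_components = pca_reduce(log_normalize(X), variance_threshold=0.95)
scores = kth_neighbour_scores(Z, k=5)
top = np.argsort(scores)[-5:]  # indices of the five highest scores
print(n_components, sorted(top.tolist()))
```

The pairwise distance matrix is quadratic in the number of rows, so this sketch only suits small samples. A Mahalanobis-style variant could be approximated by dividing each retained component by its standard deviation before scoring, since Euclidean distance on such whitened components corresponds to Mahalanobis distance in the retained subspace.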

Bibliographic record

Source
Annual International Conference on Privacy, Security and Trust | 2011 | 8 pages
Authors
Said Dina; Stirling Lisa; Federolf Peter; Barker Ken
Original format: PDF
Chinese Library Classification: TP393.08-53
Date indexed: 2022-08-21 10:19:56

Similar literature

1. Statistical analysis of CIDDS-001 dataset for Network Intrusion Detection Systems using Distance-based Machine Learning [J]. Abhishek Verma, Virender Ranga. Procedia Computer Science. 2018, No. 5
2. Intrusion detection taxonomy and data preprocessing mechanisms [J]. Al-Utaibi Khaled A., El-Alfy El-Sayed M. Journal of Intelligent & Fuzzy Systems: Applications in Engineering and Technology. 2018, No. 3
3. An Ensemble Data Preprocessing Approach for Intrusion Detection System Using variant Firefly and Bk-NN Techniques [J]. D. Shona, M. Senthilkumar. International Journal of Applied Engineering Research. 2016, No. 6aPta5
4. Data preprocessing for distance-based unsupervised Intrusion Detection [C]. Said Dina, Stirling Lisa, Federolf Peter. 2011 Ninth Annual International Conference on Privacy, Security and Trust. 2011
5. Evaluation of Unsupervised Learning Techniques for Intrusion Detection in Mobile Ad Hoc Networks [D]. Dang, Binh Hy. 2014
6. An IoT-Focused Intrusion Detection System Approach Based on Preprocessing Characterization for Cybersecurity Datasets [O]. Xavier Larriva-Novo, Víctor A. Villagrá, Mario Vega-Barbas. 2021
7. Data Preprocessing for Intrusion Detection System using Swarm Intelligence Techniques [O]. S. Revathi, A. Malathi Ph.D. 2014
