首页> 美国卫生研究院文献>Sensors (Basel Switzerland) >LogEvent2vec: LogEvent-to-Vector Based Anomaly Detection for Large-Scale Logs in Internet of Things

【2h】

LogEvent2vec: LogEvent-to-Vector Based Anomaly Detection for Large-Scale Logs in Internet of Things

机译：LogEvent2vec：物联网中大型日志的基于LogEvent到矢量的异常检测

代理获取

本网站仅为用户提供外文OA文献查询和代理获取服务，本网站没有原文。下单后我们将采用程序或人工为您竭诚获取高质量的原文，但由于OA文献来源多样且变更频繁，仍可能出现获取不到、文献不完整或与标题不符等情况，如果获取不到我们将提供退款服务。请知悉。

页面导航

摘要
著录项
相似文献
相关主题

摘要

Log anomaly detection is an efficient method to manage modern large-scale Internet of Things (IoT) systems. More and more works start to apply natural language processing (NLP) methods, and in particular word2vec, in the log feature extraction. Word2vec can extract the relevance between words and vectorize the words. However, the computing cost of training word2vec is high. Anomalies in logs are dependent on not only an individual log message but also on the log message sequence. Therefore, the vector of words from word2vec can not be used directly, which needs to be transformed into the vector of log events and further transformed into the vector of log sequences. To reduce computational cost and avoid multiple transformations, in this paper, we propose an offline feature extraction model, named LogEvent2vec, which takes the log event as input of word2vec to extract the relevance between log events and vectorize log events directly. LogEvent2vec can work with any coordinate transformation methods and anomaly detection models. After getting the log event vector, we transform log event vector to log sequence vector by bary or tf-idf and three kinds of supervised models (Random Forests, Naive Bayes, and Neural Networks) are trained to detect the anomalies. We have conducted extensive experiments on a real public log dataset from BlueGene/L (BGL). The experimental results demonstrate that LogEvent2vec can significantly reduce computational time by 30 times and improve accuracy, comparing with word2vec. LogEvent2vec with bary and Random Forest can achieve the best F1-score and LogEvent2vec with tf-idf and Naive Bayes needs the least computational time.

机译：日志异常检测是管理现代大规模物联网（IoT）系统的有效方法。越来越多的作品开始在日志特征提取中应用自然语言处理（NLP）方法，尤其是word2vec。 Word2vec可以提取单词之间的相关性并将这些单词向量化。但是，训练word2vec的计算成本很高。日志异常不仅取决于单个日志消息，而且还取决于日志消息的顺序。因此，无法直接使用word2vec中的单词向量，需要将其转换为对数事件向量，然后进一步转换为对数序列向量。为了降低计算成本并避免多次转换，在本文中，我们提出了一个离线特征提取模型LogEvent2vec，该模型将日志事件作为word2vec的输入，以提取日志事件之间的相关性并直接矢量化日志事件。 LogEvent2vec可以使用任何坐标转换方法和异常检测模型。得到对数事件向量后，我们通过bary或tf-idf将对数事件向量转换为对数序列向量，并训练了三种监督模型（随机森林，朴素贝叶斯和神经网络）来检测异常。我们已经对BlueGene / L（BGL）的真实公共日志数据集进行了广泛的实验。实验结果表明，与word2vec相比，LogEvent2vec可以显着减少30倍的计算时间并提高准确性。具有bary和Random Forest的LogEvent2vec可以获得最佳的F1得分，具有tf-idf和Naive Bayes的LogEvent2vec所需的计算时间最少。

著录项

期刊名称 Sensors (Basel Switzerland)
作者
Jin Wang; Yangning Tang; Shiming He; Changqing Zhao; Pradip Kumar Sharma; Osama Alfarraj; Amr Tolba;
展开▼
作者单位

展开▼
年(卷),期 2020(20),9
年度 2020
页码 -1
总页数 19
原文格式 PDF
正文语种
中图分类
关键词
log anomaly detection; word2vec; log event; log template; device management; IoT;

机译：日志异常检测;word2vec;日志事件;日志模板;设备管理;物联网;

相似文献

外文文献
中文文献
专利

1. Collaborative anomaly-based detection of large-scale internet attacks [J] . Thomas Gamer Computer networks . 2012,第1期

机译：基于协作异常的大规模Internet攻击检测
2. DILAF: A framework for distributed analysis of large-scale system logs for anomaly detection [J] . Astekin Merve, Zengin Harun, Sozer Hasan Software . 2019,第2期

机译：DILAF：用于大规模系统日志的分布式分析以进行异常检测的框架
3. Execution anomaly detection in large-scale systems through console log analysis [J] . Bao Liang, Li Qian, Lu Peiyao, The Journal of Systems and Software . 2018,第sepa期

机译：通过控制台日志分析在大型系统中执行异常检测
4. Anomaly Detection Mechanism Based on Hierarchical Weights through Large-Scale Log Data [C] . Lanlan Xi, Yang Xin, Shoushan Luo, International Conference on Computer Communication and Artificial Intelligence . 2021

机译：基于大规模日志数据的分层权重的异常检测机制
5. Predicting Location and Time of Anomalies in Large-Scale Computing Systems via Log Mining [D] . Das, Anwesha. 2019

机译：通过日志挖掘预测大型计算系统中异常的位置和时间
6. LLAD: Life-Log Anomaly Detection Based on Recurrent Neural Network LSTM [O] . Ermal Elbasani, Jeong-Dong Kim 2021

机译：基于经常性神经网络LSTM的Llad：寿命对数异常检测
7. Business process analysis based on anomaly detection in event logs: a study on an incident management case [O] . Esther Maria Rojas Krugger, Ana Rocío Cárdenas Maita, Juliana Cristina Barbosa Alves, 2021

机译：基于异常检测的事件日志的业务流程分析：事件管理案例研究

LogEvent2vec: LogEvent-to-Vector Based Anomaly Detection for Large-Scale Logs in Internet of Things

摘要

著录项

相似文献

相关主题

期刊订阅