首页> 外文会议>International conference on knowledge science, engineering and management >FastLogSim: A Quick Log Pattern Parser Scheme Based on Text Similarity
【24h】

FastLogSim: A Quick Log Pattern Parser Scheme Based on Text Similarity

机译:FastLogSim:基于文本相似度的快速日志模式解析器方案

获取原文

摘要

Logs completely record all system events which can be used to reveal network security issue and analyse user behaviour. Since logs are stored in the form of unstructured data and there is no universal log retention standard, they can hardly be analysed directly. Most of the existing log parsers focus on the parsing accuracy and ignore the time performance while parsing the large-amount of logs. Therefore, this paper proposes FastLogSim, a fast log parsing scheme based on text similarity. To simplify the parsing workload, we perform deduplication on the logs after removing the key variable values to obtain the template. Subsequently, the similarity is computed to merge the similar templates and then obtain the log pattern. FastLogSim not only reduces the number of templates that need to be parsed from tens of millions to dozens, but also greatly improves the speed of pattern extraction. We evaluated FastLogSim on four real public log datasets. The experimental results show that when the FastLogSim process tens thousands of logs, it performs almost the same time as the mainstream log parser. However, when the number of logs exceeds ten million, FastLogSim is three times faster than previous state-of-the-art parsers. Hence, FastLogSim is appropriative for large-scale log pattern mining.
机译:日志完全记录了所有可用于揭示网络安全问题和分析用户行为的系统事件。由于日志是以非结构化数据的形式存储的,并且没有通用的日志保留标准,因此很难直接对其进行分析。现有的大多数日志解析器都专注于解析准确性,而在解析大量日志时会忽略时间性能。因此,本文提出了FastLogSim,一种基于文本相似度的快速日志解析方案。为了简化解析工作量,我们在删除关键变量值以获取模板后对日志执行重复数据删除。随后,计算相似度以合并相似模板,然后获得对数模式。 FastLogSim不仅将需要解析的模板的数量从数千万个减少到数十个,而且还大大提高了模式提取的速度。我们在四个真实的公共日志数据集上评估了FastLogSim。实验结果表明,当FastLogSim处理成千上万条日志时,其执行时间与主流日志解析器几乎相同。但是,当日志数量超过一千万时,FastLogSim会比以前的最新解析器快三倍。因此,FastLogSim适用于大规模日志模式挖掘。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号