Collecting Indicators of Compromise from Unstructured Text of Cybersecurity Articles using Neural-Based Sequence Labelling

Abstract

Indicators of Compromise (IOCs) are artifacts observed on a network or in an operating system that can be used to indicate a computer intrusion and to detect cyber-attacks at an early stage. They therefore play an important role in the field of cybersecurity. However, state-of-the-art IOC detection systems rely heavily on hand-crafted features that require expert knowledge of cybersecurity, and need large-scale manually annotated corpora to train an IOC classifier. In this paper, we propose using an end-to-end neural-based sequence labelling model to identify IOCs automatically from cybersecurity articles without expert knowledge of cybersecurity. By using a multi-head self-attention module and contextual features, we find that the proposed model is capable of gathering contextual information from the text of cybersecurity articles and performs better in the task of IOC identification. Experiments show that the proposed model outperforms other sequence labelling models, achieving an average F1-score of 89.0% on the English cybersecurity article test set and an average F1-score of approximately 81.8% on the Chinese test set.
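To make the sequence labelling framing concrete, the following is a minimal sketch of how per-token tags can be turned into IOC spans and scored with an F1 measure. It is not the authors' model or evaluation code. The BIO tagging scheme, the label names `IP` and `FILE`, the example sentence, and the helper names `bio_to_spans` and `span_f1` are all assumptions made for illustration; the abstract does not say which tag set is used or whether the reported F1 is computed at span level.

```python
from typing import List, Set, Tuple

Span = Tuple[str, int, int]  # (label, start index, end index exclusive)


def bio_to_spans(tags: List[str]) -> List[Span]:
    """Convert a BIO tag sequence into labelled spans (hypothetical helper)."""
    spans: List[Span] = []
    label, start = None, 0
    for i, tag in enumerate(tags):
        if tag.startswith("I-") and label == tag[2:]:
            continue  # the currently open span keeps growing
        # Any other tag closes the currently open span, if there is one.
        if label is not None:
            spans.append((label, start, i))
            label = None
        if tag.startswith("B-"):
            label, start = tag[2:], i
        # "O" tags and stray "I-" tags with no matching open span are ignored.
    if label is not None:
        spans.append((label, start, len(tags)))
    return spans


def span_f1(gold: List[Span], pred: List[Span]) -> float:
    """Exact-match span-level F1 (an assumed metric, for illustration only)."""
    gold_set: Set[Span] = set(gold)
    pred_set: Set[Span] = set(pred)
    true_pos = len(gold_set & pred_set)
    if true_pos == 0:
        return 0.0
    precision = true_pos / len(pred_set)
    recall = true_pos / len(gold_set)
    return 2 * precision * recall / (precision + recall)


# Invented example sentence; the tags are hand-written, not model output.
tokens = ["The", "malware", "connects", "to", "192.0.2.10", "and", "drops", "evil.exe"]
gold_tags = ["O", "O", "O", "O", "B-IP", "O", "O", "B-FILE"]
pred_tags = ["O", "O", "O", "O", "B-IP", "O", "O", "O"]

gold_spans = bio_to_spans(gold_tags)
pred_spans = bio_to_spans(pred_tags)

for label, start, end in gold_spans:
    print(label, " ".join(tokens[start:end]))
print(round(span_f1(gold_spans, pred_spans), 3))
```

In a neural sequence labeller, the hand-written `pred_tags` would be replaced by the tags the model predicts for each token; the span extraction and scoring steps stay the same.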


Publication details

Source: International Joint Conference on Neural Networks, 2019, pp. 1-8 (8 pages)
Authors: Zi Long; Lianzhi Tan; Shengping Zhou; Chaoyang He; Xin Liu
Original format: PDF

