Cybersecurity Automated Information Extraction Techniques: Drawbacks of Current Methods, and Enhanced Extractors

Abstract

We address a crucial element of applied information extraction, the accurate identification of basic security entities in text, by evaluating previous methods and presenting new labelers. Our survey reveals that previous efforts have not been tested on documents similar to the targeted sources (news articles, blogs, tweets, etc.) and that no sufficiently large, publicly available annotated corpus of these documents exists. By assembling a representative test corpus, we perform a quantitative evaluation of previous methods in a realistic setting, revealing an overall lack of recall and giving insight into the models' beneficial and inhibiting elements. In particular, our results show that many previous efforts overfit to the non-representative test corpora in this domain. Informed by this evaluation, we present three novel cyber entity extractors, which seek to leverage the available labeled data but remain worthwhile on the more diverse documents encountered in the wild. Each new model advances the state of the art in recall, with maximal or near-maximal F1 score. Our results establish that the state of the art in cyber entity tagging is characterized by F1 = 0.61.

Bibliographic record

Source
IEEE International Conference on Machine Learning and Applications | 2017 | pp. 437-442 | 6 pages
Authors
Robert A. Bridges; Kelly M.T. Huffer; Corinne L. Jones; Michael D. Iannacone; John R. Goodall
Original format: PDF
Keywords
Security; Feature extraction; Bridges; Training; Databases; Blogs; Micromechanical devices
