Interactive Machine Learning for Data Exfiltration Detection: Active Learning with Human Expertise

Abstract

Data exfiltration is a serious threat to organizations. Such exfiltrations cause breach events that can lead to millions of dollars of loss. Perimeter defense is not enough by itself, since successful exploits by insiders can also be very damaging. Internal network user activities need to be monitored to detect malicious actions. Automatic machine learning methods can be applied for network anomaly detection, but they generate many false alarms. Domain experts can identify malicious users, but they are unable to process large volumes of data. Interactive machine learning (iML) addresses this tradeoff by creating an efficient collaboration between domain experts and machine learning algorithms. Previous research in iML has focused mainly on collaboration with non-experts. The design and requirements for expertise-driven iML have yet to be delineated for cybersecurity applications. In this research, we proposed an Active Learning (AL) model trained with outputs from a liberal anomaly detection (AD) criterion (one that outputs many false alarms as well as possible hits) to study expert-iML collaboration in anomaly detection. The results showed that (1) iML in this context can prune false alarms and minimize misses, and (2) the performance/compatibility tradeoff that typically occurs in conventional machine learning updates may be less salient in iML. We suggest that compatibility between experts and algorithms can be improved by presenting information about feature relevance during the training process.
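The pipeline the abstract describes (a liberal anomaly criterion feeding an expert-in-the-loop active learner) can be illustrated with a minimal Python sketch. This is not the authors' implementation. Everything specific here is an assumption made for illustration: the single synthetic "upload volume" feature, the z-score rule with a low cutoff standing in for the liberal AD criterion, the one-dimensional threshold classifier, uncertainty sampling as the query strategy, and the hypothetical `simulated_expert` function standing in for a human analyst.

```python
import statistics

def liberal_flags(values, z_cut=1.0):
    """Return indices whose z-score exceeds a deliberately low (liberal) cutoff."""
    mu = statistics.mean(values)
    sd = statistics.pstdev(values)
    return [i for i, v in enumerate(values) if (v - mu) / sd > z_cut]

def fit_threshold(values, labeled):
    """Toy 1-D classifier: midpoint between the largest value labeled benign
    and the smallest value labeled malicious (assumes separable classes)."""
    benign = [values[i] for i, y in labeled.items() if y == 0]
    malicious = [values[i] for i, y in labeled.items() if y == 1]
    if not malicious:
        return max(benign) + 1.0
    if not benign:
        return min(malicious) - 1.0
    return (max(benign) + min(malicious)) / 2.0

def active_learning(values, candidates, expert, budget=2):
    """Query the expert on the most uncertain candidates, then prune alerts."""
    # Seed with the two extreme candidates (an assumption, to cover both ends).
    order = sorted(candidates, key=lambda i: values[i])
    labeled = {i: expert(values[i]) for i in (order[0], order[-1])}
    for _ in range(budget):
        pool = [i for i in candidates if i not in labeled]
        if not pool:
            break
        t = fit_threshold(values, labeled)
        # Uncertainty sampling: ask about the candidate closest to the boundary.
        query = min(pool, key=lambda i: abs(values[i] - t))
        labeled[query] = expert(values[query])
    t = fit_threshold(values, labeled)
    alerts = [i for i in candidates if values[i] > t]
    return alerts, labeled

def simulated_expert(volume):
    """Hypothetical stand-in for a human analyst: 1 = malicious, 0 = benign."""
    return int(volume >= 900)

# Synthetic per-user upload volumes (made up for this sketch).
volumes = [100 + 5 * i for i in range(40)] + [600, 700, 800, 950, 1000, 1200]

candidates = liberal_flags(volumes)
alerts, labeled = active_learning(volumes, candidates, simulated_expert)

print("flagged by liberal criterion:", [volumes[i] for i in candidates])
print("expert labels requested:", len(labeled))
print("alerts after pruning:", [volumes[i] for i in alerts])
```

In this toy setting the liberal criterion flags six users, the simulated expert labels four of them, and the refined threshold keeps only the three that the expert rule would call malicious. A real deployment would use many features and a proper classifier, and the abstract's suggestion about feature relevance would apply to what is shown to the expert at each query.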

Bibliographic record

Source
IEEE International Conference on Systems, Man, and Cybernetics | 2020 | pp. 280-287 | 8 pages
Authors
Mu-Huan Chung; Mark Chignell; Lu Wang; Alexandra Jovicic; Abhay Raman;
Original format: PDF
Keywords
Explainable AI; cybersecurity; interactive machine learning; active learning;

