首页> 外文会议>IEEE International Conference on Systems, Man, and Cybernetics >Interactive Machine Learning for Data Exfiltration Detection: Active Learning with Human Expertise
【24h】

Interactive Machine Learning for Data Exfiltration Detection: Active Learning with Human Expertise

机译:用于数据的交互式机器学习探测:与人类专业知识的主动学习

获取原文

摘要

Data exfiltration is a serious threat to organizations. Such exfiltrations cause breach events that can lead to millions of dollars of loss. Perimeter defense is not enough by itself since successful exploits from insiders can also be very damaging. Internal network user activities need to be monitored to detect malicious actions. Automatic machine learning methods can be applied for network anomaly detection, but they create a lot of false alarms. Domain experts can identify malicious users, but they are unable to process large volumes of data. Interactive machine learning (iML) deals with this tradeoff by creating an efficient collaboration between domain experts and machine learning algorithms. Previous research in iML has focused mainly on collaboration with non-experts. The design and requirements for expertise-driven iML have yet to be delineated for cybersecurity applications. In this research, we proposed an Active Learning (AL) model trained with outputs from a liberal (outputting many false alarms as well as possible hits) anomaly detection (AD) criterion to study expert-iML collaboration in anomaly detection. The results showed that: iML in this context can prune false alarms and minimize misses; the performance/compatibility tradeoff that typically occurs in conventional machine learning updates may be less salient in iML. We suggest that compatibility between experts and algorithms can be improved by presenting information about feature relevance during the training process.
机译:数据exfiltration是对组织的严重威胁。此类exfilteration导致违规事件,可以导致数百万美元的损失。由于业内人士的成功利用,外界防御本身就不够了,因此也可能会非常损害。需要监视内部网络用户活动以检测恶意操作。自动机器学习方法可用于网络异常检测,但它们创建了很多误报。域专家可以识别恶意用户,但它们无法处理大量数据。交互式机器学习(IML)通过在域专家和机器学习算法之间创建有效的合作来处理此权衡。以前在IML的研究主要集中在与非专家合作。专业知识驱动IML的设计和要求尚未划算网络安全应用程序。在这项研究中,我们提出了一种由自由主义(输出许多错误警报以及可能的命中)异常检测(AD)标准进行的输出培训的主动学习(AL)模型,以研究异常检测中的专家-IML协作。结果表明:IML在此上下文中可以修剪虚假警报并最大限度地减少未命中;通常发生在传统机器学习更新中的性能/兼容性折衷可能不太突出IML。我们建议通过在培训过程中呈现有关特征相关性的信息来提高专家和算法之间的兼容性。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号