首页> 外国专利> Detecting poisoning attacks on neural networks by activation clustering

Detecting poisoning attacks on neural networks by activation clustering

机译:通过激活聚类检测神经网络的中毒攻击

摘要

One embodiment provides a method comprising receiving a training set comprising a plurality of data points, where a neural network is trained as a classifier based on the training set. The method further comprises, for each data point of the training set, classifying the data point with one of a plurality of classification labels using the trained neural network, and recording neuronal activations of a portion of the trained neural network in response to the data point. The method further comprises, for each classification label that a portion of the training set has been classified with, clustering a portion of all recorded neuronal activations that are in response to the portion of the training set, and detecting one or more poisonous data points in the portion of the training set based on the clustering.
机译:一个实施例提供了一种方法,包括接收包括多个数据点的训练集,其中神经网络被视为基于训练集的分类器。 该方法还包括,对于训练集的每个数据点,使用训练的神经网络将数据点与多个分类标签之一进行分类,并响应于数据点记录训练的神经网络的一部分的神经元激活 。 该方法还包括,对于每个分类标签,所以训练集的一部分已被分类,群集响应于训练集的部分的所有记录的神经元激活的一部分,并检测一个或多个有毒数据点 基于群集的训练集的部分。

著录项

相似文献

  • 专利
  • 外文文献
  • 中文文献
获取专利

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号