...
首页> 外文期刊>Computers & Security >Semi-supervised recommendation attack detection based on Co-Forest
【24h】

Semi-supervised recommendation attack detection based on Co-Forest

机译:基于融合的半监督推荐攻击检测

获取原文
获取原文并翻译 | 示例

摘要

In recommendation attack, malicious users attempt to bias the recommendation results by injecting fake profiles into the rating database. To detect such attack, three types of methods, i.e., unsupervised, supervised and semi-supervised, have been proposed. Among these works, the advantage of semi-supervised methods is that they can use the unlabeled user profiles to improve the detection performance. However, the existing semi-supervised methods suffer from low precision. Aiming at this problem, in this paper, we propose a semi-supervised detection approach named SSADR-CoF based on the Co-Forest algorithm. Being different from the existing semi-supervised methods which only use a few of features to train a single classifier for the detection, the proposed approach uses a series of features to train an ensemble of classifiers to detect the recommendation attack. We first use the window dividing and rating behavior statistical methods to extract a series of user rating behavior mode features for training the detection model. Then, we use a small number of labeled user profiles to initialize an ensemble of classifiers, and use the ensemble of classifiers to assign labels to the unlabeled user profiles. Finally, we use the labeled and the newly labeled user profiles to iteratively update the classifiers for the detection. Experiments conducted on three benchmark datasets MovieLens 10M, MovieLens 25M, and Amazon show that the proposed approach can effectively improve the precision of the semi-supervised methods under the condition of maintaining high recall and AUC.
机译:在推荐攻击中,恶意用户试图通过将假型材注入评级数据库来偏见推荐结果。为了检测此类攻击,提出了三种类型的方法,即无监督,监督和半监督。在这些作品中,半监督方法的优势在于它们可以使用未标记的用户配置文件来提高检测性能。然而,现有的半监督方法遭受低精度。针对这个问题,在本文中,我们提出了一种基于共林算法的名为SSADR-COF的半监督检测方法。与现有的半监督方法不同,该方法只使用一些特征来训练单个分类器进行检测,所提出的方法使用一系列功能来训练分类器的集合来检测推荐攻击。我们首先使用窗口分割和评级行为统计方法来提取一系列用户评级行为模式功能,用于训练检测模型。然后,我们使用少量标记的用户配置文件来初始化分类器的集合,并使用分类器的集合来将标签分配给未标记的用户配置文件。最后,我们使用标记的和新标记的用户配置文件来迭代更新检测的分类器。在三个基准数据集MOVIELENS 10M,MOVIELENS 25M和亚马逊进行的实验表明,该拟议方法可以在维护高召回和AUC的条件下有效提高半监督方法的精度。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号