首页> 外文期刊>International journal of machine learning and cybernetics >Unsupervised and supervised methods for the detection of hurriedly created profiles in recommender systems
【24h】

Unsupervised and supervised methods for the detection of hurriedly created profiles in recommender systems

机译:在推荐系统中检测匆忙创建的配置文件的无监督和监督方法

获取原文
获取原文并翻译 | 示例
       

摘要

Recommender systems try to provide users with accurate personalized suggestions for items based on an analysis of previous user decisions and the decisions made by other users. These systems suffer from profile injection attacks, where malicious profiles are generated in order to promote or demote a particular item introducing abnormal ratings. The problem of automatic detection of such malicious profiles has been recently addressed by a great number of authors in the literature using supervised and unsupervised approaches. In this paper, we propose a framework to identify anomalous rating profiles, where each attacker (outlier) hurriedly creates profiles that inject into the system an unspecified combination of random ratings and specific ratings, without any prior knowledge of the existing ratings. This attack is a superset of the two different attacks (Uniform and Delta) proposed in Harper et al. (ACM Trans Interact Intell Syst 5(4):19, 2016) making the attack model more realistic and its detection more challenging. The proposed detection method is based on several attributes related to the unpredictable behavior of the outliers in a validation set, on the user-item rating matrix, on the similarity between users and on the filler items. In this work, we propose a new attribute (RIS) to capture the randomness in item selection of the abnormal profiles. In this work, three different systems are proposed: (1) a probabilistic framework that estimates the probability of a user to be an outlier by combining several features in a completely unsupervised way. (2) An unsupervised clustering system based on the k-means algorithm that automatically spots the spurious profiles. (3) A supervised framework that uses a random forest classifier for cases where labeling sample data is available. Experimental results on the MovieLens and the Small Netflix datasets demonstrate the high performance of the proposed methods as well as the discrimination accuracy of the proposed features.
机译:推荐系统尝试为用户提供准确的个性化建议,根据先前用户决策的分析和其他用户所做的决策。这些系统遭受型材注射攻击,其中生成恶意配置文件,以便促进或降低引入异常评级的特定项目。最近在文献中的大量作者使用监督和无人监督的方法,最近涉及自动检测这种恶意档案的问题。在本文中,我们提出了一个框架来识别异常评级概况,其中每个攻击者(异常值)赶紧创建注入系统的简要额定值和特定额定值的未指定组合,而无需现有评级的任何先验知识。这种攻击是Harper等人提出的两个不同攻击(制服和三角洲)的超集。 (ACM Trans Interact Intell Syst 5(4):19,2016)使攻击模型更加现实,其检测更具挑战性。所提出的检测方法基于与验证集中的异常值的不可预测行为相关的几个属性,在用户项评级矩阵上,在用户和填充物品上的相似性上。在这项工作中,我们提出了一个新的属性(RIS)来捕获异常配置文件的项目选择中的随机性。在这项工作中,提出了三种不同的系统:(1)概率框架,其通过以完全无监视的方式组合多个特征来估计用户成为异常值的概率。 (2)基于K-Means算法的无监督群集系统,自动斑点斑点杂散配置文件。 (3)监督框架,用于随机林分类器,用于标记样品数据可用的情况。 Movielens和小型Netflix数据集上的实验结果证明了所提出的方法的高性能以及所提出的特征的辨别准确性。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号