...
首页> 外文期刊>Applied Soft Computing >A comparison of clustering-based privacy-preserving collaborative filtering schemes
【24h】

A comparison of clustering-based privacy-preserving collaborative filtering schemes

机译:基于聚类的隐私保护协作过滤方案的比较

获取原文
获取原文并翻译 | 示例

摘要

Privacy-preserving collaborative filtering (PPCF) methods designate extremely beneficial filtering skills without deeply jeopardizing privacy. However, they mostly suffer from scalability, sparsity, and accuracy problems. First, applying privacy measures introduces additional costs making scalability worse. Second, due to randomness for preserving privacy, quality of predictions diminishes. Third, with increasing number of products, sparsity becomes an issue for both CF and PPCF schemes. In this study, we first propose a content-based profiling (CBP) of users to overcome sparsity issues while performing clustering because the very sparse nature of rating profiles sometimes do not allow strong discrimination. To cope with scalability and accuracy problems of PPCF schemes, we then show how to apply k-means clustering (KMC), fuzzy c-means method (FCM), and self-organizing map (SOM) clustering to CF schemes while preserving users' confidentiality. After presenting an evaluation of clustering-based methods in terms of privacy and supplementary costs, we carry out real data-based experiments to compare the clustering algorithms within and against traditional CF and PPCF approaches in terms of accuracy. Our empirical outcomes demonstrate that FCM achieves the best low cost performance compared to other methods due to its approximation-based model. The results also show that our privacy-preserving methods are able to offer precise predictions.
机译:隐私保护协作过滤(PPCF)方法指定了非常有益的过滤技能,而不会严重危害隐私。但是,它们大多遭受可伸缩性,稀疏性和准确性问题的困扰。首先,采用隐私措施会带来额外的成本,使可伸缩性变差。其次,由于保存隐私的随机性,预测的质量降低了。第三,随着产品数量的增加,CF和PPCF方案都变得稀疏。在这项研究中,我们首先提出用户的基于内容的概要分析(CBP),以在执行聚类时克服稀疏性问题,因为评级配置文件的稀疏性质有时不允许强烈的歧视。为了解决PPCF方案的可伸缩性和准确性问题,我们然后说明如何在保留用户身份的同时将k均值聚类(KMC),模糊c均值方法(FCM)和自组织映射(SOM)聚类到CF方案。保密。在对基于聚类的方法的隐私性和附加成本进行了评估之后,我们进行了基于数据的真实实验,以比较传统CF和PPCF方法内和与传统CF和PPCF方法之间的聚类算法的准确性。我们的经验结果表明,由于其基于近似的模型,与其他方法相比,FCM可获得最佳的低成本性能。结果还表明,我们的隐私保护方法能够提供精确的预测。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号