【24h】

Unifying user-based and item-based collaborative filtering approaches by similarity fusion

机译:通过相似度融合统一基于用户和基于项目的协作过滤方法

获取原文

摘要

Memory-based methods for collaborative filtering predict new ratings by averaging (weighted) ratings between, respectively, pairs of similar users or items. In practice, a large number of ratings from similar users or similar items are not available, due to the sparsity inherent to rating data. Consequently, prediction quality can be poor. This paper re-formulates the memory-based collaborative filtering problem in a generative probabilistic framework, treating individual user-item ratings as predictors of missing ratings. The final rating is estimated by fusing predictions from three sources: predictions based on ratings of the same item by other users, predictions based on different item ratings made by the same user, and, third, ratings predicted based on data from other but similar users rating other but similar items. Existing user-based and item-based approaches correspond to the two simple cases of our framework. The complete model is however more robust to data sparsity, because thedifferent types of ratings are used in concert, while additional ratings from similar users towards similar items are employed as a background model to smooth the predictions. Experiments demonstrate that the proposed methods are indeed more robust against data sparsity and give better recommendations.
机译:基于内存的协作过滤方法通过分别对成对的相似用户项目之间的(加权)评级进行平均,从而预测新的评级。实际上,由于收视率数据固有的稀疏性,因此无法获得来自类似用户或类似项目的大量收视率。因此,预测质量可能很差。本文在生成概率框架中重新构造了基于内存的协作过滤问题,将单个用户项目评分视为缺失评分的预测指标。最终评级是通过融合来自三个来源的预测来估算的:基于其他用户对同一项目的评级的预测,基于同一用户做出的不同项目评级的预测以及第三,基于其他但相似用户的数据预测的评级对其他但相似的项目进行评分。现有的基于用户和基于项目的方法对应于我们框架的两个简单情况。但是,完整的模型对数据稀疏性更强健,因为不同类型的评级共同使用,而类似用户对类似项目的额外评级被用作背景模型来平滑预测。实验表明,所提出的方法确实对数据稀疏性更健壮,并给出了更好的建议。

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号