...
首页> 外文期刊>Knowledge and Information Systems >Collaborative Filtering Using a Regression-Based Approach
【24h】

Collaborative Filtering Using a Regression-Based Approach

机译:使用基于回归的方法进行协同过滤

获取原文
获取原文并翻译 | 示例

摘要

The task of collaborative filtering is to predict the preferences of an active user for unseen items given preferences of other users. These preferences are typically expressed as numerical ratings. In this paper, we propose a novel regression-based approach that first learns a number of experts describing relationships in ratings between pairs of items. Based on ratings provided by an active user for some of the items, the experts are combined by using statistical methods to predict the user’s preferences for the remaining items. The approach was designed to efficiently address the problem of data sparsity and prediction latency that characterise collaborative filtering. Extensive experiments on Eachmovie and Jester benchmark collaborative filtering data show that the proposed regression-based approach achieves improved accuracy and is orders of magnitude faster than the popular neighbour-based alternative. The difference in accuracy was more evident when the number of ratings provided by an active user was small, as is common for real-life recommendation systems. Additional benefits were observed in predicting items with large rating variability. To provide a more detailed characterisation of the proposed algorithm, additional experiments were performed on synthetic data with second-order statistics similar to that of the Eachmovie data. Strong experimental evidence was obtained that the proposed approach can be applied to data over a large range of sparsity scenarios and is superior to non-personalised predictors even when ratings data are very sparse.
机译:协作过滤的任务是,根据其他用户的偏好,预测活动用户对看不见的商品的偏好。这些首选项通常表示为数字等级。在本文中,我们提出了一种新颖的基于回归的方法,该方法首先学习了许多描述项目对之间等级之间关系的专家。根据活跃用户对某些商品提供的评分,通过使用统计方法来组合专家,以预测用户对其余商品的偏好。该方法旨在有效解决协作过滤的数据稀疏性和预测延迟问题。在Everymovie和Jester基准协作过滤数据上的大量实验表明,所提出的基于回归的方法可以提高准确性,并且比流行的基于邻域的方法要快几个数量级。当活动用户提供的评分数量很少时(如现实推荐系统中常见的那样),准确性的差异更加明显。在预测具有较大等级差异的项目时还观察到了其他好处。为了提供所提出算法的更详细特征,对合成数据进行了额外的实验,合成数据的二阶统计量与Everymovie数据相似。获得了有力的实验证据,证明了该方法可应用于各种稀疏情况下的数据,即使收视率数据非常稀疏,也优于非个性化的预测器。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号