首页> 外文期刊>International Journal of Artificial Intelligence Tools: Architectures, Languages, Algorithms >AN OPTIMAL SCALING FRAMEWORK FOR COLLABORATIVE FILTERING RECOMMENDATION SYSTEMS
【24h】

AN OPTIMAL SCALING FRAMEWORK FOR COLLABORATIVE FILTERING RECOMMENDATION SYSTEMS

机译:协同过滤推荐系统的最优标度框架。

获取原文
获取原文并翻译 | 示例
           

摘要

Collaborative Filtering (CF) is a popular technique employed by Recommender Systems, a term used to describe intelligent methods that generate personalized recommendations. Some of the most efficient approaches to CF are based on latent factor models and nearest neighbor methods, and have received considerable attention in recent literature. Latent factor models can tackle some fundamental challenges of CF, such as data sparsity and scalability. In this work, we present an optimal scaling framework to address these problems using Categorical Principal Component Analysis (CatPCA) for the low-rank approximation of the user-item ratings matrix, followed by a neighborhood formation step. CatPCA is a versatile technique that utilizes an optimal scaling process where original data are transformed so that their overall variance is maximized. We considered both smooth and non-smooth transformations for the observed variables (items), such as numeric, (spline) ordinal, (spline) nominal and multiple nominal. The method was extended to handle missing data and incorporate differential weighting for items. Experiments were executed on three data sets of different sparsity and size, MovieLens 100k, 1M and Jester, aiming to evaluate the aforementioned options in terms of accuracy. A combined approach with a multiple nominal transformation and a "passive" missing data strategy clearly outperformed the other tested options for all three data sets. The results are comparable with those reported for single methods in the CF literature.
机译:协作过滤(CF)是Recommender系统采用的一种流行技术,该术语用于描述生成个性化推荐的智能方法。 CF的一些最有效方法是基于潜在因子模型和最近邻方法的,并且在最近的文献中受到了相当大的关注。潜在因素模型可以解决CF的一些基本挑战,例如数据稀疏性和可伸缩性。在这项工作中,我们提出了一个最佳的缩放框架,以使用分类主成分分析(CatPCA)来解决用户-项目评分矩阵的低秩近似问题,然后进行邻域形成步骤。 CatPCA是一种通用技术,它利用最佳缩放过程对原始数据进行转换,以使它们的整体方差最大化。对于观察到的变量(项目),我们考虑了平滑变换和非平滑变换,例如数值,(样条)序数,(样条)名义和多重名义。该方法已扩展为处理丢失的数据并合并了项目的差分加权。在三个稀疏性和大小不同的数据集(MovieLens 100k,1M和Jester)上进行了实验,旨在评估上述选项的准确性。具有多个名义转换和“被动”缺失数据策略的组合方法显然优于所有这三个数据集的其他测试选项。结果与CF文献中单一方法报道的结果相当。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号