首页> 外文会议>IEEE/WIC/ACM International Joint Conferences on Web Intelligence and Intelligent Agent Technologies >Improving Collaborative Filtering Based Recommenders Using Topic Modelling
【24h】

Improving Collaborative Filtering Based Recommenders Using Topic Modelling

机译:使用主题建模改进基于协作过滤的推荐器

获取原文

摘要

Standard Collaborative Filtering (CF) algorithms make use of interactions between users and items in the form of implicit or explicit ratings alone for generating recommendations. Similarity among users or items is calculated purely based on rating overlap in this case, without considering explicit properties of users or items involved, limiting their applicability in domains with very sparse rating spaces. In many domains such as movies, news or electronic commerce recommenders, considerable contextual data in text form describing item properties is available along with the rating data, which could be utilized to improve recommendation quality. In this paper, we propose a novel approach to improve standard CF based recommenders by utilizing latent Dirichlet allocation (LDA) to learn latent properties of items, expressed in terms of topic proportions, derived from their textual description. We infer user's topic preferences or user profile in the same latent space, based on her historical ratings. While computing similarity between users, we make use of a combined similarity measure involving rating overlap as well as similarity in the latent topic space. This approach alleviates sparsity problem as it allows calculation of similarity between users even if they have not rated any items in common. Our experiments on multiple public datasets indicate that the proposed hybrid approach significantly outperforms standard User Based and Item Based CF recommenders in terms of classification accuracy metrics such as precision, recall and F-measure.
机译:标准协作过滤(CF)算法仅以隐式或显式评级的形式利用用户与项目之间的交互来生成推荐。在这种情况下,用户或项目之间的相似性仅基于评级重叠来计算,而无需考虑所涉及的用户或项目的显式属性,从而限制了它们在具有非常稀疏的评级空间的域中的适用性。在电影,新闻或电子商务推荐者等许多领域中,可以使用描述项目属性的文本形式的大量上下文数据以及评分数据,这些数据可以用来提高推荐质量。在本文中,我们提出了一种新颖的方法,通过利用潜在的狄利克雷分配(LDA)学习从项目的文本描述中得出的以主题比例表示的项目的潜在属性,从而改进了基于CF的标准推荐者。我们根据用户的历史评分,推断用户在相同潜在空间中的主题偏好或用户个人资料。在计算用户之间的相似性时,我们使用了包含评分重叠以及潜在主题空间中相似性的组合相似性度量。这种方法缓解了稀疏性问题,因为它可以计算用户之间的相似度,即使他们没有共同评价任何项目。我们在多个公共数据集上的实验表明,在分类精度指标(例如精度,召回率和F度量)方面,所提出的混合方法明显优于标准的“基于用户”和“基于项目”的CF推荐器。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号