首页> 外文学位 >Enriching user and item profiles for collaborative filtering: From concept hierarchies to user-generated reviews.
【24h】

Enriching user and item profiles for collaborative filtering: From concept hierarchies to user-generated reviews.

机译:丰富用户和项目配置文件以进行协作过滤:从概念层次结构到用户生成的评论。

获取原文
获取原文并翻译 | 示例

摘要

Collaborative Filtering (CF) is a recommender systems technique that generates personalized recommendations for users based on user preferences. Such preferences are usually expressed in the form of numerical ratings, or binary votes such as purchase data. Despite its considerable success and popularity in both research and practice, CF suffers from the problems of data sparseness and cold-start recommendation, which is an extreme form of data sparseness. Specifically, CF algorithms have difficulty with generating reliable recommendations when data are sparse, and they cannot recommend items that have not received any ratings from users.;This thesis addresses the problems of data sparseness and cold-start recommendation of CF along two dimensions. Firstly, we developed two novel recommendation algorithms based on association rule mining techniques. The proposed algorithms, namely FARAMS and CLARE, exploit the relationships between items that are encoded in the concept hierarchies of the items when users' preference data are too limited for generating recommendations. Specifically, FARAMS makes use of interesting associations between item categories to find recommendable items for users having limited known preferences, while CLARE generates recommendations for a given cold-start item by finding other items in the system that are highly correlated with the attributes of the cold-start item. We evaluated both algorithms based on widely adopted benchmarking datasets of CF. Results show that both algorithms outperform related algorithms in addressing data sparseness and the cold-start problem under similar experimental settings.;Secondly, we investigated the use of user-generated reviews for generating personalized recommendations. We made three major contributions in this area. First, we collected and analyzed a set of movie reviews to understand how user opinions are expressed in user-generated reviews, which are free-form texts written in natural language. Based on the results of our analysis, we proposed a novel method for determining the sentimental orientations and strength of user opinions. Second, we proposed a rating inference framework, namely PREF, for augmenting ratings for CF. PREF aims at determining and representing the overall sentiments expressed in reviews as numerical ratings that can readily be used by existing CF algorithms. In other words, PREF enables existing CF algorithms to utilize textual reviews as an additional source of user preferences, thereby lessens the problem of data sparseness. Third, we found that user-generated reviews contain valuable information for constructing the interest profiles of users and domain items based on a real-world dataset of tourist attraction reviews. Using such information for generating personalized recommendations significantly improve the prediction quality and coverage of traditional CF algorithms. While existing CF algorithms operate on numerical ratings or binary votes of items, our research represents an important pioneering step towards a novel CF paradigm based on user-generated reviews.
机译:协作过滤(CF)是一种推荐程序系统技术,可根据用户偏好为用户生成个性化推荐。这种偏好通常以数字等级或二元投票(例如购买数据)的形式表示。尽管在研究和实践中都取得了巨大的成功和普及,但CF仍然存在数据稀疏和冷启动建议的问题,这是数据稀疏的一种极端形式。具体来说,CF算法在数据稀疏时难以生成可靠的推荐,因此无法推荐未收到用户任何评价的项目。;本文从两个维度解决了CF数据稀疏和冷启动推荐的问题。首先,我们基于关联规则挖掘技术开发了两种新颖的推荐算法。当用户的偏好数据太受限制而无法生成推荐时,建议的算法,即FARAMS和CLARE,会利用在项目的概念层次结构中编码的项目之间的关系。具体来说,FARAMS利用项目类别之间的有趣关联来为具有有限已知偏好的用户找到推荐项目,而CLARE通过查找系统中与寒冷属性高度相关的其他项目来生成给定冷启动项目的建议-开始项目。我们基于广泛采用的CF基准数据集评估了这两种算法。结果表明,在相似的实验设置下,两种算法在解决数据稀疏和冷启动问题方面均优于相关算法。其次,我们研究了用户生成的评论在生成个性化推荐中的使用。我们在这一领域做出了三项重大贡献。首先,我们收集并分析了一组电影评论,以了解如何在用户生成的评论中表达用户意见,这些评论是用自然语言编写的自由格式文本。根据我们的分析结果,我们提出了一种确定情感倾向和用户意见强度的新方法。其次,我们提出了一个评级推断框架,即PREF,用于增强CF的评级。 PREF旨在确定和表示评论中表达的整体情感,以数字评分表示,这些评分可以被现有CF算法轻松使用。换句话说,PREF使现有的CF算法能够将文本评论用作用户偏好的其他来源,从而减轻了数据稀疏的问题。第三,我们发现用户生成的评论包含有价值的信息,这些信息可用于根据旅游景点评论的真实世界数据构建用户和领域项目的兴趣概况。使用此类信息生成个性化推荐会显着提高传统CF算法的预测质量和覆盖范围。尽管现有的CF算法可在数值等级或项目的二进制票数上运行,但我们的研究代表了迈向基于用户生成的评论的新颖CF范例的重要先驱一步。

著录项

  • 作者

    Leung, Wing Ki Cane.;

  • 作者单位

    Hong Kong Polytechnic University (Hong Kong).;

  • 授予单位 Hong Kong Polytechnic University (Hong Kong).;
  • 学科 Computer Science.
  • 学位 Ph.D.
  • 年度 2009
  • 页码 196 p.
  • 总页数 196
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类
  • 关键词

  • 入库时间 2022-08-17 11:38:16

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号