首页> 外文期刊>Intelligent decision technologies >Tuning machine learning algorithms for content-based movie recommendation
【24h】

Tuning machine learning algorithms for content-based movie recommendation

机译:调整机器学习算法以基于内容的电影推荐

获取原文
获取原文并翻译 | 示例
           

摘要

Machine learning algorithms are often used in content-based recommender systems since a recommendation task can naturally be reduced to a classification problem: A recommender needs to learn a classifier for a given user where learning examples are characteristics of items previously liked/bought/seen by the user. However, multi-valued and continuous attributes require special approaches for classifier implementation as they can significantly influence classifier accuracy. In this paper we propose novel approaches for handling multi-valued and continuous attributes adequate for the naieve Bayes classifier and decision trees classifier, and tune it for content-based movie recommendation. We evaluate the performance of the resulting approaches using the MovieLens data set enriched with movie details retrieved from the Internet Movie Database. Our empirical results demonstrate that the naieve Bayes classifier is more suitable for content-based movie recommendation than the decision trees algorithm. In addition, the naieve Bayes classifier achieves better results with smart discretization of continuous attributes compared to the approach which models continuous attributes with a Gaussian distribution. Finally, we combine our best performing content-based algorithm with the k-means clustering algorithm typically used for collaborative filtering, and evaluate the performance of the resulting hybrid approach for a movie recommendation task. The experimental results clearly show that the hybrid approach significantly increases recommendation accuracy compared to collaborative filtering while reducing the risk of over specification, which is a typical problem of content-based approaches.
机译:机器学习算法通常用于基于内容的推荐器系统中,因为推荐任务自然可以归结为分类问题:推荐器需要为给定用户学习分类器,其中学习示例是先前喜欢/购买/观看过的商品的特征用户。但是,多值和连续属性需要特殊的方法来实现分类器,因为它们会显着影响分类器的准确性。在本文中,我们提出了新颖的方法来处理足以满足朴素贝叶斯分类器和决策树分类器的多值和连续属性,并对其进行调整以用于基于内容的电影推荐。我们使用MovieLens数据集(从Internet电影数据库检索到的电影详细信息)来评估所得方法的性能。我们的经验结果表明,与决策树算法相比,朴素的贝叶斯分类器更适合基于内容的电影推荐。此外,与对具有高斯分布的连续属性进行建模的方法相比,朴素的贝叶斯分类器通过对连续属性进行智能离散化,可以获得更好的结果。最后,我们将性能最佳的基于内容的算法与通常用于协作过滤的k-means聚类算法相结合,并评估了用于电影推荐任务的混合方法的性能。实验结果清楚地表明,与协作过滤相比,混合方法显着提高了推荐准确性,同时降低了超规范的风险,这是基于内容的方法的典型问题。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号