首页> 外文会议>2011 IEEE World Congress on Services >Scaling-Up Item-Based Collaborative Filtering Recommendation Algorithm Based on Hadoop
【24h】

Scaling-Up Item-Based Collaborative Filtering Recommendation Algorithm Based on Hadoop

机译:基于Hadoop的基于项的放大协同过滤推荐算法。

获取原文

摘要

Collaborative filtering (CF) techniques have achieved widespread success in E-commerce nowadays. The tremendous growth of the number of customers and products in recent years poses some key challenges for recommender systems in which high quality recommendations are required and more recommendations per second for millions of customers and products need to be performed. Thus, the improvement of scalability and efficiency of collaborative filtering (CF) algorithms become increasingly important and difficult. In this paper, we developed and implemented a scaling-up item-based collaborative filtering algorithm on MapReduce, by splitting the three most costly computations in the proposed algorithm into four Map-Reduce phases, each of which can be independently executed on different nodes in parallel. We also proposed efficient partition strategies not only to enable the parallel computation in each Map-Reduce phase but also to maximize data locality to minimize the communication cost. Experimental results effectively showed the good performance in scalability and efficiency of the item-based CF algorithm on a Hadoop cluster.
机译:如今,协作过滤(CF)技术已在电子商务中取得了广泛的成功。近年来,客户和产品数量的巨大增长对推荐系统提出了一些关键挑战,在这些推荐系统中,需要高质量的建议,每秒需要为数百万个客户和产品执行更多的建议。因此,协作过滤(CF)算法的可伸缩性和效率的提高变得越来越重要和困难。在本文中,我们通过将提出的算法中的三个最昂贵的计算分为四个Map-Reduce阶段,在MapReduce上开发并实现了一个基于项目的按比例放大协作过滤算法,每个阶段都可以在不同的节点上独立执行。平行线。我们还提出了有效的分区策略,不仅可以在每个Map-Reduce阶段中进行并行计算,而且还可以最大化数据局部性以最小化通信成本。实验结果有效地表明了基于项目的CF算法在Hadoop集群上的可扩展性和效率方面的良好性能。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号