首页> 中文期刊> 《计算机技术与发展 》 >基于矩阵分解的协同过滤算法的并行化研究

基于矩阵分解的协同过滤算法的并行化研究

             

摘要

基于矩阵分解的协同过滤算法是近几年提出的一种协同过滤推荐技术,但其每项预测评分的计算都要综合大量评分数据,同时在计算时还需要存储庞大的特征矩阵,用单一节点来进行推荐将会遇到计算时间和计算资源的瓶颈。通过对现有的基于ALS(最小二乘法)的协同过滤算法在Hadoop上并行化实现的原理和特点进行深入的研究,得到了传统的迭代式算法在Hadoop上运算效率不高的原因。根据迭代式MapReduce思想,提出了循环感知任务调度算法、缓存静态数据、任务循环控制、迭代终止条件检测等方法。通过在Netflix数据集上的实验表明,迭代式MapReduce思想提高了基于ALS的协同过滤算法的并行化计算的效率。%Collaborative filtering algorithm based on matrix factorization is a collaborative filtering recommendation technique proposed in recent years. In the process of recommendation each prediction depends on the collaboration of the whole known rating set and the feature matrices need huge storage. So the recommendation with only one node will meet the bottleneck of time and resource. Through in-depth study on the principle and feature of current parallel implementation of a collaborative filtering algorithm based on ALS ( Alternating-Least-Squares) ,get the reason why the computing efficiency of the implementation of traditional iterative algorithm on Hadoop is very low. According to the idea of iterative MapReduce,some methods such as loop-aware scheduling algorithm,static data caching,job loop controlling,fixed point detecting are proposed. The experiment on Netflix data set shows that the iterative MapReduce has improved the parallel computing efficiency of collaborative filtering algorithm based on ALS.

著录项

相似文献

  • 中文文献
  • 外文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号