首页> 外文期刊>Journal of Parallel and Distributed Computing >Improvement of recommendation algorithm based on Collaborative Deep Learning and its Parallelization on Spark
【24h】

Improvement of recommendation algorithm based on Collaborative Deep Learning and its Parallelization on Spark

机译:基于协同深度学习的推荐算法及其对火花的平行化改进

获取原文
获取原文并翻译 | 示例
       

摘要

Collaborative Deep Learning (CDL) utilizes the strong feature learning capability of neural network and the model fitting robustness to solve the problem that the performance of Recommender System drops dramatically when the data is sparse. However, it makes the model training become difficult to maintain when Recommender System faces a large amount of data, and a variety of unpredictable problems will arise. In order to solve the above problems, collaborative deep learning and its parallelization methods were studied in this study, and an improved model CDL-I (CDL with item private node) aiming at item content optimization based on collaborative deep learning was proposed, which improved SDAE on the basis of CDL, added private network nodes; in case of sharing the network parameters of the model, private bias terms were added for each item. As a result, the network may learn the item content parameters in a more targeted manner, thereby enhancing the detection performance of the model on item content in Recommender System. Furthermore, the algorithm was parallelized by splitting the model, and a parallel training CDL-I method was also proposed, which was transplanted to the Spark distributed cluster. The parameters of each part of the model were trained and optimized in parallel to enhance the scale and scalability of data that the model could process. The experiments on multiple real datasets have verified the effectiveness and efficiency of the proposed parallel CDL-I algorithm.
机译:协作深度学习(CDL)利用神经网络的强大特征学习能力和模型拟合稳健性,解决了在数据稀疏时巨大幅度下降的问题的问题。然而,当推荐系统面临大量数据时,它使模型训练变得难以维持,并且会出现各种不可预测的问题。为了解决上述问题,在本研究中研究了协作深度学习及其并行化方法,提出了一种基于协作深度学习的项目内容优化的改进的模型CDL-I(CDL与项目私人节点),提高了SDAE在CDL的基础上,添加了私有网络节点;在共享模型的网络参数的情况下,为每个项目添加私有偏差术语。结果,该网络可以以更具目标的方式学习项目内容参数,从而提高推荐系统中的项目内容的模型的检测性能。此外,通过分离模型并行化算法,并且还提出了一种并联训练CDL-1方法,其移植到火花分布簇。培训模型的每个部分的参数并并行培训并优化,以增强模型可以处理的数据的比例和可扩展性。多个实时数据集的实验已经验证了所提出的并行CDL-I算法的有效性和效率。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号