DCF: A Dataflow-Based Collaborative Filtering Training Algorithm

Xiangyu Ju; Quan Chen; Zhenning Wang; Minyi Guo; Guang R. Gao

首页> 外文期刊>International journal of parallel programming >DCF: A Dataflow-Based Collaborative Filtering Training Algorithm

【24h】

DCF: A Dataflow-Based Collaborative Filtering Training Algorithm

机译：DCF：一种基于数据流的协同过滤训练算法

获取原文

获取原文并翻译 | 示例

开具论文收录证明 >>

页面导航

摘要
著录项
引文网络
相似文献
相关主题

摘要

Emerging recommender systems often adopt collaborative filtering techniques to improve the recommending accuracy. Existing collaborative filtering techniques are implemented with either alternating least square algorithm or gradient descent (GD) algorithm. However, both of the two algorithms are not scalable because ALS suffers from high computation complexity and GD suffers from severe synchronization problem and tremendous data movement. To solve the above problems, we proposed a Dataflow-based Collaborative Filtering (DCF) algorithm. More specifically, DCF exploits fine-grain asynchronous feature of dataflow model to minimize synchronization overhead; leverages mini-batch technique to reduce computation and communication complexities; uses dummy edge and multicasting techniques to avoid fine-grain overhead of dependency checking and reduce data movement. By utilizing all the above techniques, DCF is able to significantly improve the performance of collaborative filtering. Our experiment on a cluster with one master node and ten slave nodes show that DCF achieves 23 $$imes $$ × speedup over ALS on Spark and 18 $$imes $$ × speedup over GD on Graphlab in public datasets.

机译：新兴的推荐系统通常采用协作过滤技术来提高推荐准确性。现有的协同过滤技术是通过交替最小二乘算法或梯度下降（GD）算法实现的。但是，这两种算法都无法扩展，因为ALS遭受了很高的计算复杂度，而GD遭受了严重的同步问题和巨大的数据移动。为了解决上述问题，我们提出了一种基于数据流的协同过滤（DCF）算法。更具体地说，DCF利用数据流模型的细粒度异步功能来最小化同步开销。利用小批量技术降低计算和通信复杂性；使用伪边缘和多播技术来避免依赖检查的细粒度开销并减少数据移动。通过利用以上所有技术，DCF能够显着提高协作过滤的性能。我们在具有一个主节点和十个从属节点的群集上进行的实验表明，在公共数据集中，DCF在Graphlab上比ALS在ALS上提高了23 $$ 倍$$×GD在Graphlab上实现了18 $$ 倍$$×GD的加速。

著录项

来源
《International journal of parallel programming》 |2018年第4期|686-698|共13页
作者
Xiangyu Ju; Quan Chen; Zhenning Wang; Minyi Guo; Guang R. Gao;
展开▼
作者单位

Shanghai Jiao Tong University;

Shanghai Institute for Advanced Communication and Data Science, Shanghai Jiao Tong University;

Shanghai Jiao Tong University;

Shanghai Institute for Advanced Communication and Data Science, Shanghai Jiao Tong University;

University of Delaware;

展开▼
收录信息美国《科学引文索引》(SCI);美国《工程索引》(EI);
原文格式 PDF
正文语种 eng
中图分类
关键词
DCF; Dataflow; Collaborative filtering; Gradient descent; Asynchronous; Fine-grain; Parallel;

机译：DCF;数据流;协作过滤;梯度下降;异步;细粒度;并行;

相似文献

外文文献
中文文献
专利

1. DCFLA: A distributed collaborative-filtering neighbor-locating algorithm [J] . Xie B, Han P, Yang F, Information Sciences: An International Journal . 2007,第6期

机译：DCFLA：一种分布式协同过滤邻居定位算法
2. A content-boosted collaborative filtering algorithm for personalized training in interpretation of radiological imaging [J] . LinH., YangX., WangW. Journal of digital imaging: the official journal of the Society for Computer Applications in Radiology . 2014,第4期

机译：一种内容增强协作过滤算法，用于放射成像解释中的个性化培训
3. Robustness analysis of multi-criteria collaborative filtering algorithms against shilling attacks [J] . Turk Ahmet Murat, Bilge Alper Expert Systems with Application . 2019,第JANa期

机译：多准则协同过滤算法针对先兆攻击的鲁棒性分析
4. TDCF: Time Distribution Collaborative Filtering Algorithm [C] . Zhao Jiguang, Yu Xueli, Sun Jingyu International Symposium on Information Science and Engineering . 2008

机译：TDCF：时间分布协同滤波算法
5. A Comparative Study of Collaborative Filtering Recommendation Systems Using Algorithms to Impute Large Sparse Matrices. [D] . Lindo, Steven Christopher. 2016

机译：使用算法插补大稀疏矩阵的协同过滤推荐系统的比较研究。
6. A Content-Boosted Collaborative Filtering Algorithm for Personalized Training in Interpretation of Radiological Imaging [O] . Hongli Lin, Xuedong Yang, Weisheng Wang 2014

机译：一种内容增强的协同过滤算法用于放射成像解释的个性化培训
7. CoupledCF: Learning Explicit and Implicit User-item Couplings in Recommendation for Deep Collaborative Filtering [O] . Quangui Zhang, Longbing Cao, Chengzhang Zhu, 2018

机译：COMPECEDCF：在深度协同过滤的建议书中学习显式和隐式用户项目耦合

DCF: A Dataflow-Based Collaborative Filtering Training Algorithm

摘要

著录项

引文网络

相似文献

相关主题

期刊订阅