Implementation and Performance Evaluation of Recommender Algorithms on Multi-core/Many-core Platforms

Chen Jing; Fang Jianbin; Tang Tao; Yang Canqun

Abstract

This paper designs and implements two classic recommender-system algorithms, alternating least squares (ALS) and cyclic coordinate descent (CCD), in OpenCL. Both are evaluated on multi-core and many-core platforms (Intel CPUs, NVIDIA GPUs, and Intel MIC), and two factors that affect their performance are investigated: the latent feature dimension and the number of threads. The OpenCL implementations are also compared with CUDA and OpenMP implementations. The experimental results show that, under the same conditions, CCD is more accurate, converges faster, and behaves more stably than ALS, but takes longer to run. On the same platform, the OpenCL implementations perform no worse than their counterparts: on the GPU, training is slightly faster than with the CUDA implementation (1.03x for CCD and 1.2x for ALS), and on the CPU it is about 1.6 to 1.7 times faster than the OpenMP implementation with 16 threads. Across platforms, the OpenCL implementations of both algorithms perform better on the CPU than on the GPU or the MIC.
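The abstract names ALS and CCD but does not show how their updates differ. The following is a minimal single-threaded sketch in plain Python, assuming the common regularized matrix-factorization objective over observed ratings. It is not the paper's OpenCL code: the toy data `RATINGS`, the regularization weight `lam`, and every function name are hypothetical choices made for illustration.

```python
import random

# Hypothetical toy data: (user, item, rating) triples for 4 users and 3 items.
RATINGS = [(0, 0, 5.0), (0, 1, 3.0), (1, 0, 4.0), (1, 2, 1.0),
           (2, 1, 2.0), (2, 2, 5.0), (3, 0, 1.0), (3, 2, 4.0)]

def dot(a, b):
    return sum(x * y for x, y in zip(a, b))

def objective(ratings, W, H, lam):
    """Squared error over observed ratings plus L2 regularization (assumed form)."""
    err = sum((r - dot(W[i], H[j])) ** 2 for i, j, r in ratings)
    reg = sum(x * x for row in W + H for x in row)
    return err + lam * reg

def solve(A, b):
    """Solve A x = b by Gaussian elimination with partial pivoting."""
    n = len(b)
    M = [row[:] + [b[i]] for i, row in enumerate(A)]
    for c in range(n):
        p = max(range(c, n), key=lambda r: abs(M[r][c]))
        M[c], M[p] = M[p], M[c]
        for r in range(c + 1, n):
            f = M[r][c] / M[c][c]
            for k in range(c, n + 1):
                M[r][k] -= f * M[c][k]
    x = [0.0] * n
    for c in reversed(range(n)):
        tail = sum(M[c][k] * x[k] for k in range(c + 1, n))
        x[c] = (M[c][n] - tail) / M[c][c]
    return x

def als_half_step(obs_by_row, X, Y, lam):
    """ALS: replace each row of X by its exact least-squares solution, Y fixed."""
    k = len(Y[0])
    for i, obs in obs_by_row.items():
        A = [[lam * (s == t) for t in range(k)] for s in range(k)]  # lam * I
        b = [0.0] * k
        for j, r in obs:
            for s in range(k):
                b[s] += r * Y[j][s]
                for t in range(k):
                    A[s][t] += Y[j][s] * Y[j][t]
        X[i] = solve(A, b)  # one k-by-k linear solve per row

def ccd_half_step(obs_by_row, X, Y, lam):
    """CCD: minimize over one scalar X[i][t] at a time, everything else fixed."""
    k = len(Y[0])
    for i, obs in obs_by_row.items():
        for t in range(k):
            # Residual with the contribution of coordinate t added back.
            num = sum((r - dot(X[i], Y[j]) + X[i][t] * Y[j][t]) * Y[j][t]
                      for j, r in obs)
            den = lam + sum(Y[j][t] ** 2 for j, _ in obs)
            X[i][t] = num / den

def group(ratings, key, other):
    """Index observations by user (key=0) or by item (key=1)."""
    out = {}
    for rec in ratings:
        out.setdefault(rec[key], []).append((rec[other], rec[2]))
    return out

def train(ratings, n_users, n_items, k, lam, sweeps, half_step, seed=0):
    """Alternate user-side and item-side updates; return factors and objective history."""
    rng = random.Random(seed)
    W = [[rng.uniform(0.1, 1.0) for _ in range(k)] for _ in range(n_users)]
    H = [[rng.uniform(0.1, 1.0) for _ in range(k)] for _ in range(n_items)]
    by_user, by_item = group(ratings, 0, 1), group(ratings, 1, 0)
    history = [objective(ratings, W, H, lam)]
    for _ in range(sweeps):
        half_step(by_user, W, H, lam)
        half_step(by_item, H, W, lam)
        history.append(objective(ratings, W, H, lam))
    return W, H, history
```

Calling `train(RATINGS, 4, 3, 2, 0.1, 10, als_half_step)` or the same with `ccd_half_step` returns the factor matrices and the objective value after each sweep. In this sketch, each ALS row update solves a k-by-k linear system, while CCD performs only scalar updates, which gives one rough intuition for why the latent feature dimension `k` could matter for running time. The rows are updated independently within a half-step, which is the kind of parallelism a multi-threaded implementation would presumably exploit.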

Bibliographic record

Source
Computer Science (《计算机科学》) | 2017, Issue 10 | pp. 71-74 | 4 pages
Authors
Chen Jing; Fang Jianbin; Tang Tao; Yang Canqun
Author affiliations

College of Computer, National University of Defense Technology, Changsha 410073 (all four authors)

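Original format: PDF
Language of full text: Chinese
Chinese Library Classification: Programming; software engineering
Keywords
Recommender systems; OpenCL; ALS; CCD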

