
Research on OpenCL optimization for FPGA deep learning application

Abstract

In recent years, with the development of computer science, deep learning has come to be regarded as capable of solving inference and learning problems in high-dimensional spaces, and it has therefore received unprecedented attention from both academia and industry. Compared with CPUs and GPUs, FPGAs have attracted considerable interest for deep learning because of their high energy efficiency, short development cycles, and reconfigurability. However, research on OpenCL optimization of deep learning algorithms on FPGAs remains limited, and OpenCL tools and models developed for CPUs/GPUs cannot be used directly on FPGAs. This makes it difficult for software programmers to achieve good performance when implementing deep learning algorithms on FPGAs. To address this problem, this paper proposes an OpenCL computational model based on an FPGA template architecture to optimize the time-consuming convolution layers in deep learning. A comparison between a program applying this computational model and the corresponding optimized program provided by Xilinx shows that the former achieves 8-40 times higher performance than the latter.
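
The abstract does not include implementation details, so the following is only a minimal, hypothetical sketch of the kind of OpenCL convolution kernel and loop-unrolling hint that FPGA OpenCL toolchains (such as Xilinx's) can map to pipelined, replicated hardware. The kernel name conv2d, the fixed 3x3 window, and the buffer layouts are illustrative assumptions, not the computational model proposed in the paper.

    /* Hypothetical OpenCL C sketch, not the paper's model: a 2D convolution
       over a single-channel feature map. The fixed-size window loops carry a
       standard opencl_unroll_hint so an FPGA compiler can replicate the
       multiply-accumulate logic and pipeline the work-items. */

    #define K 3  /* convolution window size (assumed) */

    __kernel void conv2d(__global const float *in,   /* input feature map, H x W */
                         __global const float *wgt,  /* K x K filter weights */
                         __global float *out,        /* output, (H-K+1) x (W-K+1) */
                         const int width)            /* input row stride W */
    {
        const int x = get_global_id(0);              /* output column */
        const int y = get_global_id(1);              /* output row */
        float acc = 0.0f;

        __attribute__((opencl_unroll_hint(K)))
        for (int ky = 0; ky < K; ky++) {
            __attribute__((opencl_unroll_hint(K)))
            for (int kx = 0; kx < K; kx++) {
                acc += in[(y + ky) * width + (x + kx)] * wgt[ky * K + kx];
            }
        }

        out[y * (width - K + 1) + x] = acc;
    }

Fully unrolling the fixed window lets an FPGA compiler instantiate K x K parallel multipliers per work-item, which is the general direction of the loop-level optimization the abstract attributes to the convolution layer.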
