
Improving Deep Learning with a customizable GPU-like FPGA-based accelerator

Abstract

An ever-increasing number of challenging applications are being approached using Deep Learning, obtaining impressive results in a variety of different domains. However, state-of-the-art accuracy requires deep neural networks with a large number of layers and a huge number of different filters with millions of weights. GPU- and FPGA-based architectures have been proposed as a possible solution for facing this enormous demand for computing resources. In this paper, we investigate the adoption of different architectural features, i.e. the SIMD paradigm, multithreading, and non-coherent on-chip memory, for Deep Learning oriented FPGA-based accelerator designs. Experimental results on a Xilinx Virtex-7 FPGA show that the SIMD paradigm and multithreading can lead to an improvement in the execution time of up to 5× and 3.5×, respectively. A further enhancement of up to 1.75× can be obtained using a non-coherent on-chip memory.
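The abstract does not include code, but the three architectural features it evaluates can be illustrated with a short HLS-style C++ sketch. Everything below is a hypothetical illustration under stated assumptions, not the paper's actual accelerator: the function fc_layer, the sizes N_IN/N_OUT, and the unroll factor SIMD are invented for the example, and the pragmas only indicate where a high-level synthesis tool would apply SIMD-style unrolling and on-chip buffering.

```cpp
// Illustrative sketch only; names, sizes, and pragma factors are assumptions,
// not taken from the paper. Compiles as plain C++ (pragmas are ignored by g++).
#include <cstddef>

constexpr std::size_t N_IN  = 256;  // hypothetical input-vector length
constexpr std::size_t N_OUT = 64;   // hypothetical number of output neurons
constexpr std::size_t SIMD  = 8;    // hypothetical SIMD width (parallel MACs)

// One fully connected layer: out = W * in (biases omitted for brevity).
void fc_layer(const float weights[N_OUT][N_IN],
              const float in[N_IN],
              float out[N_OUT])
{
    // Non-coherent on-chip memory: stage the input vector in a local,
    // BRAM-backed buffer instead of re-reading external memory; consistency
    // is managed explicitly by the kernel, not by a coherence protocol.
    float in_buf[N_IN];
#pragma HLS ARRAY_PARTITION variable=in_buf cyclic factor=SIMD
copy_in:
    for (std::size_t i = 0; i < N_IN; ++i) {
#pragma HLS PIPELINE II=1
        in_buf[i] = in[i];
    }

rows:
    for (std::size_t o = 0; o < N_OUT; ++o) {
        float acc = 0.0f;
        // SIMD paradigm: unrolling the inner loop issues SIMD
        // multiply-accumulates per iteration; multithreading would similarly
        // overlap independent output rows.
cols:
        for (std::size_t i = 0; i < N_IN; ++i) {
#pragma HLS UNROLL factor=SIMD
            acc += weights[o][i] * in_buf[i];
        }
        out[o] = acc;
    }
}
```

Staging data in a software-managed local buffer, as in in_buf above, is one plausible reading of the non-coherent on-chip memory idea: it keeps coherence traffic off the datapath at the cost of explicit copy loops.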
