Concurrent MAC unit design using VHDL for deep learning networks on FPGA

IEEE Symposium on Computer Applications and Industrial Electronics

Abstract

Deep neural network algorithms have proven their enormous capabilities in a wide range of artificial intelligence applications, especially in printed/handwritten text recognition, multimedia processing, robotics and many other high-end technological trends. The most challenging aspect nowadays is meeting the extreme computational demands of such algorithms, especially in real-time systems. Recently, the Field Programmable Gate Array (FPGA) has been considered one of the optimal hardware accelerator platforms for deep neural network architectures due to its great adaptability and the high degree of parallelism it offers. In this paper, the proposed 8-bit fixed-point parallel multiply-accumulate (MAC) unit architecture aims to provide a fully customized MAC unit for Convolutional Neural Networks (CNNs) instead of depending on the conventional DSP blocks and embedded memory units of the FPGA silicon fabric. The proposed 8-bit fixed-point parallel MAC unit is designed in VHDL and can achieve a computational throughput of up to 4.17 giga operations per second (GOPS) on high-density FPGAs.
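The paper's source code is not reproduced here, but the following is a minimal VHDL sketch of a single 8-bit signed fixed-point MAC stage of the kind the abstract describes, built from plain logic rather than DSP blocks. The entity name, port names and the 24-bit accumulator width are illustrative assumptions, not taken from the paper.

-- Minimal sketch of one 8-bit fixed-point MAC stage (assumed interface).
library ieee;
use ieee.std_logic_1164.all;
use ieee.numeric_std.all;

entity mac8 is
    port (
        clk     : in  std_logic;
        rst     : in  std_logic;              -- synchronous clear of the accumulator
        en      : in  std_logic;              -- accumulate-enable strobe
        a, b    : in  signed(7 downto 0);     -- 8-bit fixed-point operands (e.g. pixel, weight)
        acc_out : out signed(23 downto 0)     -- running sum of products
    );
end entity mac8;

architecture rtl of mac8 is
    signal acc : signed(23 downto 0) := (others => '0');
begin
    process(clk)
    begin
        if rising_edge(clk) then
            if rst = '1' then
                acc <= (others => '0');
            elsif en = '1' then
                -- 8x8 signed multiply yields 16 bits; sign-extend before accumulating
                acc <= acc + resize(a * b, acc'length);
            end if;
        end if;
    end process;
    acc_out <= acc;
end architecture rtl;

In a parallel CNN datapath, several such units would be instantiated side by side, one per kernel tap or output channel, with their accumulators summed or read out after the convolution window completes.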